{"id":79,"date":"2013-04-04T01:24:42","date_gmt":"2013-04-04T01:24:42","guid":{"rendered":"http:\/\/uni.hi.is\/hfv3\/?page_id=79"},"modified":"2021-03-28T16:21:42","modified_gmt":"2021-03-28T16:21:42","slug":"iii-materials","status":"publish","type":"page","link":"https:\/\/uni.hi.is\/hfv3\/iii-materials\/","title":{"rendered":"Materials"},"content":{"rendered":"<p>An overview of (additional) corpora used jointly in the LCLV19 project can be found <a href=\"https:\/\/www.arnastofnun.is\/is\/language-change-and-linguistic-variation-19th-century-icelandic-and-emergence-national-standard\">here<\/a>. These materials include:<\/p>\n<ul>\n<li>An electronic diplomatic\/facsimile edition of <em>nineteenth- and early twentieth-century private letters<\/em> (approx. 1 million words).<\/li>\n<li>An electronic corpus of <em>newspapers and periodicals<\/em> (approx. 1.4 million words).<\/li>\n<li>The <em>Icelandic<\/em> <em>parsed historical corpus<\/em> (<a href=\"http:\/\/www.linguist.is\/icelandic_treebank\/Icelandic_Parsed_Historical_Corpus_%28IcePaHC%29\">IcePaHC<\/a>), mainly comprised of narrative and religious prose\/fiction (1 million words, approx. 100,000 per century).<\/li>\n<\/ul>\n<p>Corpora (being) developed as part of the present PhD project:<\/p>\n<ul>\n<li><em>Icelandic Corpus of Early Nineteenth-Century Correspondence<\/em> (<a href=\"https:\/\/github.com\/heimirfreyr\/ICENCC\">ICENCC<\/a>). An electronic rendition of a collection of diplomatic and semi-normalised editions of private letters, produced using <a href=\"https:\/\/code.google.com\/p\/tesseract-ocr\/\">Google Tesseract-OCR<\/a>, with <a href=\"http:\/\/bin.arnastofnun.is\/skrambi\/\">Skrambi<\/a> for post-correction. ICENCC was intended to extend the <a href=\"http:\/\/www.arnastofnun.is\/page\/LCLV19_sources\">LCLV19 letter corpus<\/a> with data up to the middle of the nineteenth century and currently consists of 670 letters written by 26 scribes (approx. 425,000 words). 
The text of the letters is available on <a href=\"https:\/\/github.com\/heimirfreyr\/ICENCC\">GitHub<\/a>, along with a letter inventory.<\/li>\n<li><em>Corpus of Reykjav\u00edk Grammar School Essays<\/em> (<a href=\"https:\/\/github.com\/heimirfreyr\/RLSS\/tree\/master\/transcriptions\/1847-1848\">1847-48<\/a>, 1852, 1855, 1860-61, 1875, 1890). A partial, experimental XML\/TEI-based version of the text (1847-48), together with the corrections of grammar, punctuation and style made by the teacher(s), transcribed using the <a href=\"http:\/\/www.histei.info\/p\/home.html\">HisTEI<\/a> framework (see examples\/screenshots below). The transcriptions are freely available on <a href=\"https:\/\/github.com\/heimirfreyr\/RLSS\">GitHub<\/a>, along with the photographed samples from 1852, 1855, 1860-61, 1875 and 1890. In addition, a sample Corpus of Corrections based on this material will soon be available in CSV format, along with references to the photographs.<\/li>\n<\/ul>\n<p style=\"padding-left: 60px\">\u2013 <em>Feel free to join in! Over a thousand pages on GitHub are waiting to be transcribed.<br \/>\n<\/em><\/p>\n<p>&nbsp;<\/p>\n<h2 style=\"text-align: center\">Sample screenshots of coded\/transcribed data<br \/>\n(oXygen XML Editor, HisTEI)<\/h2>\n<h3><\/h3>\n<h3 style=\"text-align: left\">I. 
Person list (student id)<\/h3>\n<p>&nbsp;<\/p>\n<h6><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-371 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile.png\" alt=\"\" width=\"617\" height=\"192\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile.png 759w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile-300x93.png 300w\" sizes=\"auto, (max-width: 617px) 100vw, 617px\" \/><\/a><\/h6>\n<blockquote><p>&lt;person xml:id=\"person_nth_yzd_4p\"&gt; &lt;persName&gt; &lt;forename&gt;\u00c1rni&lt;\/forename&gt; &lt;surname&gt;Bjarnason&lt;\/surname&gt; &lt;surname&gt;Thorsteinsson&lt;\/surname&gt; &lt;\/persName&gt; &lt;sex value=\"1\"\/&gt; &lt;occupation when=\"1896\" cert=\"high\"&gt;landf\u00f3geti \u00ed Rv\u00edk. R. og dbrm Kgk. al\u00fem.&lt;\/occupation&gt; &lt;education when=\"1847\" cert=\"high\"&gt;II&lt;\/education&gt; &lt;birth when=\"1828-04-05\"&gt;<\/p><\/blockquote>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">II. Page transcription (hand mark-up, student id)<\/h3>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/transcription.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-370 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/transcription.png\" alt=\"\" width=\"772\" height=\"235\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/transcription.png 933w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/transcription-300x91.png 300w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/transcription-768x234.png 768w\" sizes=\"auto, (max-width: 772px) 100vw, 772px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<blockquote><p>&lt;facsimile xml:base=\"transcriptions\/1847-1848\/\"&gt; &lt;graphic mimeType=\"image\/jpeg\" url=\"IMAG0613.jpg\" xml:id=\"image_046\"\/&gt;<\/p><\/blockquote>\n<h3><\/h3>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">III. 
Page transcription (overwriting and correcting\/underlining)<\/h3>\n<p>&nbsp;<\/p>\n<h6><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-369 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction.png\" alt=\"\" width=\"676\" height=\"114\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction.png 1276w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction-300x51.png 300w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction-768x129.png 768w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction-1024x173.png 1024w\" sizes=\"auto, (max-width: 676px) 100vw, 676px\" \/><\/a><\/h6>\n<p>Line #1: student overwriting own text; line #3: teacher correcting by underlining.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-368 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correctionHisTEI.png\" alt=\"\" width=\"590\" height=\"217\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correctionHisTEI.png 672w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correctionHisTEI-300x110.png 300w\" sizes=\"auto, (max-width: 590px) 100vw, 590px\" \/><\/p>\n<p>&nbsp;<\/p>\n<blockquote><p>h\u00e6ttulegt s\u00e9 a\u00f0 l\u00e1ta \u00feau f\u00e1 of miki\u00f0 vald yfir&lt;lb break=\"yes\"\/&gt; &lt;del hand=\"#teacher\" cause=\"fix\" confidence=\"1\" rend=\"underlining\" status=\"correction\" type=\"case-marking\"&gt;sig&lt;\/del&gt;<\/p><\/blockquote>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">IV. 
Page transcription (reordering by numbering in text)<\/h3>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction2.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-367 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction2.jpg\" alt=\"\" width=\"501\" height=\"82\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction2.jpg 935w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction2-300x49.jpg 300w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/correction2-768x126.jpg 768w\" sizes=\"auto, (max-width: 501px) 100vw, 501px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<blockquote><p>\u00a0og \u00fea\u00f0 \u00f3gna\u00f0i&lt;lb break=\"yes\"\/&gt; \u00feeim me\u00f0 &lt;seg xml:id=\"bk01\"&gt;gu\u00f0s&lt;\/seg&gt; &lt;metamark function=\"transposition\" target=\"#bk01\" place=\"above\"&gt;2.&lt;\/metamark&gt; &lt;seg xml:id=\"bk02\"&gt;rei\u00f0i&lt;\/seg&gt; &lt;metamark function=\"transposition\" target=\"#bk02\" place=\"above\"&gt;1.&lt;\/metamark&gt; &lt;listTranspose&gt; &lt;transpose&gt;&lt;ptr target=\"#bk02\"\/&gt;&lt;ptr target=\"#bk01\"\/&gt;&lt;\/transpose&gt; &lt;\/listTranspose&gt;<\/p><\/blockquote>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">V. Setting up document (student drop-down menu)<\/h3>\n<p>&nbsp;<\/p>\n<p><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2.png\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-373 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2.png\" alt=\"\" width=\"646\" height=\"465\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2.png 1037w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2-300x216.png 300w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2-768x553.png 768w, https:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/personfile2-1024x738.png 1024w\" sizes=\"auto, (max-width: 646px) 100vw, 646px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">VI. 
Further examples<\/h3>\n<p>&nbsp;<\/p>\n<p style=\"text-align: center\"><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2014\/01\/IMAG0716.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-257 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2014\/01\/IMAG0716.jpg\" alt=\"Sk\u00f3last\u00edll (1855)\" width=\"295\" height=\"394\" srcset=\"https:\/\/uni.hi.is\/hfv3\/files\/2014\/01\/IMAG0716.jpg 1836w, https:\/\/uni.hi.is\/hfv3\/files\/2014\/01\/IMAG0716-225x300.jpg 225w, https:\/\/uni.hi.is\/hfv3\/files\/2014\/01\/IMAG0716-768x1024.jpg 768w\" sizes=\"auto, (max-width: 295px) 100vw, 295px\" \/><\/a>A sample student essay (1855).<\/p>\n<p>&nbsp;<\/p>\n<p style=\"text-align: center\"><a href=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/madur.png\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-372 aligncenter\" src=\"http:\/\/uni.hi.is\/hfv3\/files\/2018\/07\/madur.png\" alt=\"\" width=\"600\" height=\"70\" \/><\/a>A correction of the generic pronoun <em>ma\u00f0ur<\/em>: the teacher underlines it twice (see Vi\u00f0arsson 2017).<\/p>\n<p>&nbsp;<\/p>\n<blockquote><p>It would be interesting to know how Halld\u00f3r Kr. Fri\u00f0riksson guided students in writing their mother tongue, and what it was that he emphasised correcting in their work. It is in fact said that there were few things he disliked as much as the word <em>ma\u00f0ur<\/em> being used as an indefinite pronoun, as in Danish. (Kjartan G. Ott\u00f3sson 1990:96)<\/p><\/blockquote>\n<p>&nbsp;<\/p>\n<h3 style=\"text-align: left\">VII. 
GitHub repository (1847-48 transcriptions; 1852, '55, '60-'61, '75, '90)<\/h3>\n<p style=\"text-align: left\">Please visit <a href=\"https:\/\/github.com\/heimirfreyr\/RLSS\">GitHub<\/a> to access the repository.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>An overview of (additional) corpora used jointly in the LCLV19 project can be found here. These materials include: An electronic diplomatic\/facsimile edition of nineteenth- and early twentieth-century private letters (approx. 1 million words). An electronic corpus of newspapers and periodicals (approx. 1.4 million words). The Icelandic parsed historical corpus (IcePaHC), mainly comprised of narrative and [&hellip;]<\/p>\n","protected":false},"author":1179,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-79","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/pages\/79","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/users\/1179"}],"replies":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/comments?post=79"}],"version-history":[{"count":69,"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/pages\/79\/revisions"}],"predecessor-version":[{"id":491,"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/pages\/79\/revisions\/491"}],"wp:attachment":[{"href":"https:\/\/uni.hi.is\/hfv3\/wp-json\/wp\/v2\/media?parent=79"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}