{"id":3081,"date":"2020-10-24T08:00:22","date_gmt":"2020-10-24T08:00:22","guid":{"rendered":"https:\/\/uni.hi.is\/eirikur\/?p=3081"},"modified":"2020-10-21T16:39:42","modified_gmt":"2020-10-21T16:39:42","slug":"maltaekni","status":"publish","type":"post","link":"https:\/\/uni.hi.is\/eirikur\/2020\/10\/24\/maltaekni\/","title":{"rendered":"M\u00e1lt\u00e6kni"},"content":{"rendered":"<p><em>M\u00e1lt\u00e6kni<\/em> er tilt\u00f6lulega n\u00fdlegt or\u00f0 \u00ed \u00edslensku \u2013 \u00fe\u00fd\u00f0ing \u00e1 \u00fev\u00ed sem \u00e1 ensku nefnist <em>language technology<\/em>. Einnig hefur or\u00f0i\u00f0 <em>tungut\u00e6kni<\/em> veri\u00f0 nota\u00f0 um sama hugtak. \u00cd stuttu m\u00e1li m\u00e1 segja a\u00f0 me\u00f0 m\u00e1lt\u00e6kni s\u00e9 \u00e1tt vi\u00f0 hvers kyns samvinnu tungum\u00e1ls og t\u00f6lvut\u00e6kni sem hefur einhvern hagn\u00fdtan tilgang; beinist a\u00f0 \u00fev\u00ed a\u00f0 hanna e\u00f0a \u00fatb\u00faa einhvern hugb\u00fana\u00f0 e\u00f0a t\u00e6ki sem n\u00fdtist m\u00f6nnum \u00ed starfi e\u00f0a leik. \u00deessi samvinna hefur tv\u00e6r hli\u00f0ar og felst annars vegar \u00ed notkun t\u00f6lvut\u00e6kninnar \u00ed \u00fe\u00e1gu tungum\u00e1lsins; hins vegar \u00ed notkun tungum\u00e1lsins \u00ed \u00fe\u00e1gu t\u00f6lvut\u00e6kninnar.<\/p>\n<p>\u00dea\u00f0 er h\u00e6gt a\u00f0 n\u00fdta t\u00f6lvu- og uppl\u00fdsingat\u00e6kni \u00e1 \u00fdmsan h\u00e1tt til \u00feess a\u00f0 au\u00f0velda m\u00f6nnum a\u00f0 nota tungum\u00e1li\u00f0. \u00dear m\u00e1 nefna \u00fdmiss konar lei\u00f0r\u00e9ttingarforrit fyrir stafsetningu og m\u00e1lfar. Sl\u00edkur b\u00fana\u00f0ur fylgir til d\u00e6mis algengum forritap\u00f6kkum eins og Microsoft Office og LibreOffice \u00e1 \u00fdmsum tungum\u00e1lum. Einnig er h\u00e6gt a\u00f0 s\u00e6kja vi\u00f0b\u00e6tur af \u00feessu tagi fyrir \u00fdmsa vafra. \u00cdslensk stafsetningarlei\u00f0r\u00e9ttingarforrit eru til, svo sem <a href=\"https:\/\/www.puki.is\/\">P\u00faki<\/a> og <a href=\"http:\/\/skrambi.arnastofnun.is\/\">Skrambi<\/a>, en ekkert m\u00e1lfr\u00e6\u00f0ilei\u00f0r\u00e9ttingarforrit er til fyrir \u00edslensku.<\/p>\n<p>H\u00e9r m\u00e1 einnig telja \u00fdmiss konar hj\u00e1lpart\u00e6ki handa f\u00f3lki sem \u00e1 erfitt me\u00f0 m\u00e1l e\u00f0a lestur s\u00f6kum einhvers konar f\u00f6tlunar. <em>Talgervill<\/em>, sem er b\u00fana\u00f0ur sem les upp rita\u00f0an texta, var fyrst ger\u00f0ur fyrir \u00edslensku um 1990 en <a href=\"https:\/\/www.blind.is\/is\/thjonusta\/talgervill\">n\u00fdjasti talgervillinn<\/a> kom \u00e1 marka\u00f0inn 2012. Hann var ger\u00f0ur \u00e1 vegum Blindraf\u00e9lagsins og b\u00fdr yfir tveimur r\u00f6ddum, karlmannsr\u00f6dd sem nefnist Karl og kvenmannsr\u00f6dd sem nefnist D\u00f3ra.<\/p>\n<p><em>Talgreinir<\/em> breytir t\u00f6lu\u00f0u m\u00e1li \u00ed rita\u00f0an texta. Sl\u00edkur b\u00fana\u00f0ur fyrir \u00edslensku var ger\u00f0ur \u00e1ri\u00f0 2012 \u00ed samvinnu Google vi\u00f0 \u00edslenska a\u00f0ila og er n\u00fa \u00ed s\u00edmum me\u00f0 Android-st\u00fdrikerfi og \u00ed Google Chrome-vafranum. H\u00e6gt er a\u00f0 nota talgreininn vi\u00f0 leit \u00e1 netinu, til a\u00f0 skrifa sm\u00e1skilabo\u00f0 og t\u00f6lvup\u00f3st, minnisatri\u00f0i og fleira. Einnig er h\u00e6gt a\u00f0 pr\u00f3fa <a href=\"https:\/\/tal.ru.is\/\">talgreini<\/a> \u00e1 vef H\u00e1sk\u00f3lans \u00ed Reykjav\u00edk. Sl\u00edkur b\u00fana\u00f0ur getur vitaskuld n\u00fdst \u00f6llum m\u00e1lnotendum en ekki s\u00edst f\u00f3lki sem eru hreyfihamla\u00f0 og \u00e1 erfitt me\u00f0 a\u00f0 nota lyklabor\u00f0 til a\u00f0 rita texta.<\/p>\n<p>Eitt veigamesta svi\u00f0 m\u00e1lt\u00e6kni eru <em>v\u00e9lr\u00e6nar \u00fe\u00fd\u00f0ingar<\/em>, \u00fear sem hugb\u00fana\u00f0ur er nota\u00f0ur til a\u00f0 \u00fe\u00fd\u00f0a texta af einu m\u00e1li \u00e1 anna\u00f0. <a href=\"http:\/\/translate.google.com\/\">Google Translate<\/a> er \u00feekktasti b\u00fana\u00f0urinn \u00e1 \u00feessu svi\u00f0i og getur \u00fe\u00fdtt milli fj\u00f6lda tungum\u00e1la, \u00fear \u00e1 me\u00f0al milli \u00edslensku og annarra m\u00e1la. G\u00e6\u00f0i \u00fe\u00fd\u00f0inganna eru misj\u00f6fn en fara vaxandi eftir \u00fev\u00ed sem b\u00fana\u00f0urinn er lengur \u00ed notkun og hefur fleiri g\u00f6gn til a\u00f0 l\u00e6ra af. Ekkert gott \u00fe\u00fd\u00f0ingarforrit hefur enn veri\u00f0 \u00fer\u00f3a\u00f0 fyrir \u00edslensku.<\/p>\n<p>En tungum\u00e1li\u00f0 er ekki bara \u00feiggjandi \u00ed samvinnu vi\u00f0 t\u00f6lvut\u00e6knina. \u00dea\u00f0 er l\u00edka nota\u00f0 \u00e1 margv\u00edslegan h\u00e1tt til a\u00f0 gera t\u00e6knina a\u00f0gengilegri og au\u00f0velda m\u00f6nnum a\u00f0 n\u00fdta s\u00e9r hana. \u00dear m\u00e1 nefna \u00fdmiss konar \u00fej\u00f3nustuver \u00fear sem t\u00f6lva hlustar \u00e1 erindi notandans og greinir merkingu \u00feess. S\u00fa greining er s\u00ed\u00f0an send til gagnabanka, \u00fear sem er a\u00f0 finna sv\u00f6r vi\u00f0 margv\u00edslegum fyrirspurnum, og vi\u00f0eigandi svar s\u00f3tt \u00ed bankann. \u00dev\u00ed svari er svo breytt \u00ed e\u00f0lilega setningu og h\u00fan send til t\u00f6lvub\u00fana\u00f0ar sem les hana fyrir notandann. \u00deetta ferli er alsj\u00e1lfvirkt og byggist \u00e1 margv\u00edslegri og fl\u00f3kinni greiningu \u00e1 tali notandans; hlj\u00f3\u00f0greiningu, or\u00f0greiningu, setningagreiningu, merkingargreiningu og fleira.<\/p>\n<p>Einnig m\u00e1 nefna notkun m\u00e1lsins vi\u00f0 stj\u00f3rn t\u00f6lva og \u00fdmiss konar t\u00f6lvust\u00fdr\u00f0ra t\u00e6kja. \u00dea\u00f0 fer mj\u00f6g \u00ed v\u00f6xt a\u00f0 sl\u00edkum t\u00e6kjum s\u00e9 stj\u00f3rna\u00f0 me\u00f0 venjulegu m\u00e1li, anna\u00f0 hvort ritu\u00f0u e\u00f0a t\u00f6lu\u00f0u. Skipanir eru \u00fe\u00e1 \u00fdmist slegnar inn \u00e1 lyklabor\u00f0 e\u00f0a tala\u00f0ar \u00ed hlj\u00f3\u00f0nema, \u00ed sta\u00f0 \u00feess a\u00f0 \u00fdta \u00e1 takka e\u00f0a velja kost \u00ed valmynd. \u00deetta mun \u00e1 n\u00e6stunni taka til s\u00edfellt fj\u00f6lbreyttari t\u00e6kja, svo sem \u00fdmiss konar framlei\u00f0slut\u00e6kja, heimilist\u00e6kja og b\u00edla. En sl\u00edk t\u00e6ki skilja yfirleitt ekki \u00edslensku \u2013 enn sem komi\u00f0 er.<\/p>\n<p>Til a\u00f0 t\u00f6lvur og t\u00e6ki skilji \u00edslensku sl\u00edkt \u00fearf a\u00f0 byggja upp \u00feekkingargrunna sem hafa a\u00f0 geyma margv\u00edslegar og n\u00e1kv\u00e6mar uppl\u00fdsingar um tungum\u00e1li\u00f0. Til a\u00f0 h\u00e6gt s\u00e9 a\u00f0 \u00fer\u00f3a forrit til m\u00e1lfarslei\u00f0r\u00e9ttingar \u00fearf til d\u00e6mis a\u00f0 liggja fyrir n\u00e1kv\u00e6m og \u00edtarleg greining \u00e1 \u00edslenskri setningager\u00f0 \u2013 mun n\u00e1kv\u00e6mari og \u00edtarlegri en finna m\u00e1 \u00ed handb\u00f3kum og kennslub\u00f3kum. \u00dea\u00f0 er ekki h\u00e6gt a\u00f0 \u00fatb\u00faa lei\u00f0r\u00e9ttingarforrit nema skr\u00e1 n\u00e1kv\u00e6mlega hva\u00f0a setningager\u00f0ir eru leyfilegar \u00ed m\u00e1linu og hverjar ekki og jafnframt semja l\u00fdsingu \u00e1 \u00fev\u00ed hvernig eigi a\u00f0 lagf\u00e6ra \u00fea\u00f0 sem betur m\u00e1 fara.<\/p>\n<p>Sprenging \u00ed hagn\u00fdtingu gervigreindar og v\u00e9lr\u00e6ns n\u00e1ms \u00e1 s\u00ed\u00f0ustu \u00e1rum hefur leitt til \u00feess a\u00f0 mikilv\u00e6gasta forsenda \u00feess a\u00f0 \u00fer\u00f3a m\u00e1lt\u00e6knib\u00fana\u00f0 er n\u00fa gr\u00ed\u00f0arst\u00f3r m\u00e1lleg gagnas\u00f6fn \u2013 or\u00f0as\u00f6fn, textas\u00f6fn, hlj\u00f3\u00f0s\u00f6fn og fleira. \u00de\u00e6r a\u00f0fer\u00f0ir sem n\u00fa eru mest nota\u00f0ar byggjast \u00e1 \u00fev\u00ed a\u00f0 t\u00f6lvur eru l\u00e1tnar lesa gr\u00ed\u00f0arlega miki\u00f0 af g\u00f6gnum og l\u00e6ra af \u00feeim \u2013 finna \u00ed \u00feeim mynstur sem \u00fe\u00e6r geta s\u00ed\u00f0an nota\u00f0 til a\u00f0 byggja upp \u00feekkingargrunna um tungum\u00e1li\u00f0. \u00deessa \u00feekkingargrunna er svo aftur h\u00e6gt a\u00f0 n\u00fdta \u00ed ger\u00f0 margs kyns hugb\u00fana\u00f0ar til m\u00e1lvinnslu, svo sem lei\u00f0r\u00e9ttingab\u00fana\u00f0ar, \u00fe\u00fd\u00f0ingaforrita, talgervla, talgreina og svo framvegis.<\/p>\n<p>Uppbyggingarstarf \u00ed m\u00e1lt\u00e6kni er d\u00fdrt. \u00dea\u00f0 kostar jafnmiki\u00f0 a\u00f0 koma upp m\u00e1lt\u00e6kni fyrir \u00edslensku og fyrir tungum\u00e1l millj\u00f3na\u00fej\u00f3\u00f0a. Margs konar m\u00e1lt\u00e6knib\u00fana\u00f0ur er vissulega g\u00f3\u00f0 marka\u00f0svara og skilar miklum tekjum sem standa undir h\u00e1um \u00fer\u00f3unarkostna\u00f0i \u2013 ef marka\u00f0urinn er n\u00f3gu st\u00f3r. En \u00fev\u00ed er ekki a\u00f0 heilsa \u00e1 \u00cdslandi. Vegna sm\u00e6\u00f0ar marka\u00f0arins er lj\u00f3st a\u00f0 \u00fea\u00f0 ver\u00f0ur seint ar\u00f0v\u00e6nlegt a\u00f0 \u00fer\u00f3a d\u00fdran m\u00e1lt\u00e6knib\u00fana\u00f0 fyrir \u00edslensku. Vilji \u00cdslendingar a\u00f0 \u00edslenska s\u00e9 noth\u00e6f innan t\u00f6lvu- og uppl\u00fdsingat\u00e6kninnar \u00fearf opinber stu\u00f0ningur vi\u00f0 \u00fer\u00f3unarstarf a\u00f0 koma til.<\/p>\n<p>\u00deegar mikilv\u00e6gi m\u00e1lt\u00e6kni fyrir \u00edslensku er meti\u00f0 ver\u00f0ur a\u00f0 l\u00edta til \u00feess a\u00f0 uppl\u00fdsingat\u00e6knin er or\u00f0in mikilv\u00e6gur \u00fe\u00e1ttur \u00ed daglegu l\u00edfi alls almennings \u00ed landinu. Ef ekki ver\u00f0ur h\u00e6gt a\u00f0 nota \u00edslensku \u00e1 \u00f6llum svi\u00f0um uppl\u00fdsingat\u00e6kninnar kemur upp splunkun\u00fd sta\u00f0a, sem ekki \u00e1 s\u00e9r hli\u00f0st\u00e6\u00f0u fyrr \u00ed m\u00e1ls\u00f6gunni. \u00de\u00e1 ver\u00f0ur or\u00f0i\u00f0 til mikilv\u00e6gt svi\u00f0 \u00ed daglegu l\u00edfi venjulegs f\u00f3lks, \u00fear sem m\u00f3\u00f0urm\u00e1li\u00f0 er gagnsl\u00edti\u00f0 e\u00f0a \u00f3noth\u00e6ft. Hva\u00f0a \u00e1hrif hef\u00f0i sl\u00edkt umd\u00e6mistap \u00e1 m\u00e1lnotendur og m\u00e1lsamf\u00e9lagi\u00f0? Hva\u00f0 g\u00e6ti gerst ef m\u00f3\u00f0urm\u00e1li\u00f0 yr\u00f0i ekki lengur noth\u00e6ft \u00ed n\u00fdrri t\u00e6kni og \u00f6\u00f0ru sem er n\u00fdtt og spennandi; \u00e1 svi\u00f0um \u00fear sem n\u00fdsk\u00f6pun af \u00fdmsu tagi \u00e1 s\u00e9r sta\u00f0; og \u00e1 svi\u00f0um \u00fear sem n\u00fd atvinnut\u00e6kif\u00e6ri bj\u00f3\u00f0ast?<\/p>\n<p>En \u00edslensk m\u00e1lt\u00e6kni hefur ekki eing\u00f6ngu gildi fyrir tungum\u00e1li\u00f0 og var\u00f0veislu \u00feess. M\u00e1lnotendurnir og hagsmunir \u00feeirra skipta ekki s\u00ed\u00f0ur m\u00e1li. \u00dea\u00f0 er mannr\u00e9ttindam\u00e1l a\u00f0 geta nota\u00f0 m\u00f3\u00f0urm\u00e1li\u00f0 \u00e1 \u00f6llum svi\u00f0um daglegs l\u00edfs, b\u00e6\u00f0i \u00ed starfi og leik \u2013 l\u00edka innan uppl\u00fdsingat\u00e6kninnar. Til a\u00f0 svo megi ver\u00f0a \u00fearf allur algengur hugb\u00fana\u00f0ur a\u00f0 vera \u00e1 \u00edslensku, lei\u00f0r\u00e9ttingarhugb\u00fana\u00f0ur fyrir \u00edslenskan texta \u00fearf a\u00f0 vera til, \u00fea\u00f0 \u00fearf a\u00f0 vera h\u00e6gt a\u00f0 tala vi\u00f0 \u00fdmis t\u00f6lvust\u00fdr\u00f0 t\u00e6ki \u00e1 \u00edslensku, til \u00feurfa a\u00f0 vera \u00fe\u00fd\u00f0ingarforrit sem geta \u00fe\u00fdtt milli \u00edslensku og annarra m\u00e1la, og m\u00e1lnotendur \u00feurfa a\u00f0 eiga a\u00f0gang a\u00f0 hugb\u00fana\u00f0i sem getur unni\u00f0 fl\u00f3knar uppl\u00fdsingar \u00far texta- og gagnas\u00f6fnum og leita\u00f0 \u00ed \u00feeim \u00e1 margv\u00edslegan h\u00e1tt. Enn vantar miki\u00f0 upp \u00e1 a\u00f0 \u00feessi markmi\u00f0 n\u00e1ist.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>M\u00e1lt\u00e6kni er tilt\u00f6lulega n\u00fdlegt or\u00f0 \u00ed \u00edslensku \u2013 \u00fe\u00fd\u00f0ing \u00e1 \u00fev\u00ed sem \u00e1 ensku nefnist language technology. Einnig hefur or\u00f0i\u00f0 tungut\u00e6kni veri\u00f0 nota\u00f0 um sama hugtak. \u00cd stuttu m\u00e1li m\u00e1 segja a\u00f0 me\u00f0 m\u00e1lt\u00e6kni s\u00e9 \u00e1tt vi\u00f0 hvers kyns samvinnu tungum\u00e1ls og t\u00f6lvut\u00e6kni sem hefur einhvern hagn\u00fdtan tilgang; beinist a\u00f0 \u00fev\u00ed a\u00f0 hanna e\u00f0a \u00fatb\u00faa [&hellip;]<\/p>\n","protected":false},"author":141,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[158635],"tags":[],"class_list":["post-3081","post","type-post","status-publish","format-standard","hentry","category-malfar"],"_links":{"self":[{"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/posts\/3081","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/users\/141"}],"replies":[{"embeddable":true,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/comments?post=3081"}],"version-history":[{"count":1,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/posts\/3081\/revisions"}],"predecessor-version":[{"id":3082,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/posts\/3081\/revisions\/3082"}],"wp:attachment":[{"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/media?parent=3081"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/categories?post=3081"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/uni.hi.is\/eirikur\/wp-json\/wp\/v2\/tags?post=3081"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}