dc.contributor.author: Gasser, Michael
dc.contributor.author: Kifle, Nazareth Amlesom
dc.contributor.author: Ephrem, Binyam Seyoum
dc.date.accessioned: 2021-01-30T23:07:16Z
dc.date.available: 2021-01-30T23:07:16Z
dc.date.created: 2020-12-15T14:23:56Z
dc.date.issued: 2020
dc.identifier.citation: Proceedings of the 7th VarDial Workshop on NLP for Similar Languages, Varieties and Dialects. 2020, 47-56 [en_US]
dc.identifier.isbn: 978-1-952148
dc.identifier.uri: https://hdl.handle.net/11250/2725438
dc.description.abstract: For languages with complex morphology, word-to-word translation is a task with various potential applications, for example, in information retrieval, language instruction, and dictionary creation, as well as in machine translation. In this paper, we confine ourselves to the subtask of character alignment for the particular case of families of related languages with very few resources for most or all members. There are many such families; we focus on the subgroup of Semitic languages spoken in Ethiopia and Eritrea. We begin with an adaptation of the familiar alignment algorithms behind statistical machine translation, modifying them as appropriate for our task. We show how character alignment can reveal morphological, phonological, and orthographic correspondences among related languages. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Association for Computational Linguistics [en_US]
dc.relation.ispartof: Proceedings of the 7th VarDial Workshop on NLP for Similar Languages, Varieties and Dialects
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/deed.no [*]
dc.title: Character Alignment in Morphologically Complex Translation Sets for Related Languages [en_US]
dc.type: Chapter [en_US]
dc.description.version: publishedVersion [en_US]
dc.subject.nsi: VDP::Humanities: 000::Linguistics: 010 [en_US]
dc.source.pagenumber: 47-56 [en_US]
dc.identifier.cristin: 1860092
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1


Except where otherwise noted, this item is licensed under Attribution 4.0 International.