dc.contributor.author: Gasser, Michael
dc.contributor.author: Kifle, Nazareth Amlesom
dc.contributor.author: Ephrem, Binyam Seyoum
dc.date.accessioned: 2021-01-30T23:07:16Z
dc.date.available: 2021-01-30T23:07:16Z
dc.date.created: 2020-12-15T14:23:56Z
dc.date.issued: 2020
dc.identifier.citation: Proceedings of the 7th VarDial Workshop on NLP for Similar Languages, Varieties and Dialects. 2020, 47-56 [en_US]
dc.identifier.isbn: 978-1-952148
dc.identifier.uri: https://hdl.handle.net/11250/2725438
dc.description.abstract: For languages with complex morphology, word-to-word translation is a task with various potential applications, for example, in information retrieval, language instruction, and dictionary creation, as well as in machine translation. In this paper, we confine ourselves to the subtask of character alignment for the particular case of families of related languages with very few resources for most or all members. There are many such families; we focus on the subgroup of Semitic languages spoken in Ethiopia and Eritrea. We begin with an adaptation of the familiar alignment algorithms behind statistical machine translation, modifying them as appropriate for our task. We show how character alignment can reveal morphological, phonological, and orthographic correspondences among related languages. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Association for Computational Linguistics [en_US]
dc.relation.ispartof: Proceedings of the 7th VarDial Workshop on NLP for Similar Languages, Varieties and Dialects
dc.rights: Attribution 4.0 International [*]
dc.rights.uri: http://creativecommons.org/licenses/by/4.0/deed.no [*]
dc.title: Character Alignment in Morphologically Complex Translation Sets for Related Languages [en_US]
dc.type: Chapter [en_US]
dc.description.version: publishedVersion [en_US]
dc.subject.nsi: VDP::Humanities: 000::Linguistics: 010 [en_US]
dc.source.pagenumber: 47-56 [en_US]
dc.identifier.cristin: 1860092
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1


Except where otherwise noted, this item's license is described as Attribution 4.0 International