TY - GEN
T1 - Minoan Linguistic Resources
T2 - 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, LaTeCH 2015
AU - PETROLITO, Tommaso
AU - PETROLITO, Ruggero
AU - PERONO CACCIAFOCO, Francesco
AU - WINTERSTEIN, Grégoire
N1 - PETROLITO, Tommaso, and Ruggero PETROLITO, Francesco PERONO CACCIAFOCO, Grégoire WINTERSTEIN. (2015). Minoan Linguistic Resources: The Linear A Digital Corpus. Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities (LaTech) / ACL (Association for Computational Linguistics)-IJCNLP, July 26-31, 2015, Beijing, PRC (China National Convention Center - CNCC): 95-104
Publisher Copyright:
© 2015 Proceedings of the Annual Meeting of the Association for Computational Linguistics.
PY - 2015/7
Y1 - 2015/7
N2 - This paper describes the Linear A/Minoan digital corpus and the approaches we applied to develop it. We aim to set up a suitable study resource for Linear A and Minoan. Firstly we start by introducing Linear A and Minoan in order to make it clear why we should develop a digital marked up corpus of the existing Linear A transcriptions. Secondly we list and describe some of the existing resources about Linear A: Linear A documents (seals, statuettes, vessels etc.), the traditional encoding systems (standard code numbers referring to distinct symbols), a Linear A font, and the newest (released on June 16th 2014) Unicode Standard Characters set for Linear A. Thirdly we explain our choice concerning the data format: why we decided to digitize the Linear A resources; why we decided to convert all the transcriptions in standard Unicode characters; why we decided to use an XML format; why we decided to implement the TEI-EpiDoc DTD. Lastly we describe: the developing process (from the data collection to the issues we faced and the solving strategies); a new font we developed (synchronized with the Unicode Characters Set) in order to make the data readable even on systems that are not updated. Finally, we discuss the corpus we developed in a Cultural Heritage preservation perspective and suggest some future works. c 2015 Association for Computational Linguistics and The Asian Federation of Natural Language Processing.
AB - This paper describes the Linear A/Minoan digital corpus and the approaches we applied to develop it. We aim to set up a suitable study resource for Linear A and Minoan. Firstly we start by introducing Linear A and Minoan in order to make it clear why we should develop a digital marked up corpus of the existing Linear A transcriptions. Secondly we list and describe some of the existing resources about Linear A: Linear A documents (seals, statuettes, vessels etc.), the traditional encoding systems (standard code numbers referring to distinct symbols), a Linear A font, and the newest (released on June 16th 2014) Unicode Standard Characters set for Linear A. Thirdly we explain our choice concerning the data format: why we decided to digitize the Linear A resources; why we decided to convert all the transcriptions in standard Unicode characters; why we decided to use an XML format; why we decided to implement the TEI-EpiDoc DTD. Lastly we describe: the developing process (from the data collection to the issues we faced and the solving strategies); a new font we developed (synchronized with the Unicode Characters Set) in order to make the data readable even on systems that are not updated. Finally, we discuss the corpus we developed in a Cultural Heritage preservation perspective and suggest some future works. c 2015 Association for Computational Linguistics and The Asian Federation of Natural Language Processing.
KW - Linear A
KW - Language Deciphering
KW - Corpus Linguistics
KW - Digital Humanities
KW - History of Writing
UR - http://www.scopus.com/inward/record.url?scp=85122496155&partnerID=8YFLogxK
UR - https://aclanthology.org/W15-3715/
M3 - Conference Proceeding
AN - SCOPUS:85122496155
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 95
EP - 104
BT - LaTeCH 2015 - Proceedings of the 9th SIGHUM Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
A2 - Zervanou, Kalliopi A.
A2 - van Erp, Marieke
A2 - Alex, Beatrice
PB - Association for Computational Linguistics (ACL)
Y2 - 30 July 2015
ER -