Abstract
Hikayat Lonthoir, a rare saga manuscript collection originating from the Banda Archipelago, Maluku, Indonesia, retains significant Indigenous oral history amidst the Western colonial narrative. This study applies computational methods to the historical manuscript, combining OCR-supervised transcription, corpus linguistic profiling, semantic clustering (Word2Vec + K-Means), and named entity network analysis. The dataset is validated by checking 2793 cleaned word tokens against Indonesian and Malay dictionaries: 50.3% of the tokens appear in both dictionaries, with strong cross-dictionary agreement (κ = 0.76). The lexical analysis reveals vocabulary centred on monarchy/governance, kinship, and maritime life, together with extensive morphological productivity (me-, di-, ter-, pe-/per-, -nya, -an), while the semantic and network analyses identify two narrative cores, which are then mapped onto the Aarne–Thompson–Uther (ATU) and Stith Thompson’s Motif Index of Folk Literature classification systems. These findings demonstrate how computational methods can extract structural, thematic, and relational patterns from historical manuscripts and contribute evidence-based insights to digital philology and historical linguistics.
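The dictionary validation step can be illustrated with a minimal sketch. It assumes that κ refers to Cohen's kappa computed over per-token "found / not found" decisions in the two dictionaries; the abstract does not specify the exact statistic. The function names and the toy word lists below are hypothetical and are not taken from the study's data.

```python
def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of binary labels."""
    n = len(labels_a)
    if n == 0 or n != len(labels_b):
        raise ValueError("label sequences must be non-empty and equal in length")
    # Observed agreement: share of tokens on which both dictionaries agree.
    p_observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Expected chance agreement from each dictionary's own hit rate.
    rate_a = sum(labels_a) / n
    rate_b = sum(labels_b) / n
    p_expected = rate_a * rate_b + (1 - rate_a) * (1 - rate_b)
    if p_expected == 1:
        return 1.0  # degenerate case: both label sets are constant and identical
    return (p_observed - p_expected) / (1 - p_expected)


def dictionary_agreement(tokens, dict_a, dict_b):
    """Return (share of tokens found in both dictionaries, Cohen's kappa)."""
    in_a = [t in dict_a for t in tokens]
    in_b = [t in dict_b for t in tokens]
    both = sum(a and b for a, b in zip(in_a, in_b)) / len(tokens)
    return both, cohen_kappa(in_a, in_b)


# Toy example (hypothetical data, for illustration only).
tokens = ["raja", "laut", "anak", "xyz"]
indonesian = {"raja", "laut", "anak"}
malay = {"raja", "laut"}
overlap, kappa = dictionary_agreement(tokens, indonesian, malay)
```

On this toy input, two of the four tokens occur in both word lists, so the overlap share is 0.5. The reported 50.3% and κ = 0.76 come from the study's own 2793-token dataset and cannot be reproduced from this sketch.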
| Original language | English |
|---|---|
| Article number | 1069 |
| Journal | Information (Switzerland) |
| Volume | 16 |
| Issue number | 12 |
| DOIs | |
| Publication status | Published - 4 Dec 2025 |
Keywords
- Banda Archipelago
- Digital Philology
- Linguistic Documentation
- NLP
- Oral History
- Semantic Analysis
Projects
- 2 Finished
- Mapping the Languages of Indigenous Clans in the Eastern Spice Islands
  1/04/24 → 30/11/25
  Project: Governmental Research Project
- Place Names and Cultural Identity: Toponyms and Their Diachronic Evolution among the Kula People from Alor Island
  1/01/24 → 31/12/25
  Project: Internal Research Project
Research output
- 1 Article
- Unravelling Lexical and Narrative Patterns in the Hikayat Lonthoir: A Computational Linguistics Approach
  Kersapati, M. I., PERONO CACCIAFOCO, F., Sihite, B., WU, S., Putri Widyaningrum, K., Atqa, M. & BIN TONI, E. A., 4 Dec 2025, In: Information (Switzerland). 16, 12, p. 1-26, 26 p., 1069.
  Research output: Contribution to journal › Article › peer-review
  Open Access
Activities
- 2 Other
- Mapping the Languages of Indigenous Clans in the Eastern Spice Islands (Research Grant)
  PERONO CACCIAFOCO, F. (Participant)
  1 Apr 2024 → 30 Nov 2025
  Activity: Other
- Place Names and Cultural Identity: Toponyms and Their Diachronic Evolution among the Kula People from Alor Island (Research Grant)
  PERONO CACCIAFOCO, F. (Participant)
  1 Jan 2024 → 31 Dec 2025
  Activity: Other