NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Linuwih, Bryan Wilie, Galih Muridan, Genta Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, Pascale Fung
- Anthology ID: 2023.ijcnlp-main.60
- Volume: Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month: November
- Year: 2023
- Address: Nusa Dua, Bali
- Editors: Jong C. Park, Yuki Arase, Baotian Hu, Wei Lu, Derry Wijaya, Ayu Purwarianti, Adila Alfa Krisnadhi
- Venues: IJCNLP | AACL
- Publisher: Association for Computational Linguistics
- Pages: 921–945
- URL: https://aclanthology.org/2023.ijcnlp-main.60
- DOI: 10.18653/v1/2023.ijcnlp-main.60
- Bibkey: cahyawijaya-etal-2023-nusawrites
- Cite (ACL): Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, Hanung Linuwih, Bryan Wilie, Galih Muridan, Genta Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti, and Pascale Fung. 2023. NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages. In Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 921–945, Nusa Dua, Bali. Association for Computational Linguistics.
- Cite (Informal): NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages (Cahyawijaya et al., IJCNLP-AACL 2023)
- PDF: https://aclanthology.org/2023.ijcnlp-main.60.pdf
@inproceedings{cahyawijaya-etal-2023-nusawrites,
    title = "{N}usa{W}rites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages",
    author = "Cahyawijaya, Samuel and Lovenia, Holy and Koto, Fajri and Adhista, Dea and Dave, Emmanuel and Oktavianti, Sarah and Akbar, Salsabil and Lee, Jhonson and Shadieq, Nuur and Cenggoro, Tjeng Wawan and Linuwih, Hanung and Wilie, Bryan and Muridan, Galih and Winata, Genta and Moeljadi, David and Aji, Alham Fikri and Purwarianti, Ayu and Fung, Pascale",
    editor = "Park, Jong C. and Arase, Yuki and Hu, Baotian and Lu, Wei and Wijaya, Derry and Purwarianti, Ayu and Krisnadhi, Adila Alfa",
    booktitle = "Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = nov,
    year = "2023",
    address = "Nusa Dua, Bali",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.ijcnlp-main.60",
    doi = "10.18653/v1/2023.ijcnlp-main.60",
    pages = "921--945",
}
%0 Conference Proceedings
%T NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
%A Cahyawijaya, Samuel
%A Lovenia, Holy
%A Koto, Fajri
%A Adhista, Dea
%A Dave, Emmanuel
%A Oktavianti, Sarah
%A Akbar, Salsabil
%A Lee, Jhonson
%A Shadieq, Nuur
%A Cenggoro, Tjeng Wawan
%A Linuwih, Hanung
%A Wilie, Bryan
%A Muridan, Galih
%A Winata, Genta
%A Moeljadi, David
%A Aji, Alham Fikri
%A Purwarianti, Ayu
%A Fung, Pascale
%Y Park, Jong C.
%Y Arase, Yuki
%Y Hu, Baotian
%Y Lu, Wei
%Y Wijaya, Derry
%Y Purwarianti, Ayu
%Y Krisnadhi, Adila Alfa
%S Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)
%D 2023
%8 November
%I Association for Computational Linguistics
%C Nusa Dua, Bali
%F cahyawijaya-etal-2023-nusawrites
%R 10.18653/v1/2023.ijcnlp-main.60
%U https://aclanthology.org/2023.ijcnlp-main.60
%U https://doi.org/10.18653/v1/2023.ijcnlp-main.60
%P 921-945