Papers

CoNLL#: Fine-grained Error Analysis and a Corrected Test Set for CoNLL-03 English
Andrew Rueda, Elena Alvarez-Mellado, Constantine Lignos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024

ParaNames 1.0: Creating an Entity Name Corpus for 400+ Languages Using Wikidata
Jonne Sälevä, Constantine Lignos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024

QueryNER: Segmentation of E-commerce Queries
Chester Palen-Michel, Lizzie Liang, Zhe Wu, Constantine Lignos
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), 2024

Improving NER Research Workflows with SeqScore
Constantine Lignos, Maya Kruse, Andrew Rueda
Proceedings of The 3rd Workshop for Natural Language Processing Open Source Software (NLP-OSS 2023), 2023

LR-Sum: Summarization for Less-Resourced Languages
Chester Palen-Michel, Constantine Lignos
Findings of the Association for Computational Linguistics: ACL 2023, 2023

What changes when you randomly choose BPE merge operations? Not much.  (publisher link)
Jonne Sälevä, Constantine Lignos
Proceedings of the The Fourth Workshop on Insights from Negative Results in NLP, 2023 (version on arXiv contains more experiments than the official ACL Anthology version)

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition
David Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman, Chester Palen-Michel, Constantine Lignos, Jesujoba Alabi, Shamsuddeen Muhammad, Peter Nabende, Cheikh M. Bamba Dione, Andiswa Bukula, Rooweither Mabuya, Bonaventure F. P. Dossou, Blessing Sibanda, Happy Buzaaba, Jonathan Mukiibi, Godson Kalipe, Derguene Mbaye, Amelia Taylor, Fatoumata Kabore, Chris Chinenye Emezue, Anuoluwapo Aremu, Perez Ogayo, Catherine Gitau, Edwin Munkoh-Buabeng, Victoire Memdjokam Koagne, Allahsera Auguste Tapo, Tebogo Macucwa, Vukosi Marivate, Mboning Tchiaze Elvis, Tajuddeen Gwadabe, Tosin Adewumi, Orevaoghene Ahia, Joyce Nakatumba-Nabende, Neo Lerato Mokono, Ignatius Ezeani, Chiamaka Chukwuneke, Mofetoluwa Oluwaseun Adeyemi, Gilles Quentin Hacheme, Idris Abdulmumin, Odunayo Ogundepo, Oreen Yousuf, Tatiana Moteu, Dietrich Klakow
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Borrowing or Codeswitching? Annotating for Finer-Grained Distinctions in Language Mixing
Elena Alvarez-Mellado, Constantine Lignos
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Multilingual Open Text Release 1: Public Domain News in 44 Languages
Chester Palen-Michel, June Kim, Constantine Lignos
Proceedings of the Thirteenth Language Resources and Evaluation Conference, 2022

Detecting Unassimilated Borrowings in Spanish: An Annotated Corpus and Approaches to Modeling
Elena Álvarez-Mellado, Constantine Lignos
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, 2022

Toward More Meaningful Resources for Lower-resourced Languages
Constantine Lignos, Nolan Holley, Chester Palen-Michel, Jonne Sälevä
Findings of the Association for Computational Linguistics: ACL 2022, 2022

ParaNames: A Massively Multilingual Entity Name Corpus
Jonne Sälevä, Constantine Lignos
Proceedings of the 4th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, 2022. (Non-archival extended abstract)

SeqScore: Addressing Barriers to Reproducible Named Entity Recognition Evaluation
Chester Palen-Michel, Nolan Holley, Constantine Lignos
Proceedings of the 2nd Workshop on Evaluation and Comparison of NLP Systems, 2021

MasakhaNER: Named Entity Recognition for African Languages
David Ifeoluwa Adelani, Jade Abbott, Graham Neubig, Daniel D’souza, Julia Kreutzer, Constantine Lignos, Chester Palen-Michel, Happy Buzaaba, Shruti Rijhwani, Sebastian Ruder, Stephen Mayhew, Israel Abebe Azime, Shamsuddeen H. Muhammad, Chris Chinenye Emezue, Joyce Nakatumba-Nabende, Perez Ogayo, Aremu Anuoluwapo, Catherine Gitau, Derguene Mbaye, Jesujoba Alabi, Seid Muhie Yimam, Tajuddeen Rabiu Gwadabe, Ignatius Ezeani, Rubungo Andre Niyongabo, Jonathan Mukiibi, Verrah Otiende, Iroro Orife, Davis David, Samba Ngom, Tosin Adewumi, Paul Rayson, Mofetoluwa Adeyemi, Gerald Muriuki, Emmanuel Anebi, Chiamaka Chukwuneke, Nkiruka Odu, Eric Peter Wairagala, Samuel Oyerinde, Clemencia Siro, Tobius Saul Bateesa, Temilola Oloyede, Yvonne Wambui, Victor Akinode, Deborah Nabagereka, Maurice Katusiime, Ayodele Awokoya, Mouhamadane MBOUP, Dibora Gebreyohannes, Henok Tilaye, Kelechi Nwaike, Degaga Wolde, Abdoulaye Faye, Blessing Sibanda, Orevaoghene Ahia, Bonaventure F. P. Dossou, Kelechi Ogueji, Thierno Ibrahima DIOP, Abdoulaye Diallo, Adewale Akinfaderin, Tendai Marengereke, Salomey Osei
Transactions of the Association for Computational Linguistics, Volume 9, 1116–1131. 2021

Macro-Average: Rare Types Are Important Too
Thamme Gowda, Weiqiu You, Constantine Lignos, Jonathan May
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021

Mining Wikidata for Name Resources for African Languages
Jonne Sälevä, Constantine Lignos
AfricaNLP Workshop at the 16th Conference of the European Chapter of the Association for Computational Linguistics, 2021. (Non-archival)

The Effectiveness of Morphology-aware Segmentation in Low-Resource Neural Machine Translation
Jonne Sälevä, Constantine Lignos
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, 164-174. 2021

TMR: Evaluating NER Recall on Tough Mentions
Jingxuan Tu, Constantine Lignos
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, 155-163. 2021

Effective Architectures for Low Resource Multilingual Named Entity Transliteration
Molly Moran, Constantine Lignos
Proceedings of the 3rd Workshop on Technologies for MT of Low Resource Languages, 79-86. 2020

If You Build Your Own NER Scorer, Non-replicable Results Will Come
Constantine Lignos, Marjan Kamyab
Proceedings of the First Workshop on Insights from Negative Results in NLP, 94-99. 2020

Real-World Causal Relationship Discovery from Text
Constantine Lignos, Chester Palen-Michel, Oskar Singer, Pedro Szekely, and Elizabeth Boschee
Proceedings of the 18th Annual International Semantic Web Conference, 2019

The Challenges of Optimizing Machine Translation for Low Resource Cross-Language Information Retrieval  (publisher link)
Constantine Lignos, Daniel Cohen, Yen-Chieh Lien,Pratik Mehta, W. Bruce Croft, and Scott Miller
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), 3488-3493. 2019 (ACL Anthology version is missing appendix)

SARAL: A Low-Resource Cross-Lingual Domain-Focused Information Retrieval System for Effective Rapid Document Triage
Elizabeth Boschee, Joel Barry, Jayadev Billa, Marjorie Freedman, Thamme Gowda, Constantine Lignos, Chester Palen-Michel, Michael Pust, Banriskhem Kayang Khonglah, Srikanth Madikeri, Jonathan May, and Scott Miller
Proceedings of the 57th Conference of the Association for Computational Linguistics: System Demonstrations, 19-24. 2019

Combining rule-based and statistical mechanisms for low-resource named entity recognition  (publisher link)  (preprint)
Ryan Gabbard, Jay DeYoung, Constantine Lignos, Marjorie Freedman, Ralph Weischedel
Machine Translation, 32 (1-2), 31-43. 2018

Morphology and language acquisition  (publisher link)
Constantine Lignos, Charles Yang
Cambridge Handbook of Morphology, 765-791. Andrew Hippisley and Gregory T. Stump (Eds.). 2017

Provably correct reactive control from natural language  (publisher link)
Constantine Lignos, Vasumathi Raman, Cameron Finucane, Mitchell Marcus, Hadas Kress-Gazit
Autonomous Robots, 38 (1), 89-105. 2015

Spectro-temporal correlates of lexical access during auditory lexical decision
Jonathan Brennan, Constantine Lignos, David Embick, and Timothy P.L. Roberts
Brain and Language, 133, 39-46. 2014

Modeling Words in the Mind
Constantine Lignos
University of Pennsylvania PhD Dissertation, 2013

Sorry Dave, I'm afraid I can't do that: Explaining unachievable robot tasks using natural language
Vasumathi Raman, Constantine Lignos, Cameron Finucane, Kenton CT Lee, Mitchell P. Marcus, and Hadas Kress-Gazit
Proceedings of Robotics: Science and Systems IX, 2013

Make it so: Continuous, flexible natural language interaction with an autonomous robot
Daniel J. Brooks, Constantine Lignos, Cameron Finucane, Mikhail S. Medvedev, Ian Perera, Vasumathi Raman, Hadas Kress-Gazit, Mitchell P. Marcus, Holly A Yanco.
Proccedings of the Grounding Language for Physical Systems Workshop at the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012

Revisiting frequency and storage in morphological processing
Constantine Lignos and Kyle Gorman
Proceedings of the 48th Annual Meeting of the Chicago Linguistic Society, 447-461, 2014. (This article was submitted in November 2012, but proceedings were not officially published until 2014.)

Infant word segmentation: An incremental, integrated model
Constantine Lignos
Proceedings of the West Coast Conference on Formal Linguistics 30, 2012

You can't get there from here: On interpreting learning experiments
Constantine Lignos
Proceedings of Penn Linguistics Colloquium 36, 2012

Modeling infant word segmentation
Constantine Lignos
Proceedings of the Fifteenth Conference on Computational Natural Language Learning, 29-38. 2011

Learning from unseen data
Constantine Lignos
Proceedings of the Morpho Challenge 2010 Workshop, 35-38. 2010

Recession segmentation: Simpler online word segmentation using limited resources
Constantine Lignos and Charles Yang
Proceedings of the Fourteenth Conference on Computational Natural Language Learning, 88-97. 2010

Investigating the relationship between linguistic representation and computation through an unsupervised model of human morphology learning  (publisher link)
Erwin Chan and Constantine Lignos
Research on Language and Computation, 8 (2), 209-238. 2010

Evidence for a morphological acquisition model from development data
Constantine Lignos, Erwin Chan, Charles Yang, and Mitchell P. Marcus
Proceedings of the 34th Annual Boston University Conference on Language Development, 2, 269-280. 2010

A rule-based acquisition model adapted for morphological analysis  (publisher link)
Constantine Lignos, Erwin Chan, Mitchell P. Marcus, and Charles Yang
Multilingual Information Access Evaluation I. Text Retrieval Experiments. Lecture Notes in Computer Science, 6241, 658-665. 2010

A rule-based unsupervised morphology learning framework
Constantine Lignos, Erwin Chan, Mitchell P. Marcus, and Charles Yang
Working Notes of the 10th Workshop of the Cross-Language Evaluation Forum (CLEF), 2009

Effects of head movement on perceptions of humanoid robot behavior
Emily Wang, Constantine Lignos, Ashish Vatsal, and Brian Scassellati
HRI '06: Proceedings of the 1st ACM SIGCHI/SIGART Conference on Human-Robot Interaction, 180-185, 2006