Çağrı Çöltekin - personal web page

Çağrı Çöltekin and Gül Eskişar (2025) Exploring the Limits of Soft Power: Sentiments and Narratives on China in Turkish Social Media and Political Speeches of Elites. In: Narrating China and Europe in Uncertain Times, pages 199–224 [bib]

Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden and others (2025) ParlaMint II: advancing comparable parliamentary corpora across Europe. Language Resources and Evaluation, 59:2071–2102 [bib]

Johannes Kiesel, Çağrı Çöltekin, Marcel Gohsen, Sebastian Heineking, Maximilian Heinrich, Maik Fröbe, Tim Hagen, Mohammad Aliannejadi, Sharat Anand, Tomaž Erjavec and others (2025) Overview of Touché 2025: argumentation systems. In: International Conference of the Cross-Language Evaluation Forum for European Languages, pages 486–508 [bib]

Matthew Andrews and Cagri Coltekin (2025) Developing a Universal Dependencies Treebank for Alaskan Gwich'in. In: Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 164–173 [bib]

Arofat Akhundjanova, Furkan Akkurt, Bermet Chontaeva, Soudabeh Eslami and Cagri Coltekin (2025) Parallel Universal Dependencies Treebanks for Turkic Languages. In: Proceedings of the Eighth Workshop on Universal Dependencies (UDW, SyntaxFest 2025), pages 129–136 [bib]

Jonathan Washington, Çağrı Çöltekin, Furkan Akkurt, Bermet Chontaeva, Soudabeh Eslami, Gulnura Jumalieva, Aida Kasieva, Aslı Kuzgun, Büşra Marşan and Chihiro Taguchi (2024) Strategies for the Annotation of Pronominalised Locatives in Turkic Universal Dependency Treebanks. In: Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, pages 207–219 [bib]

Leixin Zhang and Çağrı Çöltekin (2024) Tübingen-CL at SemEval-2024 Task 1: Ensemble Learning for Semantic Relatedness Estimation. In: Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 1019–1025 [bib]

Johannes Kiesel, Çağrı Çöltekin, Maximilian Heinrich, Maik Fröbe, Milad Alshomary, Bertrand Longueville, Tomaž Erjavec, Nicolas Handke, Matyáš Kopp, Nikola Ljubešić, Katja Meden, Nailia Mirzakhmedova, Vaidas Morkevičius, Theresa Reitis-Münstermann, Mario Scharfbillig, Nicolas Stefanovitch, Henning Wachsmuth, Martin Potthast and Benno Stein (2024) Overview of Touché 2024: Argumentation Systems. In: Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the Fifteenth International Conference of the CLEF Association (CLEF 2024), [bib]

Eleni Vligouridou, Inessa Iliadou and Çağrı Çöltekin (2024) A Treebank of Asia Minor Greek. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1715–1721 [bib]

Recep Cekinel, Çağrı Çöltekin and Pinar Karagoz (2024) Cross-Lingual Learning vs. Low-Resource Fine-Tuning: A Case Study with Fact-Checking in Turkish. In: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 4127–4142 [bib]

Mayank Jobanputra, Maitrey Mehta and Çağrı Çöltekin (2024) A Universal Dependencies Treebank for Gujarati. In: Proceedings of the Joint Workshop on Multiword Expressions and Universal Dependencies (MWE-UD) @ LREC-COLING 2024, pages 56–62 [bib]

Çağrı Çöltekin, Matyáš Kopp, Meden Katja, Vaidas Morkevicius, Nikola Ljubešić and Tomaž Erjavec (2024) Multilingual Power and Ideology identification in the Parliament: a reference dataset and simple baselines. In: Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024, pages 94–100 [bib]

Giulio Cusenza and Çağrı Çöltekin (2024) NLP for Arbëresh: How an Endangered Language Learns to Write in the 21st Century. In: Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 252–256 [bib]

Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešic, Çagrı Çöltekin, Matyáš Kopp, Katja Meden and Taja Kuzman (2023) The ParlaMint Project: Ever-growing Family of Comparable and Interoperable Parliamentary Corpora. In: CLARIN Annual Conference Proceedings, pages 62–65 [bib]

Noëmi Aepli, Çağrı Çöltekin, Rob Van Der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešić, Kai North, Barbara Plank, Yves Scherrer and Marcos Zampieri (2023) Findings of the VarDial Evaluation Campaign 2023. In: Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023), pages 251–261 [bib]

Çağrı Çöltekin, Matteo Brivio and Fidan Can (2023) Tübingen at PoliticIT: Exploring SVMs, Pretrained Language Models, and Linguistic Transfer for Ideology Detection in Social Media. [bib]

Çağrı Çöltekin and Taraka Rama (2023) What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity. Linguistics Vanguard, 9:27–43 [bib]

Özlem Çetinoğlu and Çağrı Çöltekin (2023) Two languages, one treebank: building a Turkish–German code-switching treebank and its challenges. Language Resources and Evaluation, pages 545–579 [bib]

Matteo Brivio and Çağrı Çöltekin (2022) [¬Re] Hate Speech Detection based on Sentiment Knowledge Sharing. ReScience C, 8:\#7 [bib]

Gül Eskisar and Cagrı Cöltekin (2022) Emotions Running High? A Synopsis of the State of Turkish Politics through the ParlaMint Corpus. In: Workshop on Creating, Enriching and Using Parliamentary Corpora, pages 61–70 [bib]

Diana Hoefels, Çağrı Çöltekin and Irina Mădroane (2022) CoRoSeOf - An Annotated Corpus of Romanian Sexist and Offensive Tweets. In: Proceedings of the Language Resources and Evaluation Conference, pages 2269–2281 [bib]

Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp and Meden Katja (2022) ParlaMint II: The Show Must Go On. In: Proceedings of the Workshop ParlaCLARIN III within the 13th Language Resources and Evaluation Conference, pages 1–6 [bib]

Çağrı Çöltekin, A Doğruöz and Özlem Çetinoğlu (2022) Resources for Turkish Natural Language Processing: A critical survey. Language Resources and Evaluation, [bib]

Tomaž Erjavec, Maciej Ogrodniczuk, Petya Osenova, Nikola Ljubešić, Kiril Simov, Andrej Pančur, Michał Rudolf, Matyáš Kopp, Starkaður Barkarson, Steinþór Steingrímsson, Çağrı Çöltekin, Jesse de Does, Katrien Depuydt, Tommaso Agnoloni, Giulia Venturi, María Calzada Pérez, Luciana de Macedo, Costanza Navarretta, Giancarlo Luxardo, Matthew Coole, Paul Rayson, Vaidas Morkevičius, Tomas Krilavičius, Roberts Darǵis, Orsolya Ring, Ruben van Heusden, Maarten Marx and Darja Fišer (2022) The ParlaMint corpora of parliamentary proceedings. Language resources and evaluation, 57:415–448 [bib]

Alisan Balkoca, Abdullah Algan, Cengiz Acarturk and Çağrı Çöltekin (2021) Team ReadMe at CMCL 2021 Shared Task: Predicting Human Reading Patterns by Traditional Oculomotor Control Models and Machine Learning. In: Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 134–140 [bib]

Mihai Manolescu and Çağrı Çöltekin (2021) ROFF - A Romanian Twitter Dataset for Offensive Language. In: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2021), pages 895–900 [bib]

Çağrı Çöltekin (2020) Verification, Reproduction and Replication of NLP Experiments: a Case Study on Parsing Universal Dependencies. In: Proceedings of the Fourth Workshop on Universal Dependencies (UDW 2020), pages 46–56 [pdf] [bib]

Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis and Çağrı Çöltekin (2020) SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). In: Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1425–1447 [bib]

Çağrı Çöltekin (2020) Dialect Identification under Domain Shift: Experiments with Discriminating Romanian and Moldavian. In: Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 186–192 [pdf] [bib]

Çağrı Çöltekin (2020) A Corpus of Turkish Offensive Language on Social Media. In: Proceedings of The 12th Language Resources and Evaluation Conference, pages 6174–6184 [pdf] [bib]

Çağrı Çöltekin (2020) Predicting Educational Achievement Using Linear Models. In: Proceedings of the GermEval 2020 Task 1 Workshop in conjunction with the 5th SwissText & 16th KONVENS Joint Conference 2020, pages 23–29 [pdf] [bib]

Eva Huber and Çağrı Çöltekin (2020) Reproduction and Replication: A Case Study with Automatic Essay Scoring. In: Proceedings of The 12th Language Resources and Evaluation Conference, pages 5603–5613 [pdf] [bib]

Çağrı Çöltekin (2019) Cross-lingual morphological inflection with explicit alignment. In: Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 71–79 [pdf] [bib]

Özlem Çetinoğlu and Çağrı Çöltekin (2019) Challenges of Annotating a Code-Switching Treebank. In: Proceedings of the 18th International Workshop on Treebanks and Linguistic Theories (TLT, SyntaxFest 2019), pages 82–90 [pdf] [bib]

Nianheng Wu, Eric DeMattos, Kwok So, Pin-zhen Chen and Çağrı Çöltekin (2019) Language Discrimination and Transfer Learning for Similar Languages: Experiments with Feature Combinations and Adaptation. In: Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects, pages 54–63 [pdf] [bib]

Çağrı Çöltekin and Jeremy Barnes (2019) Neural and Linear Pipeline Approaches to Cross-lingual Morphological Analysis. In: Proceedings of the Sixth Workshop on NLP for Similar Languages, Varieties and Dialects, pages 153–164 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2018) Tübingen-Oslo system: Linear regression works the best at Predicting Current and Future Psychological Health from Childhood Essays in the CLPsych 2018 Shared Task. ArXiv e-prints, [pdf] [bib]

Aleksandrs Berdicevskis, Çağrı Çöltekin, Katharina Ehret, Kilu von Prince, Daniel Ross, Bill Thompson, Chunxiao Yan, Vera Demberg, Gary Lupyan, Taraka Rama and Christian Bentz (2018) Using Universal Dependencies in cross-linguistic complexity research. In: Proceedings of the Second Workshop on Universal Dependencies (UDW 2018), pages 8–17 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2018) Drug-Use Identification from Tweets with Word and Character N-Grams. In: Proceedings of the 2018 EMNLP Workshop SMM4H: The 3rd Social Media Mining for Health Applications Workshop & Shared Task, pages 52–53 [pdf] [bib]

Inna Pirina and Çağrı Çöltekin (2018) Identifying Depression on Reddit: The Effect of Training Data. In: Proceedings of the 2018 EMNLP Workshop SMM4H: The 3rd Social Media Mining for Health Applications Workshop & Shared Task, pages 9–12 [pdf] [bib]

Pavel Sofroniev and Çağrı Çöltekin (2018) Phonetic Vector Representations for Sound Sequence Alignment. In: Proceedings of the Fifteenth Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 111–116 [pdf] [bib]

Çağrı Çöltekin, Taraka Rama and Verena Blaschke (2018) Tübingen-Oslo Team at the VarDial 2018 Evaluation Campaign: An Analysis of N-gram Features in Language Variety Identification. In: Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), pages 55–65 [pdf] [bib]

Natalie Boll-Avetisyan, Jessie Nixon, Tomas Lentz, Liquan Liu, Sandrien van Ommen, Çağrı Çöltekin and Jacolien van Rij (2018) Neural response development during distributional learning. In: Proceedings of INTERSPEECH 2018, pages 1432–1436 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2018) Tübingen-Oslo at SemEval-2018 Task 2: SVMs perform better than RNNs at Emoji Prediction. In: Proceedings of the 12th International Workshop on Semantic Evaluation (SemEval-2018), pages 34-–38 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2018) Exploiting Universal Dependencies Treebanks for Measuring Morphosyntactic Complexity. In: Proceedings of First Workshop on Measuring Language Complexity, pages 1–7 [pdf] [bib]

Jessie Nixon, Natalie Boll-Avetisyan, Tomas Lentz, Sandrien van Ommen, Brigitta Keij, Çağrı Çöltekin, Liquan Liu and Jacolien van Rij (2018) Short-term exposure enhances perception of both between- and within-category acoustic information. In: Proceedings of Speech Prosody 9, pages 114–118 [pdf] [bib]

Amir More, Özlem Çetinoğlu, Çağrı Çöltekin, Nizar Habash, Benoît Sagot, Djamé Seddah, Dima Taji and Reut Tsarfaty (2018) CoNLL-UL: Universal Morphological Lattices for Universal Dependency Parsing. In: Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC'18), pages 3847–3853 [pdf] [bib]

Francis Tyers, Jonathan Washington, Çağrı Çöltekin and Aibek Makazhanov (2017) An assessment of Universal Dependency annotation guidelines for Turkic languages. In: 5th International Conference on Turkic Language Processing (TURKLANG 2017), pages 356-377 [pdf] [bib]

Taraka Rama and Çağrı Çöltekin (2017) Fewer features perform well at Native Language Identification task. In: Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications, pages 255–260 [pdf] [bib]

Çağrı Çöltekin, Ben Campbell, Erhard Hinrichs and Heike Telljohann (2017) Converting the TüBa-D/Z Treebank of German to Universal Dependencies. In: Proceedings of the NoDaLiDa 2017 Workshop on Universal Dependencies (UDW 2017), pages 27–37 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2017) Tübingen system in VarDial 2017 shared task: experiments with language identification and cross-lingual parsing. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), pages 146–155 [pdf] [bib]

Taraka Rama, Çağrı Çöltekin and Pavel Sofroniev (2017) Computational analysis of Gondi dialects. In: Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial), pages 26–35 [pdf] [bib]

Çağrı Çöltekin (2017) Using Predictability for Lexical Segmentation. Cognitive Science, 41:1988-2021 [pdf] [bib]

Taraka Rama and Çağrı Çöltekin (2016) LSTM Autoencoders for Dialect Analysis. In: Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3), pages 25–32 [pdf] [bib]

Çağrı Çöltekin and Taraka Rama (2016) Discriminating Similar Languages with Linear SVMs and Neural Networks. In: Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial3), pages 15–24 [pdf] [bib]

Çağrı Çöltekin (2016) (When) do we need inflectional groups? In: Proceedings of The First International Conference on Turkic Computational Linguistics, [pdf] [bib]

Özlem Çetinoğlu and Çağrı Çöltekin (2016) Part of Speech Annotation of a Turkish-German Code-Switching Corpus. In: Proceedings of the 10th Linguistic Annotation Workshop (LAW-X), pages 120–130 [pdf] [bib]

Jianqiang Ma, Çağrı Çöltekin and Erhard Hinrichs (2016) Learning Phone Embeddings for Word Segmentation of Child-Directed Speech. In: Proceedings Workshop on Cognitive Aspects of Computational Language Learning, pages 53–63 [pdf] [bib]

Umut Sulubacak, Memduh Gokirmak, Francis Tyers, Çağrı Çöltekin, Joakim Nivre and Gülşen Eryiğit (2016) Universal Dependencies for Turkish. In: Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 3444–3454 [pdf] [bib]

Therese Leinonen, Çağrı Çöltekin and John Nerbonne (2016) Using Gabmap. Lingua, 178:71–83 [pdf] [bib]

Carmen Klaussner, John Nerbonne and Çağrı Çöltekin (2015) Finding Characteristic Features in Stylometric Analysis. Journal of Digital Scholarship in the Humanities, 30:i114–i129 [pdf] [bib]

Çağrı Çöltekin (2015) A grammar-book treebank of Turkish. In: Proceedings of the 14th workshop on Treebanks and Linguistic Theories (TLT 14), pages 35–49 [pdf] [bib]

Erhard Hinrichs, Daniël de Kok and Çağrı Çöltekin (2015) Treebank Data and Query Tools for Rare Syntactic Constructions. In: Proceedings of the 14th workshop on Treebanks and Linguistic Theories (TLT 14), pages 106–118 [pdf] [bib]

Çağrı Çöltekin (2015) Turkish NLP web services in the WebLicht environment. In: Proceedings of the CLARIN Annual Conference, [pdf] [bib]

Çağrı Çöltekin (2015) Units in segmentation: a computational investigation. In: Proceedings of EMNLP 2015 Workshop on Cognitive Aspects of Computational Language Learning, pages 55–64 [pdf] [bib]

Güliz Güneş and Çağrı Çöltekin (2015) Mapping to prosody: not all parentheticals are alike. In: Parenthetical Verbs, pages 287–332 [pdf] [bib]

Çağrı Çöltekin (2014) A Set of Open Source Tools for Turkish Natural Language Processing. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 1079–1086 [pdf] [bib]

Çağrı Çöltekin and John Nerbonne (2014) An explicit statistical model of learning lexical segmentation using multiple cues. In: Proceedings of EACL 2014 Workshop on Cognitive Aspects of Computational Language Learning, pages 19–28 [pdf] [bib]

Lili Szabó and Çağrı Çöltekin (2013) A linear model for exploring types of vowel harmony. Computational Linguistics in the Netherlands Journal, 3:174-192 [pdf] [bib]

Jelena Prokić, Çağrı Çöltekin and John Nerbonne (2012) Detecting shibboleths. In: Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH, pages 72–80 [pdf] [bib]

Çağrı Çöltekin (2011) Catching Words in a Stream of Speech: Computational simulations of segmenting transcribed child-directed speech. ' PhD thesis, University of Groningen [pdf] [bib]

Çağrı Çöltekin (2010) A Freely Available Morphological Analyzer for Turkish. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC 2010), pages 820–827 [pdf] [bib]

Çağrı Çöltekin (2010) Improving Successor Variety for Morphological Segmentation. In: Proceedings of the 20th Meeting of Computational Linguistics in the Netherlands, pages 13–28 [pdf] [bib]

Çağrı Çöltekin (2009) Modeling Acquisition of Word Structure with Lexicalized Grammar Learning. Short paper presented at the Workshop on Psychocomputational Models of Human Language Acquisition [bib]

Xuchen Yao, Jianqiang Ma, Sergio Duarte and Çağrı Çöltekin (2009) An Inference-rules based Categorial Grammar Learner for Simulating Language Acquisition. In: Proceedings of the 18th Annual Belgian-Dutch Conference on Machine Learning, pages 29–37 [pdf] [bib]

Xuchen Yao, Jianqiang Ma, Sergio Duarte and Çağrı Çöltekin (2009) Unsupervised Syntax Learning with Categorial Grammars using Inference Rules. In: Proceedings of The 14th Student Session of the European Summer School for Logic, Language, [pdf] [bib]

Çağrı Çöltekin and Cem Bozşahin (2007) Syllables, Morphemes and Bayesian Computational Models of Acquiring a Word Grammar. In: Proceedings of 29th Annual Meeting of Cognitive Science Society, pages 887–892 [pdf] [bib]

Çağrı Çöltekin (2006) From Syllable to Meaning: Effects of Knowledge of Syllable in Learning The Meaning Bearing Units of Language. Master's thesis, Middle East Technical University [bib]

Publications