Main Conference: Long paper
Xin Zheng, Hongyu Lin, Xianpei Han and Le Sun
Toward Unified Controllable Text Generation via Regular Expression Instruction
Zhenwen Liang, Jipeng Zhang and Xiangliang Zhang
Don’t be Blind to Questions: Question-Oriented Math Word Problem Solving
Chunpeng Ma and Takuya Makino
SILVER: Self Data Augmentation for Out-of-Scope Detection in Dialogues
Potsawee Manakul, Adian Liusie and Mark Gales
MQAG: Multiple-choice Question Answering and Generation for Assessing Information Consistency in Summarization
Hongru Wang, Zezhong WANG, Wai Chung Kwan and Kam-Fai Wong
MCML: A Novel Memory-based Contrastive Meta-Learning Method for Few Shot Slot Tagging
Fiona Anting Tan, Hansi Hettiarachchi, Ali Hürriyetoğlu, Nelleke Oostdijk, Tommaso Caselli, Tadashi Nomoto, Onur Uca, Farhana Ferdousi Liza and See-Kiong Ng
RECESS: Resource for Extracting Cause, Effect, and Signal Spans
Atharva Naik, Soumitra Das, Jyothi Vedurada and Somak Aditya
SYNC: A Structurally Guided Hard Negative Curricula for Generalizable Neural Code Search
Ting-Rui Chiang
On a Benefit of Masked Language Model Pretraining: Robustness to Simplicity Bias
Shamik Roy, Raphael Shu, Nikolaos Pappas, Elman Mansimov, Yi Zhang, Saab Mansour and Dan Roth
Conversation Style Transfer using Few-Shot Learning
David Ifeoluwa Adelani, Marek Masiak, Israel Abebe Azime, Jesujoba Alabi, Atnafu Lambebo Tonja, Christine Mwase, Odunayo Ogundepo, Bonaventure F. P. Dossou, Akintunde Oladipo, Doreen Nixdorf, Chris Chinenye Emezue, sana al-azzawi, Blessing Sibanda, Davis David, Lolwethu Ndolela, Jonathan Mukiibi, Tunde Ajayi, Tatiana Moteu, Brian Odhiambo, Abraham Owodunni, Nnaemeka Obiefuna, Muhidin Mohamed, Shamsuddeen Hassan Muhammad, Teshome Mulugeta Ababu, Saheed Abdullahi Salahudeen, Mesay Gemeda Yigezu, Tajuddeen Gwadabe, Idris Abdulmumin, Mahlet Taye, Oluwabusayo Awoyomi, Iyanuoluwa Shode, Tolulope Adelani, Habiba Abdulganiyu, Abdul-Hakeem Omotayo, Adetola Adeeko, Abeeb Afolabi, Anuoluwapo Aremu, Olanrewaju Samuel, Clemencia Siro, Wangari Kimotho, Onyekachi Ogbu, Chinedu Mbonu, Chiamaka Chukwuneke, Samuel Fanijo, Jessica Ojo, Oyinkansola Awosan, Tadesse Kebede, Toadoum Sari Sakayo, Pamela Nyatsine, Freedmore Sidume, Oreen Yousuf, Mardiyyah Oduwole, kanda Tshinu, Ussen Kimanuka, Thina Diko, Siyanda Nxakama, Sinodos Nigusse, Abdulmejid Johar, Shafie Mohamed, Fuad Mire Hassan, Moges Ahmed Mehamed, Evrard Ngabire, Jules Jules, Ivan Ssenkungu and Pontus Stenetorp
MasakhaNEWS: News Topic Classification for African languages
Ofri Masad, Kfir Bar and Amir Cohen
Automatic Translation of Span-Prediction Datasets
Xiaonan Xu and Haoshuo Chen
Human-Like Distractor Response in Vision-Language Model
William Soto Martinez, Yannick Parmentier and Claire Gardent
Phylogeny-Inspired Soft Prompts For Data-to-Text Generation in Low-Resource Languages
Michael Beukman and Manuel Fokam
Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition
Danae Sánchez Villegas, Catalina Goanta and Nikolaos Aletras
A Multimodal Analysis of Influencer Content on Twitter
Apoorva Singh, Raghav Jain and Sriparna Saha
Reimagining Complaint Analysis: Adopting Seq2Path for a Generative Text-to-Text Framework
Yan Meng, Liangming Pan, Yixin Cao and Min-Yen Kan
FollowupQG: Towards information-seeking follow-up question generation
Bosung Kim, Hayate Iso, Nikita Bhutani, Estevam Hruschka, Ndapa Nakashole and Tom Mitchell
Zero-shot Triplet Extraction by Template Infilling
Kelvin Han and Claire Gardent
Generating and Answering Simple and Complex Questions from Text and from Knowledge Graphs
Qing Lyu, Shreya Havaldar, Adam Stein, Li Zhang, Delip Rao, Eric Wong, Marianna Apidianaki and Chris Callison-Burch
Faithful Chain-of-Thought Reasoning
Raquel G. Alhama, Ruthe Foushee, Daniel Byrne, Allyson Ettinger, Susan Goldin-Meadow and Afra Alishahi
Linguistic Productivity: the Case of Determiners in English
DaHyun Jung, Sugyeong Eo, Chanjun Park, Hyeonseok Moon, Jaehyung Seo and Heuiseok Lim
Informative Evidence-guided Prompt-based Fine-tuning for English-Korean Critical Error Detection
Alberto Muñoz-Ortiz, David Vilares and Carlos Gómez-Rodríguez
Assessment of Pre-Trained Models Across Languages and Grammars
Xiang Dai, Sarvnaz Karimi and Stephen Wan
Rethinking the Role of Entity Type in Relation Classification
Minkyung Park and Byung-Jun Lee
Improving Neural Machine Translation with Offline Evaluations
Ashkan Kazemi, Artem Abzaliev, Naihao Deng, Rui Hou, Scott Hale, Veronica Perez-Rosas and Rada Mihalcea
Query Rewriting for Effective Misinformation Discovery
Yiran Wang, Taro Watanabe, Masao Utiyama and Yuji Matsumoto
24-bit Languages
Sabit Hassan and Malihe Alikhani
DisCGen: A Framework for Discourse-Informed Counterspeech Generation
Md Shihab Shahriar, Ahmad Al Fayad Chowdhury, Md. Amimul Ehsan and Abu Raihan Kamal
Question Answer Generation in Bengali: Mitigating the scarcity of QA datasets in a low-resource language
Bradley Hauer and Grzegorz Kondrak
One Sense per Translation
Jonathan Pilault, Xavier Garcia, Arthur Bražinskas and Orhan Firat
Interactive-Chain-Prompting: Ambiguity Resolution for Crosslingual Conditional Generation with Interaction
Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu and Joshua Garland
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
Peter Vickers, Loic Barrault, Emilio Monti and Nikolaos Aletras
We Need to Talk About Classification Evaluation Metrics in NLP
Liangming Pan, Yunxiang Zhang and Min-Yen Kan
Investigating Zero- and Few-shot Generalization in Fact Verification
Liangming Pan, Wenhu Chen, Min-Yen Kan and William Yang Wang
Attacking Open-domain Question Answering by Injecting Misinformation
Sagi Shaier, Kevin Bennett, Lawrence Hunter and Katharina Kann
Emerging Challenges in Personalized Medicine: Assessing Demographic Effects on Biomedical Question Answering Systems
Nick McKenna, Tianyi Li, Mark Johnson and Mark Steedman
Smoothing Entailment Graphs with Language Models
Pavlos Vougiouklis, Nikos Papasarantopoulos, Danna Zheng, David Tuckey, Chenxin Diao, Zhili Shen and Jeff Pan
FastRAT: Fast and Efficient Cross-lingual Text-to-SQL Semantic Parsing
Abdellah El Mekki, Muhammad Abdul-Mageed, ElMoatez Billah Nagoudi, Ismail Berrada and Ahmed Khoumsi
ProMap: Effective Bilingual Lexicon Induction via Language Model Prompting
Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah and Huan Liu
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
Siva Uday Sampreeth Chebolu, Franck Dernoncourt, Nedim Lipka and Thamar Solorio
A Review of Datasets for Aspect-based Sentiment Analysis
Vincent Nguyen, Sarvnaz Karimi, Maciej Rybinski and Zhenchang Xing
MedRedQA for Medical Consumer Question Answering: Dataset, Tasks, and Neural Baselines
Jacob Tyo, Bhuwan Dhingra and Zachary C. Lipton
Valla: Standardizing and Benchmarking Authorship Attribution and Verification Through Empirical Evaluation and Comparative Analysis
Aritra Raut, Sriparna Saha, Anutosh Maitra and Roshni Ramnani
Sentiment Aided Graph Attentive Contextualization for Task Oriented Negotiation Dialogue Generation
Yejin Bang, Samuel Cahyawijaya, Nayeon Lee, Wenliang Dai, Dan Su, Bryan Wilie, Holy Lovenia, Ziwei Ji, Tiezheng Yu, Willy Chung, Quyet V. Do, Yan Xu and Pascale Fung
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Maggie Liu, Jing Wang and Daniel Preotiuc-Pietro
Analyzing and Predicting Persistence of News Tweets
Zihan Liu, Zewei Sun, Shanbo Cheng, Shujian Huang and Mingxuan Wang
Only 5% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation
Gleb Kuzmin, Artem Vazhentsev, Artem Shelmanov, Xudong Han, Simon Suster, Maxim Panov, Alexander Panchenko and Timothy Baldwin
Uncertainty Estimation for Debiased Models: Does Fairness Hurt Reliability?
Ming Li and Ruihong Huang
Semi-supervised News Discourse Profiling with Contrastive Learning
Iffat Maab, Edison Marrese-Taylor and Yutaka Matsuo
Target-Aware Contextual Political Bias Detection in News
Mrinal Rawat, Hithesh Sankararaman and Victor Barres
Controllable Discovery of Intents: Incremental Deep Clustering Using Semi-Supervised Contrastive Learning
Arda Uzunoglu and Gözde Şahin
Benchmarking Procedural Language Understanding for Low-Resource Languages: A Case Study on Turkish
Federico Martelli, Luigi Procopio, Edoardo Barba and Roberto Navigli
LexicoMatic: Automatic Creation of Multilingual Lexical-Semantic Dictionaries
Minh Nguyen and Nancy Chen
FiRo: Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Hsiu-Yu Yang and Carina Silberer
Implicit Affordance Acquisition via Causal Action–Effect Modeling in the Video Domain
Deepanway Ghosal, Somak Aditya and Monojit Choudhury
Prover: Generating Intermediate Steps for NLI with Commonsense Knowledge Retrieval and Next-Step Prediction
Bar Iluz, Tomasz Limisiewicz, Gabriel Stanovsky and David Mareček
Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation
Ritam Dutt, Sopan Khosla, Vinayshekhar Bannihatti Kumar and Rashmi Gangadharaiah
GrailQA++: A Challenging Zero-Shot Benchmark for Knowledge Base Question Answering
Xincan Feng, Hidetaka Kamigaito, Katsuhiko Hayashi and Taro Watanabe
Model-based Subsampling for Knowledge Graph Completion
Samuel Cahyawijaya, Holy Lovenia, Fajri Koto, Dea Adhista, Emmanuel Dave, Sarah Oktavianti, Salsabil Akbar, Jhonson Lee, Nuur Shadieq, Tjeng Wawan Cenggoro, hanung linuwih, Bryan Wilie, Galih Muridan, Genta Winata, David Moeljadi, Alham Fikri Aji, Ayu Purwarianti and Pascale Fung
NusaWrites: Constructing High-Quality Corpora for Underrepresented and Extremely Low-Resource Languages
Gitanjali Kumari, Pranali Shinde and Asif Ekbal
The Persuasive Memescape: Understanding Effectiveness and Societal Implications of Internet Memes
Jisu Shin, Hoyun Song, Huije Lee, Fitsum Gaim and Jong Park
Generation of Korean Offensive Language by Leveraging Large Language Models via Prompt Design
Bryan Wilie, Yan Xu, Willy Chung, Samuel Cahyawijaya, Holy Lovenia and Pascale Fung
PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems
Xuan ZHANG and Wei Gao
Towards LLM-based Fact Verification on News Claims with a Hierarchical Step-by-Step Prompting Method
Wenyu Huang, Mirella Lapata, Pavlos Vougiouklis, Nikos Papasarantopoulos and Jeff Pan
Retrieval Augmented Generation with Rich Answer Encoding
Huiju Kim, Youjin Kang and SangKeun Lee
Examining Consistency of Visual Commonsense Reasoning based on Person Grounding
Chunkit Chan, Xin Liu, Tsz Ho CHAN, Jiayang Cheng, Yangqiu Song, Ginny Wong and Simon See
Self-Consistent Narrative Prompts on Abductive Natural Language Inference
Kisu Yang
KoBigBird-large: Transformation of Transformer for Korean Language Understanding
Levon Haroutunian, Zhuang Li, Lucian Galescu, Philip Cohen, Raj Tumuluri and Gholamreza Haffari
Reranking for Natural Language Generation from Logical Forms: A Study based on Large Language Models
Daryna Dementieva, Daniil Moskovskiy, David Dale and Alexander Panchenko
Exploring Methods for Cross-lingual Text Style Transfer: The Case of Text Detoxification
Md Tawkat Islam Khondaker, Muhammad Abdul-Mageed and Laks Lakshmanan, V.S.
PACT: Pretraining with Adversarial Contrastive Learning for Text Classification
Pramit Bhattacharyya, Joydeep Mondal, Subhadip Maji and Arnab Bhattacharya
VACASPATI: A Diverse Corpus of Bangla Literature
Main Conference: Short paper
Fei Wang, Kuan-Hao Huang, Kai-Wei Chang and Muhao Chen
Self-Augmentation Improves Zero-Shot Cross-Lingual Transfer
Tianhui Zhang, Danushka Bollegala and Bei Peng
Learning to Predict Concept Ordering for Common Sense Generation
Matteo Gabburo, Siddhant Garg, Rik Koncel-Kedziorski and Alessandro Moschitti
SQUARE: Automatic Question Answering Evaluation using Multiple Positive and Negative References
Masahiro Kaneko, Danushka Bollegala and Naoaki Okazaki
The Impact of Debiasing on the Performance of Language Models in Downstream Tasks is Underestimated
Ming-Xuan Shi, Chung-Chi Chen, Hen-Hsen Huang and Hsin-Hsi Chen
Enhancing Volatility Forecasting in Financial Markets: A General Numeral Attachment Dataset for Understanding Earnings Calls
Pratik Saini, Tapas Nayak and Indrajit Bhattacharya
Do the Benefits of Joint Models for Relation Extraction Extend to Document-level Tasks?
Ana Ezquerro, Carlos Gómez-Rodríguez and David Vilares
On the Challenges of Fully Incremental Neural Dependency Parsing
Yukun Huang, Kun Qian and Zhou Yu
Learning a Better Initialization for Soft Prompts via Meta-Learning
Hiroki Nomoto
Issues Surrounding the Use of ChatGPT in Similar Languages: The Case of Malay and Indonesian
Jyotsana Khatri, Vivek Srivastava and Lovekesh Vig
Can You Translate for Me? Code-Switched Machine Translation with Large Language Models
Genta Winata, Lingjue Xie, Karthik Radhakrishnan, Yifan Gao and Daniel Preotiuc-Pietro
Efficient Zero-Shot Cross-lingual Inference via Retrieval
Vyas Raina and Mark Gales
Minimum Bayes’ Risk Decoding for System Combination of Grammatical Error Correction Systems
Sagi Shaier, Lawrence Hunter and Katharina Kann
Who Are All The Stochastic Parrots Imitating? They Should Tell Us!
Yilun Zhu, Siyao Peng, Sameer Pradhan and Amir Zeldes
Incorporating Singletons and Mention-based Features in Coreference Resolution via Multi-task Learning for Better Generalization
Jiangshu Du, Congying Xia, Wenpeng Yin, Tingting Liang and Philip Yu
All Labels Together: Low-shot Intent Detection with an Efficient Label Semantic Encoding Paradigm
Farhad Moghimifar, Fatemeh Shiri, Van Nguyen, Yuan-Fang Li and Gholamreza Haffari
Theia: Weakly Supervised Multimodal Event Extraction from Incomplete Data
Rohit Jain, Huda Khayrallah, Roman Grundkiewicz and Marcin Junczys-Dowmunt
Perplexity-Driven Case Encoding Needs Augmentation for CAPITALIZATION Robustness
Nengzheng Jin, Dongfang Li, Junying Chen, Joanna Siebert and Qingcai Chen
Enhancing Open-Domain Table Question Answering via Syntax- and Structure-aware Dense Retrieval
Marzia Nouri, Mahsa Amani, Reihaneh Zohrabi and Ehsaneddin Asgari
The Language Model, Resources, and Computational Pipelines for the Under-Resourced Iranian Turkic
Reihaneh Zohrabi, Mostafa Masumi, Omid Ghahroodi, Parham AbedAzad, Hamid Beigy, Mohammad Hossein Rohban and Ehsaneddin Asgari
Borderless Azerbaijani Processing: Linguistic Resources and a Transformer-based Approach for Azerbaijani Transliteration
Yulong Wu, Viktor Schlegel and Riza Batista-Navarro
Are Machine Reading Comprehension Systems Robust to Context Paraphrasing?
Biaoyan Fang, Trevor Cohn, Timothy Baldwin and Lea Frermann
It’s not only What You Say, It’s also Who It’s Said to: Counterfactual Analysis of Interactive Behavior in the Courtroom
Findings
Wenting Zhao, Ye Liu, Yao Wan, Yibo Wang, Zhongfen Deng and Philip S. Yu
Localize, Retrieve and Fuse: A Generalized Framework for Free-Form Question Answering over Tables
Yibo Wang, Wenting Zhao, Yao Wan, Zhongfen Deng and Philip Yu
Named Entity Recognition via Machine Reading Comprehension: A Multi-Task Learning Approach
Imtiaz Karim, Kazi Samin Mubasshir, Mirza Masfiqur Rahman and Elisa Bertino
SPEC5G: A Dataset for 5G Cellular Network Protocol Analysis
Minseok Choi, Hyesu Lim and Jaegul Choo
PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration
Tiezheng Yu, Ziwei Ji and Pascale Fung
Improving Query-Focused Meeting Summarization with Query-Relevant Knowledge
Jimin Hong, ChaeHun Park and Jaegul Choo
Learning to Diversify Neural Text Generation via Degenerative Model
Danushka Bollegala, Shuichi Otake, Tomoya Machide and Ken-ichi Kawarabayashi
A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings
Kasturi Bhattacharjee, Kathleen McKeown and Rashmi Gangadharaiah
PhraseSumm: Abstractive Short Phrase Summarization
Haonan Li, Martin Tomko and Timothy Baldwin
Location Aware Modular Biencoder for Tourism Question Answering
Teven Le Scao and Claire Gardent
Joint Representations of Text and Knowledge Graphs for Retrieval and Evaluation
Haopeng Zhang, Sangwoo Cho, Kaiqiang Song, Xiaoyang Wang, Hongwei Wang, Jiawei Zhang and Dong Yu
Unsupervised Multi-document Summarization with Holistic Inference
Irina Nikishina, Polina Chernomorchenko, Anastasiia Demidova, Alexander Panchenko and Chris Biemann
Predicting Terms in IS-A Relations with Pre-trained Transformers
Zhaomin Xiao, Yan Huang and Eduardo Blanco
Context Helps Determine Spatial Knowledge from Tweets
Chenyang Huang, Fei Huang, Zaixiang Zheng, Osmar Zaïane, Hao Zhou and Lili Mou
Multilingual Non-Autoregressive Machine Translation without Knowledge Distillation
Fumiyo Fukumoto and Shou Asakawa
Knowledge Injection with Perturbation-based Constrained Attention Network for Word Sense Disambiguation
Pierre Colombo, Maxime Peyrard, Nathan Noiry, Robert West and Pablo Piantanida
The Glass Ceiling of Automatic Evaluation in Natural Language Generation
Pierre Colombo, Nathan Noiry, Guillaume Staerman and Pablo Piantanida
A Novel Information Theoretic Objective to Disentangle Representations for Fair Classification
Hayastan Avetisyan and David Broneske
Large Language Models and Low-Resource Languages: An Examination of Armenian NLP
Xiang Li, Fangyu Lei, Shizhu He, Kang Liu and Jun Zhao
Multi-Target Semantic Parsing with Collaborative Deliberation Network
Xun Yao, Junlong Ma, Xinrong Hu, Jie Yang and Yuan-Fang Li
Improving Machine Reading Comprehension through A Simple Masked-Training Scheme
Yunhao Zhang, Chong Li, Xiaohan Zhang, Xinyi Dong and Shaonan Wang
A Comprehensive Neural and Behavioral Task Taxonomy Method for Transfer Learning in NLP
Tanmay Chavan, Omkar Gokhale, Aditya Kane, Shantanu Patankar and Raviraj Joshi
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks
Dheeraj Rajagopal, Vivek Khetan, Bogdan Sacaleanu, Anatole Gershman, Andrew E. Fano Fano and Eduard Hovy
Template Filling for Controllable Commonsense Reasoning
Guy Yanko, Shahaf Pariente and Kfir Bar
Temporal Relation Classification in Hebrew
Vinayshekhar Bannihatti Kumar, Rashmi Gangadharaiah and Dan Roth
Privacy Adhering Machine Un-learning in NLP
Atakan Kara, Farrin Marouf Sofian, Andrew Bond and Gözde Şahin
GECTurk: Grammatical Error Correction and Detection Dataset for Turkish
Nikhil Mehta and Dan Goldwasser
Interactively Learning Social Media Representations Improves News Source Factuality Detection
Ritwik Mishra, Simranjeet Singh, Rajiv Ratn Shah, Ponnurangam Kumaraguru and Pushpak Bhattacharyya
IndIE: A Multilingual Open Information Extraction Tool For Indic Languages
Adian Liusie, Potsawee Manakul and Mark Gales
Mitigating Word Bias in Zero-shot Prompt-based Classifiers
Mauajama Firdaus, Priyanshu Priya and Asif Ekbal
Mixing It Up: Inducing Empathy and Politeness using Multiple Behaviour-aware Generators for Conversational Systems
Kevin Lin, Patrick Xia and Hao Fang
Few-Shot Adaptation for Parsing Contextual Utterances with LLMs
Yi Chen, Rui Wang, Haiyun Jiang, Shuming Shi and Ruifeng Xu
Exploring the Use of Large Language Models for Reference-Free Text Quality Evaluation: An Empirical Study
Tharindu Ranasinghe and Marcos Zampieri
A Text-to-Text Model for Multilingual Offensive Language Identification
Yasuhide Miura and Takumi Takahashi
Few-shot Named Entity Recognition with Supported and Dependent Label Representations
Shakila Mahjabin Tonni and Mark Dras
What Learned Representations and Influence Functions Can Tell Us About Adversarial Examples
Giorgio Barnabò, Antonio Uva, Sandro Pollastrini, Chiara Rubagotti and Davide Bernardi
Supervised Clustering Loss for Clustering-Friendly Sentence Embeddings: an Application to Intent Clustering
Yang Zhong and Diane Litman
STRONG — Structure Controllable Legal Opinion Summary Generation