Research in natural language understanding and textual inference has advanced considerably in recent years, resulting in powerful models that are able to read and understand texts, even outperforming humans in some cases. However, it remains challenging to answer questions that go beyond the texts themselves, requiring the use of additional commonsense knowledge. Previous work has explored using both explicit representations of background knowledge (e.g., ConceptNet or NELL), and latent representations that capture some aspects of commonsense (e.g., OpenAI GPT). These and any other methods for representing and using commonsense in NLP are of interest to this workshop.
The COIN workshop aims to bring together researchers interested in modeling commonsense knowledge, developing computational models thereof, and applying commonsense inference methods in NLP tasks. We are interested in all types of commonsense knowledge representation, and we explicitly encourage work that makes use of knowledge bases, as well as approaches developed to mine or learn commonsense from other sources. The workshop is also open to evaluation proposals that explore new ways of evaluating methods of commonsense inference, going beyond established natural language processing tasks.
The workshop will also include two shared tasks on commonsense machine reading comprehension in English, one based on everyday scenarios and one based on news events. See Shared Tasks for more details.
If you are participating, or interested in participating, in COIN, we welcome you to join the COIN mailing list on Google Groups: follow the link and click "Join Group".
Despite considerable advances in deep learning, AI remains narrow and brittle. One fundamental limitation is its lack of commonsense intelligence: reasoning about everyday situations and events, which, in turn, requires knowledge about how the physical and social world works. In this talk, I will share some of our recent efforts to crack commonsense intelligence.
First, I will introduce ATOMIC, the atlas of everyday commonsense knowledge and reasoning, organized as a graph of 877k if-then rules (e.g., “if X pays Y a compliment, then Y will likely return the compliment”). Next, I will introduce COMET, our deep neural networks that learn from and generalize beyond the ATOMIC commonsense graph. Finally, I will present RAINBOW, a collection of seven benchmarks covering a wide spectrum of commonsense intelligence, from natural language inference to abductive reasoning to visual commonsense reasoning. I will conclude the talk by discussing major open research questions, including the importance of algorithmic solutions that reduce incidental biases in data, which can otherwise lead to overestimation of true AI capabilities.
9:00 | Opening
9:10 | Invited talk: Commonsense Intelligence---Cracking the Longstanding Challenge in AI [PDF] | Yejin Choi
10:10 | Understanding Commonsense Inference Aptitude of Deep Contextual Representations | Jeff Da and Jungo Kasai
10:30 | Coffee break
11:00 | A Hybrid Neural Network Model for Commonsense Reasoning | Pengcheng He, Xiaodong Liu, Weizhu Chen, Jianfeng Gao
11:20 | Towards Generalizable Neuro-Symbolic Systems for Commonsense Question Answering | Kaixin Ma, Jonathan Francis, Quanyang Lu, Eric Nyberg, Alessandro Oltramari
11:40 | When Choosing Plausible Alternatives, Clever Hans can be Clever | Pride Kavumba, Naoya Inoue, Benjamin Heinzerling, Keshav Singh, Paul Reisert, Kentaro Inui
12:00 | Commonsense about Human Senses: Labeled Data Collection Processes | Ndapa Nakashole
12:20 | Lunch break
14:00 | Invited talk: Learning to Reason: from Question Answering to Problem Solving [PDF] | Michael Witbrock
15:00 | Extracting Common Inference Patterns from Semi-Structured Explanations | Sebastian Thiem and Peter Jansen
15:20 | Poster session & coffee break
Posters:
Commonsense Inference in Natural Language Processing (COIN) - Shared Task Report | Simon Ostermann, Sheng Zhang, Michael Roth, Peter Clark
KARNA at COIN Shared Task 1: Bidirectional Encoder Representations from Transformers with relational knowledge for machine comprehension with common sense | Yash Jain and Chinmay Singh
IIT-KGP at COIN 2019: Using pre-trained Language Models for modeling Machine Comprehension | Prakhar Sharma and Sumegh Roychowdhury
Jeff Da at COIN - Shared Task | Jeff Da
Pingan Smart Health and SJTU at COIN - Shared Task: utilizing Pre-trained Language Models and Common-sense Knowledge in Machine Reading Tasks | Xiepeng Li, Zhexi Zhang, Wei Zhu, Zheng Li, Yuan Ni, Peng Gao, Junchi Yan, Guotong Xie
BLCU-NLP at COIN-Shared Task1: Stagewise Fine-tuning BERT for Commonsense Inference in Everyday Narrations | Chunhua Liu and Dong Yu
16:20 | Commonsense inference in human-robot communication | Aliaksandr Huminski, Yan Bin Ng, Kenneth Kwok, Francis Bond
16:40 | Diversity-aware Event Prediction based on a Conditional Variational Autoencoder with Reconstruction | Hirokazu Kiyomaru, Kazumasa Omura, Yugo Murawaki, Daisuke Kawahara, Sadao Kurohashi
17:00 | Can a Gorilla Ride a Camel? Learning Semantic Plausibility from Text | Ian Porada, Kaheer Suleman, Jackie Chi Kit Cheung
17:15 | How Pre-trained Word Representations Capture Commonsense Physical Comparisons | Pranav Goel, Shi Feng, Jordan Boyd-Graber
This workshop includes two shared tasks on English reading comprehension using commonsense knowledge. The first task is a multiple choice reading comprehension task on everyday narrations. The second task is a cloze task on news texts.
In contrast to other machine comprehension tasks and workshops, our focus will be on the inferences over commonsense knowledge about events and participants that are required for text understanding. Participants are encouraged to use any external resources that could improve their systems. Below we give a list of external resources that we expect to be helpful for the tasks.
Submissions to either shared task will be added to the development data leaderboard. The test data for both tasks will not be made public; instead, you will submit your models so that we can run them on the test data. During the evaluation phase (the first three weeks of June), your submissions will count towards the final ranking on the test data. The final leaderboard will be made public only after the evaluation phase ends.
The development set leaderboard will be updated approximately once a week with all current submissions.
If you want to participate or have any questions, please join the Google Group for participants. We will post updates there and answer questions about the shared tasks and the workshop.