Claire Barale

Hi! I’m Claire. I am a PhD candidate in NLP at the University of Edinburgh, where I work with Michael Rovatsos, Pasquale Minervini, and Nehal Bhuta. I am also affiliated with the Centre for Technomoral Futures. I’ll be finishing my PhD soon, and am on the job market for postdoc/research scientists positions. Feel free to reach out if my profile seems like a good fit!

In 2023, I was awarded a 3 years Bloomberg PhD Fellowship. In 2024, I completed a Research Scientist Internship at Bloomberg NYC, where I worked on language modeling for structured reasoning tasks, including reasoning on implicit events.

I am interested in integrating knowledge in language models, adapting them to specific domains, and understanding and enhancing their reasoning capabilities when faced with new scenarios. Lately, I have been working on temporal and numerical reasoning, for example, ordering a sequence of events from an input text.

My PhD research focuses on legal NLP, utilizing the framework provided by legal texts, norms, and reasoning to explore knowledge integration and the capabilities of LLMs. Over the past three years, my research has taken several directions:

(1) Information Extraction and Content Selection: Extracting relevant information from large unstructured datasets of legal texts to improve legal search and selecting salient legal content from lengthy documents to improve the quality of downstream tasks such as summarization.

(2) Evaluation of LLMs: Evaluate the capabilities of LLMs in retrieving information and reasoning to understand the types of signals they learn and retain (whether they are syntactic, semantic, or specific domain knowledge) and the impact of grammar/linguistic structure on models’ performance.

(3) Data Representation: Recently and building on findings from (2), I have been exploring various methods for representing data to enhance specific reasoning capabilities of LLMs, and align them to specific domains.

I graduated with a master’s of research in computer science from Paris Dauphine and École des mines de Paris (2021), during which I was a research assistant advised by Alexis Tsoukias in the LAMSADE lab, working on fairness in recommender systems.

Before all that, I studied Economics at Paris Dauphine University and worked as a financial analyst in NYC. I was born and raised in Nice, France.


	Google Scholar \| Twitter \| Linkedin \| Email: claire.barale (at) ed.ac.uk

News

August 2025: Our paper LexTime: A Benchmark for Temporal Ordering of Legal Events, has been accepted at EMNLP Findings. We will also present it at the XTempLLMs workshop hosted at COLM.
May 2025: Our paper on Faithfulness and Content Selection in Summarisation has been accepted at ICAIL
January 2025: Are we done with MMLU?, our paper presenting MMLU-Redux will be at NAACL 2025
October 2024: Our paper Information Extraction for Planning Court Cases was accepted to NLLP 2024 (hosted at EMNLP)
April 2024: Consider attending/submitting to BAIPsy 2024, a student workshop that aims to faciliate discussion and exchange between Artificial Intelligence and Psychology researchers.
From May to August 2024, I will be interning at Bloomberg (CTO office) in New York, focusing on information extraction in the legal domain.
November 27th 2023, NYC, I will be co-organizing a workshop on NLPxFinance at the 4th ACM Conference on AI in Finance - NLP and Network Analysis in Financial Applications. Deadline for submission coming soon, November 10th!
October 2023, 2 papers accepted at the NLLP workshop, EMNLP 2023! I will present them on December, 7th in Singapore
October 2023, I will present my work at the Women in HPC workshop hosted at SC2023, Denver, CO
July 2023, I’m very happy to announce that I received the Bloomberg Data Science Ph.D. Fellowship
July 2023, Our paper presenting a new information extraction pipeline for legal documents was published in ACL Findings
June 2023, I received the Best Doctoral Consortium Paper Award at ICAIL 2023

Research and Publications

LexTime: A Benchmark for Temporal Ordering of Legal Events
Claire Barale, Leslie Barrett, Vikram Sunil Bajaj, Michael Rovatsos
EMNLP Findings 2025 | Preprint

When Fairness Isn’t Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning
Claire Barale, Michael Rovatsos, Nehal Bhuta
Preprint

Faithfulness and Content Selection in Long-Input Multi-Document Summarisation of U.S. Civil Rights Litigation
Isabel Sebire, Claire Barale, Mirella Lapata
ICAIL 2025 (ACM International Conference on AI and Law)

Are We Done with MMLU?
Aryo Pradipta Gema, Joshua Ong Jun Leang, Giwon Hong, Alessio Devoto, Alberto Carlo Maria Mancino, Rohit Saxena, Xuanli He, Yu Zhao, Xiaotang Du, Mohammad Reza Ghasemi Madani, Claire Barale, Robert McHardy, Joshua Harris, Jean Kaddour, Emile van Krieken, Pasquale Minervini
NAACL 2025 | paper

Information Extraction for Planning Court Cases
Drish Mali, Rubash Mali, Claire Barale
Proceedings of the Natural Legal Language Processing Workshop (NLLP) at EMNLP 2024 | paper

Do Language Models Learn about Legal Entity Types during Pretraining?
Claire Barale, Michael Rovatsos, Nehal Bhuta
Proceedings of the Natural Legal Language Processing Workshop (NLLP) at EMNLP 2023 | paper – slides – poster

AsyLex: A Dataset for Legal Language Processing of Refugee Claims
Claire Barale, Mark Klaisoongnoen, Pasquale Minervini, Michael Rovatsos and Nehal Bhuta
Proceedings of the Natural Legal Language Processing Workshop (NLLP) at EMNLP 2023 | paper – slides – poster

Automated Refugee Case Analysis: An NLP Pipeline for Supporting Legal Practitioners
Claire Barale, Michael Rovatsos, and Nehal Bhuta
ACL Findings 2023 | paper

fAsyLex: Accelerating Legal NLP through Comparative Analysis of Multi-GPU Approaches
Claire Barale
Women in High Performance Computing Workshop (WHPC) at SC2023 | slides

Empowering Refugee Claimants and their Lawyers: Using Machine Learning to Examine Decision-Making in Refugee Law
Claire Barale
International Conference on Artificial Intelligence and Law (ICAIL) 2023, Doctoral Consortium, Best Paper Award | paper

Human-Centered Computing in Legal NLP - An Application to Refugee Status Determination
Claire Barale
Second Workshop on Bridging Human–Computer Interaction and Natural Language Processing at NAACL 2022 | paper

Refugee status determination: how cooperation with machine learning tools can lead to more justice
Claire Barale
Scottish Law and Innovation Network (SCOTLIN) Early Career Scholars Symposium 2022 | paper

What is fair data manipulation?
Alexis Tsoukias, Claire Barale
European Conference on Operational Research, 2021 | presentation

Explanations in decision support – Generating Fairness through explanations
Claire Barale
PSL Université Paris (Dauphine, Ecole des Mines). Masters of Research Dissertation, 2021

Blog Post

Dictionnary Series: What do we mean when we talk about Natural Language Processing (NLP)?
Claire Barale
Data for Children Collaborative, Edinburgh Futures Institute | post