Sycophancy Logo Sycophancy Logo
SYCOPHANCY
Shared Task on Detecting LLM Sycophancy

Same LLM · Same Case · Opposite Answers

Chat A SESS 01
Chat B SESS 02
The model agreed with both — classic sycophancy.
FIRE 2026 · TRACK

EXPLAINABLE AI IN LEGAL REASONING

From Statute Prediction to Sycophancy Detection

"Law aspires to reason.
AI aspires to pattern.
This track asks if they can meet."

AI and Law have rapidly grown as an area of research worldwide, across the judiciary. Researchers across multiple legal systems, tasks, and languages have explored the applicability of LLMs to this field.

Nonetheless, the AI prediction/recommendation system is perhaps inconsequential unless the legal experts find it useful, and, more importantly, interpretable and reliable.

Our track provides a testbed for evaluating the efficacy of LLMs in generating trustworthy and robust solutions to important legal problems — where the stakes are not abstract benchmarks, but the integrity of legal reasoning itself.

Evaluation

Why It Matters

Legal AI must be both accurate and explainable. Our metrics measure not just what the model predicts, but why — grounded in the facts that matter.

* Metric weights are tentative and may be updated before test data release.

35%

Macro F1

Exact match on predicted section labels vs. gold standard — measures whether the model identifies the correct statutes.

25%

ROUGE-L

Longest common subsequence similarity between predicted and gold reasoning — evaluates the quality of explanation.

20%

BLEU

Sentence-level BLEU score between reasoning texts — captures lexical fluency of legal explanations.

10%

Recall@3

Whether gold section labels appear in the top-3 predictions — measures recall under uncertainty.

10%

Legal Semantic Score

Cosine similarity of reasoning embeddings from a legal-domain model — captures deep semantic understanding of legal reasoning.

Schedule

Timeline

15 May 2026
Track website opens, training data released
15 June 2026
Training data release (500 cases)
20 July 2026
Test data release (100 cases)
30 June 2026
Run submission deadline
15 July 2026
Track results declared
30 Aug 2026
Working notes due
30 Sep 2026
Camera-ready copies
Dec 2026
FIRE 2026 Conference
The Team

Organizers

Kripabandhu Ghosh
IISER Kolkata, India
Track Co-organizer
Liana Ermakova
Université de Bretagne Occidentale, France
Track Co-organizer
Shuvam Banerji Seal
IISER Kolkata, India
Track Co-organizer
Subinay Adhikary
IISER Kolkata, India
Track Co-organizer
Jaap Kamps
University of Amsterdam, Netherlands
Track Co-organizer
Join the Challenge

Register Now

Participate in the shared task at the intersection of AI and Law. Open to researchers, practitioners, and students worldwide.

Register on CodaBench

Registrations open 15 June 2026