Events
Reading Group
CINEMETRIC: A Framework for Multi-Perspective Evaluation of Conversational Agents using Human-AI Collaboration
Vahid Sadiri Javadi
2025-09-10
Optimising your training data using model-led iterative confidence-based sample selection
Frederik Labonte
2025-07-09
Superintelligent Agents Pose Catastrophic Risks: Can Scientist AI Offer a Safer Path?
Akbar Karimi
2025-06-18
Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?
David Kaczér
2025-05-07
ARITHMETIC WITHOUT ALGORITHMS: LANGUAGE MODELS SOLVE MATH WITH A BAG OF HEURISTICS
Akbar Karimi
2025-04-23
On Calibration of Speech Classification Models: Insights from Energy-Based Model Investigations
Lea Fischbach
2025-04-16
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching
Frederik Labonte
2025-04-02
How do Humans and Language Models Reason About Creativity? A Comparative Analysis
Wei-Fan Chen
2025-03-05
Assessing Social Alignment: Do Personality-Prompted Large Language Models Behave Like Humans?
Christian Nickel
2024-12-11
Are Large Language Models Capable of Generating Human-Level Narratives?
Vahid Sadiri Javadi
2024-12-04
Writing in the Margins: Better Inference Pattern for Long Context Retrieval
Frederik Labonte
2024-09-11
Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP
Akbar Karimi
2024-07-10
Diffusion-NAT: Self-Prompting Discrete Diffusion for Non-Autoregressive Text Generation
Wei-Fan Chen
2024-07-03
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Allison Lahnala
2024-05-29
Can Large Language Models Provide Useful Feedback on Research Papers? A Large-Scale Empirical Analysis
Mounika Marredy
2024-05-22
Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs
Shaina Ashraf
2024-04-03
Evaluating Large Language Models as Generative User Simulators for Conversational Recommendation
Vahid Sadiri Javadi
2024-03-27
Sensitivity, Performance, Robustness: Deconstructing the Effect of Sociodemographic Prompting
Charlie Welch
2024-03-20
Promptbreeder: Self-Referential Self-Improvement Via Prompt Evolution
Vahid Sadiri Javadi
2024-01-17
Persona-Guided Planning for Controlling the Protagonist’s Persona in Story Generation
Charlie Welch
2024-01-10
Large Language Models of Code Fail at Completing Code with Potential Bugs
Mounika Marredy
2023-12-20