CAISA Lab

LARGE LANGUAGE MODELS

Winter Semester 2023 – 2024

Updates !!!

24.10.2023: The first class starts on Tuesday, 24.10.2023 at 10:00 AM. See the Doc.

Logistics

  • Seminars: Tuesdays, 10:00 AM - 11:30 AM, in B-IT 2.113 (Friedrich-Hirzebruch-Allee 6). The Zoom link is posted on eCampus.

  • Course Materials: Uploaded to eCampus every week.

  • Contact: Students should ask all course-related questions in our discussion forum on eCampus. For external inquiries, emergencies, or personal matters, you can contact Prof. Flek or Vahid.

  • Office Hours: Please email us first to arrange an in-person meeting.

    • Prof. Dr. Lucie Flek: Friedrich-Hirzebruch-Allee 6 (B-IT) – Room: 2.123
    • Vahid Sadiri Javadi: Friedrich-Hirzebruch-Allee 6 (B-IT) – Room: 2.120

Content

What is the Large Language Models seminar about?

Large Language Models (LLMs), such as GPT-3, BERT, and their successors, have had an enormous impact on various domains, including natural language processing, machine learning, and artificial intelligence. These models have redefined what’s possible in applications such as text generation, translation, summarization, sentiment analysis, and more. The aim of this seminar is to explore the cutting-edge research, insights, and trends in the field of LLMs.

Seminar Work

Presentation:

  • A group of 2-3 people presents every week on a selected topic:
    • You summarize a paper or a set of papers in a presentation
    • You showcase your point with a model API or web interface
    • You prepare a short hands-on session for the group as part of your presentation (can others fool / hack / break / improve the LLMs in the aspect you discuss?)

Final Assignments:

  • To complete the course, you need to finish a final assignment:
    • In addition to the group presentation, you need to create an evaluation dataset on a challenging LLM problem (e.g. commonsense reasoning, perspective taking, cross-lingual QA, or stereotype bias)
    • Instead of writing a 3000-word essay, you “write” a dataset of ca. 3000 words (ca. 300 sentences) and evaluate 3 open LLMs on it
    • The dataset cannot be LLM-generated
    • More people can work on the same topic to create a larger dataset
    • The language(s) of the data can be freely chosen
    • The creation process and the author’s background need to be documented (see 1 and 2)
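
As a rough illustration of the evaluation step, here is a minimal sketch in Python. It assumes a dataset of handwritten (prompt, expected answer) pairs and scores each model by exact-match accuracy. This metric is only one possible choice and is not prescribed by the course. The three "models" are hypothetical stand-in functions so the example runs without downloads; in practice each would wrap one of the open LLMs you evaluate.

```python
from typing import Callable, Dict, List, Tuple

# One handwritten example: (prompt, expected answer). Illustrative only.
Example = Tuple[str, str]


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial differences are ignored."""
    return " ".join(text.lower().split())


def evaluate(
    dataset: List[Example],
    models: Dict[str, Callable[[str], str]],
) -> Dict[str, float]:
    """Return the exact-match accuracy of each model on the dataset."""
    if not dataset:
        raise ValueError("dataset is empty")
    scores: Dict[str, float] = {}
    for name, generate in models.items():
        correct = sum(
            normalize(generate(prompt)) == normalize(expected)
            for prompt, expected in dataset
        )
        scores[name] = correct / len(dataset)
    return scores


# Hypothetical stand-ins for three open LLMs (assumption: each model is
# exposed as a function that maps a prompt string to an answer string).
def always_yes(prompt: str) -> str:
    return "Yes"


def always_no(prompt: str) -> str:
    return "no"


def echo(prompt: str) -> str:
    return prompt


# Toy dataset written by hand; a real submission would have ca. 300 items.
toy_dataset: List[Example] = [
    ("Can a fish climb a tree? Answer yes or no.", "no"),
    ("Is ice colder than steam? Answer yes or no.", "yes"),
]

results = evaluate(
    toy_dataset,
    {"always_yes": always_yes, "always_no": always_no, "echo": echo},
)
for model_name, accuracy in results.items():
    print(f"{model_name}: {accuracy:.2f}")
```

Exact match is strict and works best for short, closed-form answers. For free-text answers you would likely need a different scoring scheme, such as manual annotation.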

Deadlines:

  • Reserve your presentation slot by: 31.10.2023
  • Register your assignment plan by: 12.12.2023
  • Hand in your assignment by: 30.01.2024

Submission:

  • Presentations and Final Assignments should be submitted via eCampus. Further instructions will be announced soon. Please do not email us your assignments.

Allocation:

  • Master in Media Informatics: 4 ECTS credits

Literature

Schedule

Week | Date | Description | Resources | Presenter
Week 0 | Tue Oct 24 | Organization & Outline | See Doc | Lucie Flek
Week 1 | Tue Oct 31 | Introduction to LLMs - What exactly is generative AI? - How do LLMs work? [3] - Differences in LLM architectures - Main contributions of LLMs to the field (Why/how did this happen?) - Open challenges (What still doesn’t work and what LLMs are not made for) | [4, 5, 6] | Lucie Flek
Week 2 | Tue Nov 7 | Do LLMs work? - LLM bias issues - Robustness - Alignment - Evaluation | [7, 8, 9, 10, 11] |
Week 3 | Tue Nov 14 | LLMs and hallucination - Societal impact of LLMs | [12, 13, 14, 15, 16] |
Week 4 | Tue Nov 21 | Knowledge-grounding of LLMs - Hybrid models | [17, 18, 19, 20] |
Week 5 | Tue Nov 28 | LLMs and complex reasoning - Chain of Thought approaches | [21, 22, 23, 24, 25, 26] |
Week 6 | Tue Dec 5 | LLMs and efficiency - Distillation methods - Training LLMs | [27, 28, 29, 30, 31.1, 31.2] |
Week 7 | Tue Dec 12 | LLMs and social commonsense - Theory of Mind - “Personality” of LLMs | [32, 33, 34, 35, 36] |
Week 8 | Tue Dec 19 | LLMs, moral decisions, and ethics | [37, 38, 39] |
Week 9 | Tue Jan 9 | Prompt-tuning strategies - In-context learning | [40, 41, 42, 43, 44, 45, 46] |
Week 10 | Tue Jan 16 | Multilingual LLMs - Cross-lingual scaling - Cross-cultural scaling | [47, 48] |
Week 11 | Tue Jan 23 | Applications - LLMs in medicine - LLMs in psychology - LLMs in business | [49, 50] |
Week 13 | Tue Jan 30 | Open topics - Wrap-up - Discussion: LLMs and education | |