Materials for a one-day workshop on LLM data collection
This repository contains the code and slides for our workshop on data collection and inference with Large Language Models. The materials on this page are CC-BY-4.0 licensed.
More information can be found on the website here.

R or python programming knowledge is desired but not required.langchain, in R we will use ellmer to interact with LLMs.| Time | Title | Resource |
|---|---|---|
| 09:30 | LLM fundamentals for Social Sciences | |
| 11:00 | Coffee break | Coffee is provided! |
| 11:20 | Data collection/annotation with LLMs | python, R |
| 12:30 | Break | Lunch is provided! |
| 13:15 | Inference with LLM annotations | python, R |
| 14:30 | Conclusion & Q&A |
Methods and software for inference with measurement error correction: sodascience/social_science_inferences_with_llms.
This project is developed and maintained by the ODISSEI Social Data Science (SoDa) team.

Do you have questions, suggestions, or remarks? File an issue or feel free to contact Qixiang Fang or Erik-Jan van Kesteren.