Bogdan Paul Chiș
ERASMUS Intern Spring 2026, 20 March - 21 May (62 days)
Topic:
(multilingual) Triplet extraction from medical data
Goal:
- Construct a knowledge graph from medical package inserts in multiple languages.
- Utilize the graph in an intelligent agent that recommends medication.
Tasks:
- Learn about intelligent agents and generative models: OpenAI API, agent frameworks, RAG systems.
- Learn about knowledge graphs and GraphRAG. Read several research papers.
- Find several existing drug knowledge databases. Identify possible entities and relations.
- Prepare a Python-based workflow and keep it in a Git repository.
- Try LightRAG (simple RAG).
- Scrape package inserts and parse the data.
- Index the data and write a script that extracts a knowledge graph from it.
- Visualize the graph.
- Prepare an agent that uses both the unstructured data and the graph data.
- Evaluate the agent using DeepEval or RAGAS.
- Write a report.
- Push all code to the Git repository.
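
Example:
The graph-extraction and agent tasks above depend on a triplet representation. The following is only a minimal sketch of one possible shape for it, not the project's actual design. The names Triplet, build_graph and find_subjects, the relation labels, and the DrugA/DrugB entries are all hypothetical placeholders and are not taken from any package insert.

```python
from collections import defaultdict
from typing import NamedTuple


class Triplet(NamedTuple):
    """One (subject, relation, object) statement, e.g. extracted from an insert."""
    subject: str
    relation: str
    obj: str


def build_graph(triplets):
    """Group triplets into an adjacency map: subject -> relation -> set of objects."""
    graph = defaultdict(lambda: defaultdict(set))
    for t in triplets:
        graph[t.subject][t.relation].add(t.obj)
    return graph


def find_subjects(graph, relation, obj):
    """Return, in sorted order, every subject linked to `obj` by `relation`."""
    return sorted(
        subject
        for subject, relations in graph.items()
        if obj in relations.get(relation, ())
    )


# Hypothetical placeholder triplets (assumption: not real medical data).
EXAMPLE_TRIPLETS = [
    Triplet("DrugA", "treats", "pain"),
    Triplet("DrugA", "has_side_effect", "nausea"),
    Triplet("DrugB", "treats", "pain"),
    Triplet("DrugB", "treats", "fever"),
]

graph = build_graph(EXAMPLE_TRIPLETS)
candidates = find_subjects(graph, "treats", "pain")
```

Here candidates holds the placeholder drugs linked to "pain" by the "treats" relation. A real workflow would presumably replace EXAMPLE_TRIPLETS with the output of the extraction script and might use a graph library or graph database instead of nested dictionaries.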