[P] Looking for Thesis Ideas: Extracting Pipelines

Hi everyone,

I’m currently starting my Master’s thesis, and I’ll be working on the topic of automatically extracting pipelines from scientific research papers. The core idea is to extract how researchers actually process, model, and evaluate their data, by turning the “Methods” section into a structured, step-wise list/workflow.

Someone under the same advisor already did a project on this (using LLMs + GROBID + semantic retrieval), so I’m looking for ways to extend or approach the problem differently, ideally in a way that’s both research-worthy and practically useful.

One initial idea I’ve brainstormed include was – Extracting pipeline steps and Reconstructing directed graphs of those (not just flat lists).

Have any of you worked on something similar? Or seen cool papers/tools that deal with pipeline reconstruction, reproducibility, or scientific method understanding?

Would love to hear: – Ideas you think are worth exploring – Common pain points in method reporting that could be addressed

Thanks in advance!

submitted by /u/ieatasssss25
[link] [comments]

Leave a Reply