core idea extraction
a system for extracting the core idea from any piece of content (text, image, audio, video) and generalizing it: stripping away the surface framing to find the underlying concept. the motivation is that the same insight often appears in many different forms, and most people encounter it several times in different guises without recognizing the common thread. an extraction system would find that thread and make it explicit.
the output would be something like: "the core idea in this paper / this conversation / this image is [X], which is an instance of [broader principle], related to [other ideas]." this requires understanding not just what the content is about but what conceptual structure it exemplifies, which is much harder than summarization. the interesting research question is whether LLMs can actually perform this kind of abstraction across modalities, or whether they just rephrase surface content.
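one way to pin down that output shape is as a small record with three slots: the idea, the broader principle, and the related ideas. the sketch below is only an illustration of the template, not a design for the extraction system itself. the class name `CoreIdea`, its field names, and the `render` method are all hypothetical, and the example values are made up for demonstration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CoreIdea:
    """Hypothetical container mirroring the output template in this note."""
    source: str       # what was analyzed, e.g. "this paper" or "this image"
    idea: str         # the [X] slot: the core idea itself
    principle: str    # the [broader principle] slot
    related: List[str] = field(default_factory=list)  # the [other ideas] slot

    def render(self) -> str:
        """Fill the sentence template; omit the 'related to' clause if empty."""
        text = (
            f"the core idea in {self.source} is {self.idea}, "
            f"which is an instance of {self.principle}"
        )
        if self.related:
            text += ", related to " + ", ".join(self.related)
        return text + "."


# made-up example values, for illustration only
example = CoreIdea(
    source="this conversation",
    idea="past spending should not drive the next decision",
    principle="deciding on marginal value",
    related=["opportunity cost", "loss aversion"],
)
print(example.render())
```

keeping the three slots as separate fields (rather than one free-text string) would make it possible to compare extractions of the same idea from different sources, which is where the "common thread" would show up.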
related: cultural translator, idea tester, dense info generator