Meta-Learning Research System M.Sc. Thesis · In Progress

FastAPI Celery PostgreSQL Qdrant Redis MLflow Docker QLoRA Qwen3


The Problem

Researchers starting new AI/ML projects face a recurring challenge: given a new problem, which model architecture, hyperparameters, and training strategy should they use? The existing literature is vast and navigating it manually is slow and error-prone. There's no systematic way to extract and reuse the configurations that worked for similar problems.

The Approach

This system automates the process of extracting structured metadata from AI/ML research papers and recommending optimal model configurations for new problems. It combines several techniques into a cohesive pipeline:

Multi-LLM Annotation Pipeline

Multiple large language models annotate papers in parallel, extracting structured information about datasets, model architectures, hyperparameters, and reported results. Cross-validation between LLM outputs improves annotation quality.
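The cross-validation step can be sketched as a per-field majority vote. This is an illustrative stand-in, not the project's actual merge logic: each model's output is a flat dict of extracted fields, the model clients are omitted, and the agreement threshold is an assumed parameter.

```python
from collections import Counter

def consensus_annotation(annotations, threshold=0.5):
    """Merge per-model annotation dicts, keeping only fields where a
    strict majority of models agree on the same value."""
    merged = {}
    n = len(annotations)
    fields = {f for a in annotations for f in a}
    for field in fields:
        votes = Counter(a[field] for a in annotations if field in a)
        value, count = votes.most_common(1)[0]
        if count / n > threshold:  # strict majority across all models
            merged[field] = value
    return merged

# Example: three models annotate the same paper.
outputs = [
    {"dataset": "ImageNet", "optimizer": "AdamW", "lr": "3e-4"},
    {"dataset": "ImageNet", "optimizer": "AdamW", "lr": "1e-4"},
    {"dataset": "ImageNet", "optimizer": "SGD"},
]
print(consensus_annotation(outputs))
# dataset and optimizer reach consensus; the disputed lr is dropped
```

Fields the models disagree on are left for the human-in-the-loop validation UI rather than guessed.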

QLoRA Fine-Tuning

The system fine-tunes Qwen3-8B using QLoRA (via Unsloth) on the annotated corpus, creating a specialized model that understands the structure of ML research papers and can extract configurations with high accuracy.
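A typical QLoRA setup for this kind of run might look as follows. The values are illustrative defaults, not the thesis's actual hyperparameters; in practice such a config would be handed to Unsloth's `FastLanguageModel.from_pretrained` / `get_peft_model`, but it is kept as a plain dict here so the sketch stays dependency-free.

```python
# Assumed QLoRA configuration for Qwen3-8B (illustrative values only).
def qlora_config(rank=16, alpha=16, dropout=0.0):
    return {
        "model_name": "Qwen/Qwen3-8B",
        "load_in_4bit": True,            # 4-bit quantized base weights
        "max_seq_length": 4096,          # long enough for paper sections
        "r": rank,                       # LoRA rank
        "lora_alpha": alpha,             # LoRA scaling factor
        "lora_dropout": dropout,
        # attention + MLP projections, the usual QLoRA adapter targets
        "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj",
                           "gate_proj", "up_proj", "down_proj"],
    }

cfg = qlora_config()
print(cfg["r"], cfg["load_in_4bit"])
```

Because only the low-rank adapters are trained on top of a 4-bit base model, fine-tuning an 8B model fits on a single consumer GPU.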

Calibrated Active Learning

Rather than annotating every paper equally, the system identifies which papers would provide the most informational value and prioritizes them — reducing annotation costs while maximizing coverage.
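One common way to realize this, sketched below under the assumption of temperature-scaled uncertainty sampling (the thesis's exact acquisition function is not spelled out here): model confidences are calibrated, then papers are ranked by the entropy of the calibrated distribution, and the most uncertain ones are annotated first.

```python
import math

def calibrate(logits, temperature=2.0):
    """Temperature scaling: softmax(logits / T)."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def entropy(probs):
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_for_annotation(papers, budget=2, temperature=2.0):
    """papers: list of (paper_id, logits). Returns the `budget` paper ids
    with the highest calibrated predictive entropy."""
    scored = [(pid, entropy(calibrate(logits, temperature)))
              for pid, logits in papers]
    scored.sort(key=lambda x: x[1], reverse=True)
    return [pid for pid, _ in scored[:budget]]

pool = [
    ("paper-a", [4.0, 0.1, 0.1]),   # confident prediction -> low entropy
    ("paper-b", [1.0, 0.9, 1.1]),   # near-uniform -> high entropy
    ("paper-c", [2.0, 1.8, 0.2]),
]
print(select_for_annotation(pool))  # -> ['paper-b', 'paper-c']
```

Calibration matters here: uncalibrated LLM confidences are often overconfident, which would skew the ranking toward the wrong papers.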

Vector Search & Recommendation

Extracted configurations are stored in Qdrant as vector embeddings. Given a new problem description, the system performs semantic search to find the most relevant existing configurations and recommends starting points.
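The lookup reduces to nearest-neighbor search over embeddings. The sketch below replaces the real embedding model and Qdrant client with toy 3-d vectors and plain cosine similarity; the config names are hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def recommend(query_vec, stored, top_k=2):
    """stored: list of (config_name, embedding). Returns the top_k
    configuration names ranked by similarity to the query embedding."""
    ranked = sorted(stored, key=lambda s: cosine(query_vec, s[1]),
                    reverse=True)
    return [name for name, _ in ranked[:top_k]]

# Toy stand-ins for embedded configurations extracted from papers.
configs = [
    ("vit-base/adamw", [0.9, 0.1, 0.0]),
    ("resnet50/sgd",   [0.8, 0.3, 0.1]),
    ("bert/adamw",     [0.0, 0.2, 0.9]),
]
print(recommend([1.0, 0.0, 0.0], configs))
# -> ['vit-base/adamw', 'resnet50/sgd']
```

In the real pipeline, Qdrant performs this ranking server-side over high-dimensional embeddings, so the recommendation step stays fast as the corpus grows.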

Key Features

  • End-to-end pipeline — from raw PDF papers to actionable model recommendations
  • QLoRA fine-tuning with Qwen3-8B/Unsloth for efficient domain adaptation
  • Calibrated active learning to minimize annotation effort
  • Multi-LLM consensus for robust metadata extraction
  • Svelte validation UI for human-in-the-loop verification
  • Scalable infrastructure — FastAPI + Celery workers, PostgreSQL for metadata, Qdrant for vectors, Redis for task queuing, MLflow for experiment tracking

Architecture

```mermaid
graph LR
    A[PDF Papers] --> B[Multi-LLM Annotator]
    B --> C[Active Learning Selector]
    C --> D[QLoRA Fine-Tuning]
    D --> E[Metadata Extraction]
    E --> F[(PostgreSQL)]
    E --> G[(Qdrant Vectors)]
    H[New Problem] --> G
    G --> I[Configuration Recommendations]
```

Tech Stack

| Component           | Technology               |
|---------------------|--------------------------|
| API                 | FastAPI                  |
| Task Queue          | Celery + Redis           |
| Database            | PostgreSQL               |
| Vector Store        | Qdrant                   |
| Experiment Tracking | MLflow                   |
| Fine-Tuning         | QLoRA, Qwen3-8B, Unsloth |
| Frontend            | Svelte                   |
| Deployment          | Docker                   |