Profile-aware Retrieval Reranking (Issue #30)

Goal

Improve retrieval relevance by applying deterministic user-profile context signals before GPT reranking.

Profile fields used in retrieval path:

The retrieval stack now runs:

This ensures grade/subject/tier context is consistently represented even when LLM reranking is unavailable.

For each candidate chunk, we compute:

Then apply tier multiplier:

Final contextual score:

contextual_score = hybrid_score + (raw_profile_boost * tier_multiplier)

Returned metadata includes a full breakdown under metadata.profile_rerank for traceability.

If no profile signals are available (grade, subject, and major all missing), reranking is skipped and hybrid order is preserved.

In this fallback mode, each candidate includes:

"profile_rerank": {
  "applied": false,
  "reason": "missing_profile_context"
}

Regression test coverage demonstrates lift versus baseline ordering:

baseline: higher hybrid score can rank first even if off-profile
profile-aware rerank: matching grade+subject candidate is promoted above off-profile candidates with similar baseline scores

See:

tests/services/test_retrieval_pipeline.py::test_profile_rerank_boosts_grade_subject_matches_over_higher_baseline_score