research: LLM fine-tuning pipeline — tip reactions as training signal #96
Goal
Use accumulated tip reaction data to fine-tune the `tip-generator` model. Tips that got `done` in the magic zone (15s–2min) are positive examples; `dismiss` within 5s are negative.
Training data construction
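The labeling rule from the Goal section could be sketched roughly as below. This is only an illustration, not the pipeline's actual code: the `TipReaction` record, its field names, and the `build_examples` helper are hypothetical, the latency is assumed to be measured in seconds from when the tip was shown, and the interval boundaries are assumed to be inclusive.

```python
from dataclasses import dataclass
from typing import List, Optional

# Thresholds taken from the issue text; inclusive boundaries are an assumption.
MAGIC_ZONE_S = (15.0, 120.0)   # "done" within 15s-2min counts as positive
FAST_DISMISS_S = 5.0           # "dismiss" within 5s counts as negative


@dataclass(frozen=True)
class TipReaction:
    """Hypothetical record shape; the real schema is not given in the issue."""
    tip_text: str
    reaction: str      # "done" or "dismiss"
    latency_s: float   # seconds between showing the tip and the reaction


def label_reaction(r: TipReaction) -> Optional[int]:
    """Return 1 (positive), 0 (negative), or None (not used for training)."""
    lo, hi = MAGIC_ZONE_S
    if r.reaction == "done" and lo <= r.latency_s <= hi:
        return 1
    if r.reaction == "dismiss" and r.latency_s <= FAST_DISMISS_S:
        return 0
    return None


def build_examples(reactions: List[TipReaction]) -> List[dict]:
    """Keep only reactions that match one of the two labeling rules."""
    examples = []
    for r in reactions:
        label = label_reaction(r)
        if label is not None:
            examples.append({"text": r.tip_text, "label": label})
    return examples
```

Reactions that match neither rule (for example a `done` after five minutes, or a slow `dismiss`) are dropped in this sketch rather than treated as weak signal; whether that is the right call is left open by the issue.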
Pipeline (Airflow DAG `finetune_tip_generator`)
`tip_scores` (requires ≥500 positive examples)
Notes