Problemunvalidated
The obvious move is to reuse the existing extraction eval gate: entity/edge macro-F1 plus substrate-recall — for a probabilistic prediction call-site (an LLM pre-pass that emits entities/relations each with a probability p, later merged into a Bayesian prior). Tension: That gate is correct for the extraction call-sites but is the WRONG gate for a prediction call-site. Outcome: would either pass a bad model or fail a good one.
cf9e6475-399e-4ac4-bef9-32035d785283
The obvious move is to reuse the existing extraction eval gate: entity/edge macro-F1 plus substrate-recall — for a probabilistic prediction call-site (an LLM pre-pass that emits entities/relations each with a probability p, later merged into a Bayesian prior). Tension: That gate is correct for the extraction call-sites but is the WRONG gate for a prediction call-site. Outcome: would either pass a bad model or fail a good one.