The scoring let too many weak leads through.
The first version qualified shows that technically hit the thresholds but weren't strong fits. Reply quality was low and it showed in the numbers.
Fix: Recalibrated thresholds, added recency weight. Signal quality improved immediately.