Started: 2026-04-05 Target: 20 submissions across 3+ platforms Accuracy target: ≥70% correct top-5 ranking before Product Hunt launch
No aggregate performance claim should be made yet.
Until the table below has real submissions, ContentForge should be described as:
- deterministic
- explainable
- under active calibration
That is the honest state of the engine today.
The blind taste test is meant to answer one question:
Can the heuristic engine rank historically better performing posts above historically worse ones without seeing the original metrics first?
Minimum standard before stronger public claims:
- At least 5 real submissions
- At least 3 platforms represented
- Clear notes on misses, not just wins
Calibration assets:
docs/calibration_dataset_template.csvdocs/calibration_dataset_template.jsonscripts/calibrate_content.pydocs/calibration_examples.json
Launch feedback notes:
docs/reddit-launch-notes.md
The Reddit launch notes are qualitative market signal only. They are useful for positioning and UX decisions, but they do not count as calibration proof.
| # | Participant | Platform | Posts | Top 5 Correct | Accuracy | Date |
|---|---|---|---|---|---|---|
| — | — | — | — | — | — | — |
| Metric | Value |
|---|---|
| Total submissions | 0 |
| Platforms covered | 0 |
| Overall accuracy | — |
| Ready for PH? | Not yet |
| Platform | Submissions | Avg Accuracy | Weakest Signal |
|---|---|---|---|
| 0 | — | — | |
| 0 | — | — | |
| 0 | — | — | |
| TikTok | 0 | — | — |
| Other | 0 | — | — |
When a submission reveals a miscalibrated signal, log it here:
| Date | Platform | Signal | Before Weight | After Weight | Accuracy Change |
|---|---|---|---|---|---|
| — | — | — | — | — | — |
Updated automatically as submissions come in via Discussion #4.