Genealogy workflows are structured and repetitive
Long term, we may need:
✔ lower latency
✔ predictable behavior
✔ private inference
✔ cost efficiency
Curated genealogical examples + reasoning
GPT-5.1 / Claude Sonnet (Teacher)
Fine-tuned smaller model (e.g., 7B)
According to Chat GPT
| Size | Runs Where | Notes |
|---|---|---|
| 3B | mobile/laptop | Fast, but may lack reasoning depth |
| 7B | AWS GPU | Best balance of accuracy + cost |
| 13B | Larger GPU | More capable, more cost |
| 300B? | Dedicated farm of H200s | LLMs |
| Stage | Cost | Notes |
|---|---|---|
| Dataset prep (internal work) |
Time-based | Needs sampling and formatting |
| Training Compute | $2K-$10K | Depends on dataset size + iterations |
| Evaluation/Refinement | $0K-$5K | Optional second pass |
AI Research Team - $30K on hardware (need to confirm)
ATB Team - $10K to train a model (need to confirm)
| Approach | Pricing Model | Cost of 150 rps per month |
|---|---|---|
| Hosted Sonnet/GPT | per-token | $1M - $14M |
| Self Hosted (AWS) | Fixed compute | $5K - $10K |
Tree-data costs around $1100/month
150rps = num of rps on person detail page in the last month
Minimum Viable Technical Proof