Social completeness: X attempted 5/N/A, FB attempted 3/N/A, Reddit 10 errors, YouTube unavailable in this fallback collector.
Hot coding-agent CLI/runtime; Emerging harness eval nội bộ; Watch multi-agent orchestration; Noise benchmark claims không repro; Declining pure autocomplete-only rollout.
X/YouTube KOL metrics: N/A trong cron do public/API block; link attempts ở appendix.
Thesis: agentic SDLC tạo ROI khi có harness + sandbox + context layer. Counter-signal: public benchmark/social metrics thiếu trong run → giảm confidence còn 62%. Decision: trial, không adopt toàn công ty.
| Repo | Metric | Fabbi move |
|---|---|---|
| jazzyalex/agent-sessions | 588 stars/36 forks/1 issues | Trial nếu khớp harness/runtime |
| jarrodwatts/claude-hud | 23981 stars/1079 forks/14 issues | Trial nếu khớp harness/runtime |
| shareAI-lab/learn-claude-code | 63362 stars/10358 forks/107 issues | Trial nếu khớp harness/runtime |
| colbymchenry/codegraph | 32054 stars/1900 forks/202 issues | Trial nếu khớp harness/runtime |
| esengine/DeepSeek-Reasonix | 12993 stars/755 forks/315 issues | Trial nếu khớp harness/runtime |
| vercel-labs/zerolang | 4667 stars/298 forks/123 issues | Trial nếu khớp harness/runtime |
| mackerelio/mackerel-agent | 428 stars/90 forks/4 issues | Trial nếu khớp harness/runtime |
| superradcompany/microsandbox | 6346 stars/308 forks/50 issues | Trial nếu khớp harness/runtime |
| mochilang/mochi | 328 stars/14 forks/51 issues | Trial nếu khớp harness/runtime |
| barnum-circus/barnum | 106 stars/4 forks/3 issues | Trial nếu khớp harness/runtime |
| deusyu/harness-engineering | 3245 stars/295 forks/0 issues | Trial nếu khớp harness/runtime |
| bgdnvk/clanker | 320 stars/17 forks/6 issues | Trial nếu khớp harness/runtime |
| Product | Signal | Decision |
|---|---|---|
| Claude Code / Codex / Cursor / Copilot | 4 major agent surfaces tracked | Trial 2 tools on same Fabbi task set |
| Devin / Replit / Jules | 3 autonomous-agent products tracked | Watch for enterprise controls |
| OpenCode / Sourcegraph / JetBrains | 3 workflow/codebase-context angles | Use as architecture reference |
| Domain | Now 0-2w | Next 1-2m | Decision |
|---|---|---|---|
| FARE | Codebase context benchmark | Repo graph/RAG eval | Trial |
| NEXA | CLI agent harness | Sandbox exec + task replay | Trial |
| SYNCA | Risk/quality gates | PR policy + audit log | Adopt |
| DOMUS | Monitor | Backoffice workflow POC | Watch |
| Japan/VN/Global | Market monitoring | Offer AI-SDLC assessment | Trial |
| Signal | Evidence | Counter-signal | Implication | Confidence | Decision | Next validation |
|---|---|---|---|---|---|---|
| Agent CLI trở thành primary dev surface | 45 GitHub + 10 product refs | Social engagement N/A | NEXA harness needed | 70% | trial | 20-task benchmark |
| Reliability/eval là bottleneck | 40 HN/dev-web refs | arXiv feed failed | SYNCA quality gates | 62% | adopt controls | Defect escape-rate test |
| Context engineering matters hơn model-only | Repo/product overlap | No quantified customer data | FARE investment | 64% | trial | Repo onboarding time metric |
Scanned 119 candidates. Breakdown: {'HN': 40, 'GitHub': 45, 'GitHub_ERR': 1, 'arXiv_ERR': 5, 'Reddit_ERR': 10, 'Product': 10, 'X_ATTEMPT': 5, 'FacebookPublic_ATTEMPT': 3}. PASS volume >=100. PARTIAL social: X/FB public search attempted but engagement unavailable; Reddit JSON returned errors in fallback collector; YouTube collector unavailable. Confidence impact: -18 pts. No metrics fabricated.