financial-research-evals

Public-safe AgentV result artifacts for the financial-research-agent demo project.

Source eval definitions live in EntityProcess/financial-research-agent. This repo stores Dashboard-ready artifacts under .agentv/results/runs/ only. Before pushing artifacts, run the public artifact preflight from agentv-deploy:

python3 ../agentv-deploy/scripts/check-public-result-artifacts.py .

Writer credentials should come from RESULT_SYNC_GITHUB_TOKEN or local git/gh auth and should be scoped only to this result repository where possible. Reader mode is anonymous HTTPS clone/pull.

Published validation runs

50-case Codex financial-research baseline — aggregate public baseline over 50 Dexter-adapted financial research questions.
One-test Codex web-search baseline — early live plumbing check for the native Dexter llm-grader rubric shape.

The source/eval repository also has a public narrative report: EntityProcess/financial-research-agent BASELINE_RESULTS.md.

Static HTML reports

50-case Codex financial-research baseline report — generated with agentv results report .agentv/results/runs/age-14-task-bundle-dogfood/2026-06-10T08-35-26Z-age-14-codex --out docs/index.html. Published at https://entityprocess.github.io/financial-research-evals/.
One-case Dexter Codex web baseline report — generated with agentv results report .agentv/results/runs/av-zk0.3-dexter-codex-web-baseline/2026-06-10T04-04-57-866Z --out docs/dexter-baseline.html. Published at https://entityprocess.github.io/financial-research-evals/dexter-baseline.html.

GitHub Pages serves the docs/ directory as the project homepage plus secondary baseline pages. Both files are self-contained, read-only AgentV reports and do not require a Dashboard server.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.agentv/results		.agentv/results
docs		docs
.gitignore		.gitignore
README.md		README.md
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

financial-research-evals

Published validation runs

Static HTML reports

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

financial-research-evals

Published validation runs

Static HTML reports

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages