How BioSniper verifies every number

“AI-powered” is table stakes. What makes an answer trustworthy is whether you can click through to the SEC filing, FDA record, or clinical trial behind it. That traceability is engineered into BioSniper at every layer — here is exactly how.

909 companies229,493 SEC filings18,657 clinical trials8,603 FDA decisions99.99% citation-verification pass rate (7d)
Core sources checked hourly·How we verify →

7 authoritative public sources

BioSniper aggregates only public, authoritative datasets. Core sources are checked hourly; market quotes refresh every minute. Nothing is scraped from opinion sites or social media into the evidence base.

  • SEC EDGAR

    10-K, 10-Q, 8-K, S-1 filings & insider transactions

  • FDA

    PDUFA dates, approvals, CRLs & Advisory Committee meetings

  • ClinicalTrials.gov

    Trial registrations, status changes & readouts

  • PubMed

    Peer-reviewed scientific literature

  • Patents

    USPTO & Google Patents via BigQuery

  • News

    Real-time biotech & pharma news aggregation

  • Market Data

    Stock quotes, ETFs & market indicators

Citation enforcement — no evidence, no claim

Every AI-generated statement on BioSniper must cite the source it came from — inline tags like [sec], [clinical] or [fda] point at the document the model actually read.

Quoted evidence is held to a stricter bar: quotes must appear verbatim in the source text. After the model answers, an independent verification step re-checks every quote against the retrieved documents, character for character. A quote that cannot be found is discarded before it ever reaches a page — regardless of how plausible it sounds.

And when the evidence simply is not there, BioSniper answers UNKNOWNinstead of guessing. Missing data is shown as missing — never estimated, never interpolated, never filled with a model’s best guess.

The citation-verification pass rate shown above is computed live from our verification logs over the past 7 days — it is a measurement, not a marketing claim.

Automated correctness checks, every day

Ingesting data is the easy part; keeping it correct is not. A daily automated audit sweeps the entire dataset across more than a dozen correctness dimensions — freshness, duplicates, referential integrity, misattribution (a real number attached to the wrong company), fabricated-success patterns, field quality, and coverage across every actively listed company.

Failures page the team automatically. When an upstream source silently changes its format — regulators do this without notice — these checks are how we find out before you do.

See it in practice — every evidence quote on a company page links to the original document.