How BioSniper verifies every number
“AI-powered” is table stakes. What makes an answer trustworthy is whether you can click through to the SEC filing, FDA record, or clinical trial behind it. That traceability is engineered into BioSniper at every layer — here is exactly how.
7 authoritative public sources
BioSniper aggregates only public, authoritative datasets. Core sources are checked hourly; market quotes refresh every minute. Nothing is scraped from opinion sites or social media into the evidence base.
- SEC EDGAR
10-K, 10-Q, 8-K, S-1 filings & insider transactions
- FDA
PDUFA dates, approvals, CRLs & Advisory Committee meetings
- ClinicalTrials.gov
Trial registrations, status changes & readouts
- PubMed
Peer-reviewed scientific literature
- Patents
USPTO & Google Patents via BigQuery
- News
Real-time biotech & pharma news aggregation
- Market Data
Stock quotes, ETFs & market indicators
Citation enforcement — no evidence, no claim
Every AI-generated statement on BioSniper must cite the source it came from — inline tags like [sec], [clinical] or [fda] point at the document the model actually read.
Quoted evidence is held to a stricter bar: quotes must appear verbatim in the source text. After the model answers, an independent verification step re-checks every quote against the retrieved documents, character for character. A quote that cannot be found is discarded before it ever reaches a page — regardless of how plausible it sounds.
And when the evidence simply is not there, BioSniper answers UNKNOWNinstead of guessing. Missing data is shown as missing — never estimated, never interpolated, never filled with a model’s best guess.
The citation-verification pass rate shown above is computed live from our verification logs over the past 7 days — it is a measurement, not a marketing claim.
Automated correctness checks, every day
Ingesting data is the easy part; keeping it correct is not. A daily automated audit sweeps the entire dataset across more than a dozen correctness dimensions — freshness, duplicates, referential integrity, misattribution (a real number attached to the wrong company), fabricated-success patterns, field quality, and coverage across every actively listed company.
Failures page the team automatically. When an upstream source silently changes its format — regulators do this without notice — these checks are how we find out before you do.
See it in practice — every evidence quote on a company page links to the original document.