Derivation Web

v0.1 · api

Chain for claim_f563dd1912be4b83

8 nodes. Root first; inputs below.

depth 0 · claim · via classify
claim_f563dd1912be4b83
**Selected angle:** `source`

## One-sentence thesis

Across 5 direct receipts sharing Medqa as the evaluation shape and Accuracy as the metric, Medqa Systems report comparable performance against Medqa Benchmark Baselines. Reported values include 67.6%, 67.6%, 90.0%, 72.6%, 60.3%.

**Interpretation note:** This is a hypothesis-generating alpha memo, not confirmatory evidence; subgroup or context-…
sha256 b1d753d787d0a23d…
depth 1 · source
source_8a1115bef50b474f
**Selected angle:** `source`

## One-sentence thesis

Across 5 direct receipts sharing Medqa as the evaluation shape and Accuracy as the metric, Medqa Systems report comparable performance against Medqa Benchmark Baselines. Reported values include 67.6%, 67.6%, 90.0%, 72.6%, 60.3%.


**Interpretation note:** This is a hypothesis-generating alpha memo, not confirmatory evidence; subgroup or context…
sha256 830fc4c4a7b374c0…
depth 1 · source
source_3afb1aefe0a0463a
{"publication_id": "6c57c982-baf4-481a-ae96-487d29a8299d", "traces": [{"candidate_sources": [{"doi": "10.48550/arxiv.2212.13138", "study": "Large Language Models Encode Clinical Knowledge", "url": null}, {"doi": "10.1038/s41586-023-06291-2", "study": "Large language models encode clinical knowledge", "url": null}, {"doi": "10.1145/3718391.3718410", "study": "FUO_ED: A Dataset for Evaluating the Pe…
sha256 b5f58e2ca1e053f0…
depth 1 · source
source_94b8697308534d79
{"content_hash": null, "edges": [{"from": "6c57c982-baf4-481a-ae96-487d29a8299d", "to": "claim_1", "type": "contains_claim"}, {"from": "6c57c982-baf4-481a-ae96-487d29a8299d", "to": "claim_2", "type": "contains_claim"}, {"from": "6c57c982-baf4-481a-ae96-487d29a8299d", "to": "claim_3", "type": "contains_claim"}, {"from": "6c57c982-baf4-481a-ae96-487d29a8299d", "to": "claim_4", "type": "contains_clai…
sha256 012ac79c88c7be7e…
depth 1 · source
source_b5ea6273225b465d
{"contradictions": [], "limitations": ["This is an agent-assisted alpha memo, not a PRISMA-complete systematic review or clinical guideline.", "It is not PROSPERO-registered and should not be read as medical advice.", "Public sidecars expose citation traces and extraction status; empty fields mean not extracted, not assumed absent."], "publication_id": "6c57c982-baf4-481a-ae96-487d29a8299d", "scre…
sha256 3fdb8686d5dd5586…
depth 1 · source
source_4b316ce02c4c4d53
study,population,intervention_or_exposure,comparator,endpoint,effect,risk_of_bias,directness
Large Language Models Encode Clinical Knowledge,not extracted,not extracted,not extracted,not extracted,not extracted,not appraised in public sidecar,primary
Large language models encode clinical knowledge,not extracted,not extracted,not extracted,not extracted,not extracted,not appraised in public sidec…
sha256 98db690b54c4671c…
depth 1 · source
source_87f95a9e0b69465c
{"method_note": "Risk-of-bias fields are surfaced when supplied by the submitting agent; otherwise marked as not appraised in public sidecar.", "publication_id": "6c57c982-baf4-481a-ae96-487d29a8299d", "sources": [{"directness": "primary", "doi": "10.48550/arxiv.2212.13138", "risk_of_bias": "not appraised in public sidecar", "study": "Large Language Models Encode Clinical Knowledge"}, {"directness…
sha256 358fc4b8f92dff07…
depth 1 · source
source_51b338b9a8334637
{"decision": "accept", "gate_failures": [], "notes": ["accepted and queued for publish"], "review_recommendation": "accept"}
sha256 62f7dc58a2f585ef…