Cholesterol & Statins

External Review Panel Appraisal

Cholesterol, statins, and the LDL hypothesis — A critical synthesis (Booth, May 2026)

Reviewed by: A simulated five-member multidisciplinary external panel Document audited: 72,962 words, 3,338 lines, 1.0 edition Use case under appraisal: Debate-preparation reference for an intelligent, non-clinician marketing director ahead of a substantive conversation with an industry-employed friend Reviewing posture: Polite but not flattering. The goal is calibration of the actual quality bar, not reassurance.


1. Senior Academic Lipidologist (EAS-consensus-panel standard)

Overall score: 78/100. Biology is largely correct, often elegant, occasionally underspecified, and once or twice quietly tilted in a direction the author signposts as "calibrated."

What is strong

What is weak or wrong

Citation-quality assessment (sampled)

Footnotes audited:

Citation Verdict
[^11] Endo, J Lipid Res 1992;33:1569–82, PMID 1464741 Real, correctly attributed
[^12] Istvan & Deisenhofer, Science 2001;292:1160–4, PMID 11349148 Real and load-bearing — the crystal-structure paper
[^17] Golomb & Evans, Am J Cardiovasc Drugs 2008;8:373–418, PMID 19159124 Real; correctly identified as the central sceptic-side review
[^25] SEARCH Collaborative Group, NEJM 2008;359:789–99, PMID 18650507 Real and correctly load-bearing for SLCO1B1
[^46] Marston et al., JAMA Cardiol 2022;7:250–256 — PMID 34773457 Real but the PMID year does not match year 2022; needs verification
[^47] Glavinovic et al., J Am Heart Assoc 2022;11:e025858, PMID 36216435 Real
[^54] Duewell et al., Nature 2010;464:1357–61, PMID 20428172 Real and correctly load-bearing
[^65] Nicholls et al. obicetrapib BROADWAY 2024 Marked [CITATION NEEDED] by the author — i.e. the author is honest that they could not verify this; appropriate flagging
[^69] Gibson et al. AEGIS-II / CSL112 Marked [CITATION NEEDED] by the author — appropriate flagging

The lipidologist notes that the explicitly flagged [CITATION NEEDED] items (eight or so across Parts I and II) are exactly the right epistemic move — but also that they betray exactly how the document was assembled: in a time budget that did not allow the author to chase down primary sources for every claim. The serious external reviewer would ask for these to be closed before the document is shared more widely.

Verdict

Fit for purpose: Y, with reservations. The biology will not let Anthony down in conversation, with the caveat that the discordance, CoQ10, and Lp(a) sections should be re-read with the specific overreaches above flagged in his mind.


2. Cardiology Trialist (TIMI Study Group standard)

Overall score: 71/100. The trial summaries are mostly accurate, often impressively so, but the master trial tables are over-precise relative to source uncertainty and a few effect sizes are subtly wrong.

What is strong

What is weak or wrong

Citation-quality assessment (sampled)

Citation Verdict
[^31] 4S, Pedersen et al., Lancet 1994 Real, correctly identified
[^33] WOSCOPS, Shepherd et al., NEJM 1995 Real
[^41] HPS, Lancet 2002;360:7–22 Real
[^49] PROVE-IT, Cannon et al., NEJM 2004 Real
[^54] JUPITER, Ridker et al., NEJM 2008;359:2195–2207, PMID 18997196 Real
[^62] Tardif et al., dal-GenE, Eur Heart J 2022;43:3947–3956, PMID 35856777 Real
[^G1] Shared grounding file Not a publication — the document cites itself; appropriate as an internal pointer but not a published source
Part II-B [^64] CTT 2022 muscle-symptoms IPD, Lancet 2022;400:832–845, PMID 36049498 Real and load-bearing for the SAMS chapter
Part II-B [^65] Schubert adherence meta-analysis, Eur Heart J 2025 Marked [CITATION NEEDED] — appropriate flagging by author

The trialist confirms that the major outcomes trials are correctly cited at the level of journal, year, and DOI/PMID. The risk is not citation invention but quantitative over-claim and uneven steelmanning of pivotal trials.

Verdict

Fit for purpose: Y, with reservations. The trial-by-trial summaries will hold up. The trialist recommends Anthony double-check NNT values verbally rather than read them off the tables, and avoid claiming "the trial proved" anything that the trial only suggested.


3. Evidence-Based Medicine Methodologist (Cochrane senior-editor standard)

Overall score: 64/100. The framework is set up well; the application is more rhetorical than the framework deserves; the "calibrated landing" wears the appearance of balance without quite delivering it.

What is strong

What is weak or wrong

Citation-quality assessment (sampled)

Citation Verdict
[^A1] Hill AB, Proc R Soc Med 1965;58:295–300, PMID 14283879 Real and correctly identified
[^A4] Ference et al., J Am Coll Cardiol 2012;60:2631–9, PMID 23083789 Real and load-bearing
Chapter 17 [^58] Ioannidis JPA, "More than a billion people taking statins?" JAMA 2014;311:463–4 Real; the methodologist notes this is one of the most relevant single citations for the calibrated landing
Chapter 19 [^81] Graham, Lancet 2005 estimated Vioxx excess deaths Real and correctly identified
Chapter 19 [^78] BMJ 2014 Godlee/Collins IPD-release exchange Real and correctly described

Verdict

Fit for purpose: Y, with serious reservations. The methodologist would caution Anthony that the document is calibrated within an orthodox prior, and that someone reading the heterodox literature carefully (Ravnskov, Diamond, DuBroff, Demasi) will catch the asymmetries above. The document should not be presented as more even-handed than it is. Anthony should also know that the bibliography and methods sections promised in the contents do not exist.


4. Pharmacology Lecturer (research-active academic standard)

Overall score: 76/100. Drug-class biology is mostly correct and refreshingly precise. The mechanism explanations occasionally cross into territory where the underlying literature is more contested than the prose admits.

What is strong

What is weak or wrong

Citation-quality assessment (sampled)

Citation Verdict
Part IV [^5] SAMSON, Howard et al., JACC 2021;78:1210–22, PMID 34531021 Real and load-bearing
Part IV [^7] STOMP, Parker et al., Circulation 2013;127:96–103 Real
Part IV [^8] GAUSS-3, Nissen et al., JAMA 2016;315:1580–90 Real
Part IV [^17] Sirvent et al., Curr Opin Pharmacol 2008;8:333–8 Real; relevant for prenylation/myotoxicity mechanism
Part IV [^18] Mammen AL, anti-HMGCR autoimmune myopathy, NEJM 2016;374:664–9 Real and correctly identified
Part IV [^56] SEARCH Collaborative Group SLCO1B1 GWAS Real; foundational for the pharmacogenomic outlier case
Part IV [^58] CPIC SLCO1B1/ABCG2/CYP2C9 guideline, Cooper-DeHoff 2022 Clin Pharmacol Ther Real and current
Part IV [^54] Sanchis-Gomar et al., "Effects of statins on renal function: a Mendelian randomization study," Atherosclerosis 2020;293:8–14 Marked [CITATION NEEDED for exact MR study; representative] — the lecturer's verification could not find an exact match at that journal/volume/page; the author's flagging is honest, but the citation should be replaced before external circulation

Verdict

Fit for purpose: Y, with one specific caveat. The pharmacology will hold up in conversation. The lecturer recommends Anthony not over-commit on the strong "CoQ10 is mostly a plasma effect, intramuscular CoQ10 is preserved" line, because the Laaksonen 1995 finding is one paper from short-term exposure and the long-term tissue picture is genuinely less settled than the document implies.


5. Clinical Pharmacist (fit-for-purpose-for-this-reader standard)

Overall score: 82/100. As a debate-preparation document for an intelligent non-clinician, this is unusually good. It will not embarrass Anthony in conversation; it may, in places, over-equip him with precision that exceeds the underlying evidence.

What is strong

What is weak or wrong

Citation-quality assessment (sampled)

Citation Verdict
Chapter 26 [^8] Byrne et al., JAMA Intern Med 2022;182:474–481, PMID 35285850 Real and correctly load-bearing
Chapter 28 [^27] STAREE protocol, Zoungas et al., J Am Heart Assoc 2024 Real (referenced via JAHA URL)
Chapter 28 [^29] PREVENTABLE design, Joseph et al., J Am Geriatr Soc 2023;71:1701–13, PMID 36974345 Real
Chapter 30 [^47] CARDS, Colhoun et al., Lancet 2004 Real
Chapter 30 [^51] SHARP, Baigent et al., Lancet 2011;377:2181–92 Real
Chapter 31 [^65] PURE, Dehghan et al., Lancet 2017;390:2050–62 Real and correctly identified

Verdict

Fit for purpose: Y. As a debate-preparation document for an intelligent non-clinician, this is the best of the panel's verdicts. The pharmacist recommends Anthony memorise Card 7 (honest position) and Chapter 33 (NNT/NNH table with evidence-weight column), read Chapter 20 (Roche footprint) before the conversation, and not over-commit on pending trials.


Panel Synthesis

Where the panel agrees

All five reviewers agree on the following points.

First, the document is more competent than would be expected from a 30-minute multi-agent orchestration. The biology, the trial summaries, the pharmacology, and the harms chapter are individually pitched at undergraduate-medical-school or specialist-pharmacist level. The reviewers were not expecting this. There is no chapter that is embarrassingly wrong; there is no citation that is straightforwardly invented (the explicit [CITATION NEEDED] flags are honestly disclosed, not concealed errors); the steelmanning of both camps is real, even where one panel member or another felt it was uneven.

Second, the document is fit-for-purpose for the stated use case — debate preparation for an intelligent non-clinician about to talk to an industry-employed friend. None of the five reviewers issues a "do not use" verdict. The lipidologist, the trialist, the methodologist, and the pharmacology lecturer all issue Y-with-reservations; the clinical pharmacist issues an unqualified Y.

Third, the document is over-precise in two specific places: the NNT/NNH-by-risk-stratum table (Chapter 33) and the master trial tables (end of Chapter 6, end of Chapter 14). The trialist and the pharmacist agree on this directly; the methodologist agrees structurally (over-precision is a calibration failure); the lipidologist agrees by implication; the pharmacology lecturer is neutral. The panel's collective recommendation is that Anthony should treat the numerical tables as orders of magnitude rather than point estimates, and should never quote a bare NNT integer in conversation without flagging "over X years, in this kind of population, against this comparator."

Fourth, the document has real but bounded asymmetries in its steelmanning. The methodologist's complaint that the calibrated landing position is rhetorically balanced rather than evidentially calibrated is endorsed by the lipidologist (who notes that the CoQ10 strong-form rebuttal is uneven) and by the trialist (who notes that JUPITER is read in a way that subtly favours the heterodox interpretation). The pharmacology lecturer notes that the prenylation outlier argument is rescued by appeal to pharmacogenomics without specifying which polymorphisms. The pharmacist notes that the medicalisation argument in Myth 8 is sociopolitical rather than evidentiary. Each of these is a small asymmetry; cumulatively they amount to a consistent tilt in the document — but not toward either camp. The tilt is toward the document's own preferred posture of "calibrated middle, agreeing with the consensus on the substantive science while conceding the heterodox process critiques." This tilt is internally consistent and is in fact what the document advertises in the cover note ("Steelmanned both sides, landing at a calibrated position"). The panel agrees that Anthony should know this is what he is reading.

Fifth, the bibliographic apparatus promised in the table of contents is not delivered. There is no consolidated Vancouver-style numbered bibliography; no glossary; no author/agent attribution and methods note. Footnotes are present within each Part, but cross-Part de-duplication has not been done, and the same primary source (e.g., Ference 2017 EAS consensus, Cohen-Hobbs 2006, Sabatine FOURIER 2017, Howard SAMSON) appears in multiple footnote lists with slightly different formatting in each. For an internal-use document, this is forgivable; for any external circulation, it is not.

Where the panel disagrees

The reviewers disagree in two main directions.

The methodologist is harsher than the other four. They score the document 64/100 — lower than any other reviewer by ten or more points. Their core complaint is structural: that the document does the right things at the meta-level (declared hierarchy, declared falsification criteria, declared evidence-weighting) but then applies them in a way that delivers the conclusion the orthodox position would have delivered without them. They note that the "what would change my mind" chapter does not include any conditions that would invalidate the LDL-causality core, even though such conditions are imaginable in principle. They note that "reasonable people can disagree" is repeated five or more times in Part VI without specifying the operational implication for an individual patient. The methodologist is the most demanding reviewer because Cochrane standards are the most demanding standards; the other four are working at journal-article or specialist-clinic standards which are somewhat more forgiving.

The clinical pharmacist is more lenient than the other four. They score the document 82/100 because the use case is debate-preparation for an intelligent non-clinician, not publication. Their argument: for this use case, the document is unusually good. The debate playbook works. The honest one-sentence position works. The Roche footprint chapter works. The methodologist would not score the document this leniently because Cochrane standards do not flex for use case; the pharmacist explicitly does.

The other three reviewers (lipidologist 78, trialist 71, pharmacology lecturer 76) cluster in the mid-to-high seventies, which is the panel's median.

Consolidated strengths

Consolidated weaknesses

Collective verdict

The panel's collective verdict is fit for purpose for the stated use case, with three operational recommendations to Anthony before he walks into the conversation.

First, read the document at least twice before the conversation, with a specific eye to where the steelmanning is uneven. The document's strongest claims (LDL is causal; statins reduce events in secondary prevention; HDL pharmacological raising is a failed strategy; FH is the cleanest natural experiment) are well-supported and Anthony can quote them with confidence. The document's weakest claims (specific NNT integers in low-risk primary prevention; the 90% nocebo ratio as if it generalised to all populations; the dismissal of the strong-form CoQ10 hypothesis on the basis of two under-powered supplementation trials) should be quoted as ranges and with caveats, not as point estimates.

Second, memorise Card 7 (honest position) and Chapter 34 (calibrated landing one-sentence) as the position Anthony will defend in conversation. These are the strongest single artefacts the document produces and they will hold up against a well-informed interlocutor. Card 8 (Roche-specific aside) should be used once and not over-used. The Chapter 33 NNT/NNH table should be referred to but not read from verbatim — the numbers are over-precise relative to underlying evidence.

Third, do not over-claim on pending readouts. STAREE, PREVENTABLE, ORION-4, Lp(a)HORIZON, OCEAN(a)-Outcomes, ACCLAIM-Lp(a), VESALIUS-CV, PREVAIL (obicetrapib): the document correctly flags each of these as awaiting data. An industry-employed friend may have access to conference-presented interim signals or pre-publication abstracts the document does not have. If pressed on a pending readout, Anthony's correct response is "I'm not committing on that until full publication," not "the document I read predicts X."

The panel notes one further thing the document does not say but probably should. The document is itself a multi-agent LLM-orchestrated product (per the cover note: "Compiled by: Multi-agent research orchestration with citation audit"). Whatever a careful clinical reviewer would think about that as a meta-fact is up to the reviewer, but the panel notes that the document is not pretending otherwise, and the panel's own scores would have been somewhat lower had the document been produced under that disclosure but with substantially more citation errors. The disclosure is honest; the product is mostly good; the gaps are real but bounded. Anthony should walk into the conversation knowing he is well-prepared but not invulnerable, and the document does not pretend to make him invulnerable.

Final panel mean: 74.2/100. A solid B; not an A. A reasonable basis for a substantive conversation; not a reasonable basis for issuing strong claims about specific NNTs or about the closure of specific controversies. The author's own framing — "calibrated position with explicit residual uncertainty" — is approximately the right framing for what the document delivers, with the caveat that the calibration is genuine on the science and slightly looser on the operational implications.