Title: Continued Citation of Bad Science and What We Can Do About It
Abstract: Even papers that falsify data continue to be cited. I describe network and text analysis for studying how authors continue to cite bad science: articles retracted from the literature due to serious flaws or errors. Jodi will present an in-depth case study of a human trial cited for over 10 years after it was retracted for falsifying data. Then, will describe how the team scaled up to study a data set of 7000 retracted papers and hundreds of thousands of citations. Finally, Jodi will discuss an ongoing Sloan-funded stakeholder consultation that is bringing editors, publishers, librarians, researchers, and research integrity experts together to address this problem.
LAMP PCR.pptx by Dr. Chayanika Das, Ph.D, Veterinary Microbiology
Continued citation of bad science and what we can do about it--2021-02-19
1. Continued Citation of Bad Science and
What We Can Do About It
Jodi Schneider
School of Information Sciences
University of Illinois at Urbana-Champaign
jodi@illinois.edu
Twitter:@jschneider
infoqualitylab.org
Syracuse iSchool, Center for Computational & Data Science
2020-02-19
2. What is retraction?
“Retraction is a mechanism for correcting the literature and
alerting readers to articles that contain such seriously flawed or
erroneous content or data that their findings and conclusions
cannot be relied upon.”
COPE Council. COPE Guidelines: Retraction Guidelines. Version 2 November 2019
https://doi.org/10.24318/cope.2019.1.4
3. What is retraction?
“Retraction is a mechanism for correcting the literature and
alerting readers to articles that contain such seriously flawed or
erroneous content or data that their findings and conclusions
cannot be relied upon.”
“Prompt retraction should minimise the number of researchers
who cite the erroneous work, act on its findings, or draw
incorrect conclusions such as from ‘double counting’ redundant
publications in meta-analyses or similar instances."
COPE Council. COPE Guidelines: Retraction Guidelines. Version 2 November 2019
https://doi.org/10.24318/cope.2019.1.4
5. Plan for today
• Case study of a retracted paper, analyzing text & network data
• Large-scale study of almost 50,000 citation contexts
• Finding knowledge dependencies using argumentation theory
& citation context analysis
• Stakeholder engagement to address the problem
9. Retraction doesn’t stop citation.
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a fraudulent clinical trial report, eleven years
after it was retracted for falsifying data.” In Scientometrics. http://doi.org/10.1007/s11192-020-03631-1
10. Retraction doesn’t stop citation.
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a fraudulent clinical trial report, eleven years
after it was retracted for falsifying data.” In Scientometrics. http://doi.org/10.1007/s11192-020-03631-1
5 out of 112 post-retraction citations
mention the retraction.
The remaining 107 use the paper –
and its faked data – as normal
science.
11. Since its 2008 retraction, this RCT paper has been cited over 100 times.
Only 4% (5/112) mention the retraction.
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a fraudulent clinical trial report, eleven years after it was retracted for
falsifying data.” In Scientometrics. http://doi.org/10.1007/s11192-020-03631-1
12. Retraction doesn’t stop citation.
Only 5 mention the retraction
2008
32 direct citations
63 indirect published 2008
2014 2019
114 direct citations
256 indirect published 2014
148 direct citations
108 indirect published 2019
Only 2 mention the retraction
retraction
RCT Retracted:
Author falsified data
07
Paper (black)
n articles (blue): 22
on articles (red): +25
2008
Matsuyama Paper (black)
# of first−generation articles (blue): 32
# of second−generation articles (red): +63
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a fraudulent clinical trial
report, eleven years after it was retracted for falsifying data.” In Scientometrics. http://doi.org/10.1007/s11192-020-03631-1
13. Described the Matsuyama paper’s methods and/or results (but not the retraction):
35 direct citations 2010-2019 (blue squares)
Cited by 161 articles (red squares)
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a
fraudulent clinical trial report, eleven years after it was retracted for falsifying data.” In Scientometrics.
http://doi.org/10.1007/s11192-020-03631-1
Specific citation 2010−2019
14. “Health benefits of flaxseed” (Fitzpatrick, 2011)
Government research bulletin from Nepal (Jha, 2016)
Spread of misinformation to a second generation
which cites the retracted paper as “evidence”
of the anti-inflammatory impact of flax.
“Whilst it is true that very little
ALA converts to the long chain
polyunsaturated omega-3 found in
marine oils, it does have beneficial
effects itself (Fitzpatrick, 2011).”
cites a book chapter
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a
fraudulent clinical trial report, eleven years after it was retracted for falsifying data.” In Scientometrics.
http://doi.org/10.1007/s11192-020-03631-1
15. An Irish nutritional support shop
recommends n-3 fats to athletes
(Healthy fats, fish oils & omega-
3 supplementation, 2017)
Spread of misinformation to a second generation
which cites the retracted paper as having demonstrated “Improved 6 min walk test,
decreased leukotriene B4 level, TNF-alpha, IL-8 [91]” (i.e., the faked data for which the
Matsuyama paper was retracted)
“During periods of illness, this may help promote
recovery and faster return to training. Interestingly, n-3
fats are sometimes provided to COPD patients (severe
airway damage and breathing difficulties) and prior to
surgery in order to support the immune system and
speed recovery by helping to control inflammation and
infection, and repair damaged cells17.”
“Immunologic impact of nutrient depletion in chronic obstructive pulmonary disease”,
(Herzog & Cunningham-Rundles, 2011)
cites an article
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a
fraudulent clinical trial report, eleven years after it was retracted for falsifying data.” In Scientometrics.
http://doi.org/10.1007/s11192-020-03631-1
16. A pre-clinical study on lung
repair following dust exposure
(Nordgren et al., 2018)
Spread of misinformation to a second generation
where the only cited “evidence” for the effect of n-3 PUFAs on inflammation in COPD
came from the retracted Matsuyama paper.
“Furthermore, studies reveal diets high in omega-3
polyunsaturated fatty acids (n-3 PUFA) may be
beneficial in inflammatory lung conditions, including
asthma and COPD (17).”
“Beneficial effects of n-3 PUFA on chronic airway inflammatory diseases” (Giudetti
and Cagnazzo, 2012)
cites this article to motivate its work:
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a
fraudulent clinical trial report, eleven years after it was retracted for falsifying data.” In Scientometrics.
http://doi.org/10.1007/s11192-020-03631-1
17. Difficult to get to the retraction notice via database
search
• We tested 8 databases: Only 1 (EMBASE) had a working link to the retraction notice!
• Linking errors give the impression that the retraction notice doesn’t exist!
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. 2020. “Continued post-retraction citation of a
fraudulent clinical trial report, eleven years after it was retracted for falsifying data.” In Scientometrics.
http://doi.org/10.1007/s11192-020-03631-1
19. Large-scale study of citations in biomedical papers
Started with:
• Retracted papers: All 7813 retracted papers in PubMed as of August
2020
• Citation contexts: All papers in PubMed Central open access XML subset
as of May 2019 (Hsiao, T.-K., & Torvik, V. I., in preparation).
Our data set:
49,236 citation contexts citing 4611 retracted papers
Included 13,364 post-retraction citation contexts citing 2765
retracted papers
Tzu-Kun Hsiao & Jodi Schneider. Continued Use of Retracted Papers: Temporal Trends in Citations and Awareness of
Retractions Shown in Citation Contexts. Working paper https://osf.io/5z2n4/?view_only=ab8e49facd5d4372b52d1419af8b9651
20. Large-scale study of citations in biomedical papers
Examples from the 13,364 post-retraction citations in our data set:
• A clinico-histopathologic study in rabbits confirmed that PRP treatment
can achieve a faster wound healing rate |B8|.
• However, to date, only one human study has demonstrated an induction
of SIRT1 mRNA level in PBMCs |sbref41|.
Tzu-Kun Hsiao & Jodi Schneider. Continued Use of Retracted Papers: Temporal Trends in Citations and Awareness of
Retractions Shown in Citation Contexts. Working paper https://osf.io/5z2n4/?view_only=ab8e49facd5d4372b52d1419af8b9651
21. Retracted papers are not cited differently from
non-retracted papers
Tzu-Kun Hsiao & Jodi Schneider. Continued Use of Retracted Papers: Temporal Trends in Citations and Awareness of
Retractions Shown in Citation Contexts. Working paper https://osf.io/5z2n4/?view_only=ab8e49facd5d4372b52d1419af8b9651
22. Few authors show awareness of retraction
• 613 citation contexts out of 13,000+ (4.6%) post-retraction mentioned
the retraction
• These typically address previous accomplishments or historical events.
• May provide underlying data for the study (e.g. study of ethics, retraction, etc.)
Tzu-Kun Hsiao & Jodi Schneider. Continued Use of Retracted Papers: Temporal Trends in Citations and Awareness of
Retractions Shown in Citation Contexts. Working paper https://osf.io/5z2n4/?view_only=ab8e49facd5d4372b52d1419af8b9651
23. Few authors show awareness of retraction
• 95% of post-retraction citations do not show awareness of the
retraction
Tzu-Kun Hsiao & Jodi Schneider. Continued Use of Retracted Papers: Temporal Trends in Citations and Awareness of
Retractions Shown in Citation Contexts. Working paper https://osf.io/5z2n4/?view_only=ab8e49facd5d4372b52d1419af8b9651
25. Fu, Yuanxi, Jodi Schneider. 2020. “Towards knowledge maintenance in scholarly digital libraries with
keystone citations.” In JCDL 2020, 217–226. doi:10.1145/3383583.3398514
When a paper gets retracted, does that impact
the validity of a paper that cites it?
26. It depends on the argument structure!
Retraction Watch: https://retractionwatch.com/2020/12/09/authors-of-
meta-analysis-on-heart-disease-retract-it-when-they-realize-a-nejm-
reference-had-been-retracted/
27. Conclusions of a systematic reviews and meta-
analyses depend on EACH article they synthesize!
Retraction Watch: https://retractionwatch.com/2020/12/09/authors-of-
meta-analysis-on-heart-disease-retract-it-when-they-realize-a-nejm-
reference-had-been-retracted/
Fu, Yuanxi, Jodi Schneider. 2020. “Towards knowledge maintenance in
scholarly digital libraries with keystone citations.” In JCDL 2020,
217–226. doi:10.1145/3383583.3398514
28. Citing Article Cited Article
Cites
Support arguments
Under my keystone citation
framework:
1) A scientific research paper puts forward
at least one main finding, along with a
logical argument, giving reasons and
evidence to support the main finding.
2) The main finding is accepted (or not) on
the basis of the logical argument.
3) Evidence from earlier literature may be
incorporated into the argument by citing a
paper and presenting it as support, using
a citation context.
Citation Contexts
Arguments
modeled
Fu, Yuanxi, Jodi Schneider. 2020. “Towards knowledge maintenance in scholarly digital libraries with
keystone citations.” In JCDL 2020, 217–226. doi:10.1145/3383583.3398514
29. Step 4
Flag those articles
that are potentially
impacted
Workflow for Flagging Problematic Citations
Step 2
The domain expert
develops a list of screening
questions
Step 3
Experts/non-experts/text
mining tools screen target
articles using the checklist
Step 1
The domain expert develops
a generalized argument
model
Fu, Yuanxi, Jodi Schneider. 2020. “Towards knowledge maintenance in scholarly digital libraries with
keystone citations.” In JCDL 2020, 217–226. doi:10.1145/3383583.3398514
31. Reducing the Inadvertent Spread of Retracted Science
2020-
2021
Jodi Schneider jodi@illinois.edu
Information Quality Lab https://infoqualitylab.org/
Collaboration across diverse stakeholders: funders, editors, peer reviewers, authors,
publishers, database providers, research integrity officers, science journalists
Advisory Board
National Center
for Professional
and Research Ethics
Retraction Watch
JAMA
Annettte Flanagin
RN, MA, FAAN
Ivan Oransky, MD
C.K. Gunsalus, JD
London School of
Economics and Political
Science
Daniele Fanelli, PhD
32. Jodi Schneider MT Campbell Nathan Woods Yuanxi Fu Tzu-Kun (Esther) Hsaio
Randi Proescholdt Vivien Yip Yoss Arianlou Halle Burns
RISRS 2020 Team
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
33. The project overall is trying to address these 4 research questions:
1.What is the actual harm associated with retracted research?
2.What are the intervention points for stopping the spread of retraction? Which
gatekeepers can intervene and/or disseminate retraction status?
3.What are the classes of retracted papers? (What classes of retracted papers
can be considered citable, and in what context?)
4.What are the impediments to open access dissemination of retraction statuses
and retraction notices?
Project Motivation
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
34. Project Timeline
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
35. ● Involved about 70 people from across scholarly communication
● Interviewed 50 people from roles in research integrity, technology,
academia, libraries, publishing, journalism
● Held 3 4-hour workshops each engaging ~30 people
○ Day 1: Listening and Learning
○ Day 2: Collaborative Agenda Setting
○ Day 3: Commitment to Action & The Way Forward
Stakeholder Engagement To Date
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
36. ● Initial recommendations developed through discussion & survey
● Project website with web bibliography
● Research outcomes
○ Bibliometrics & Research Assessment Symposium 2020 talk
○ SIGMET 2020 workshop poster on data quality
○ Scientometrics paper on case study, "Continued post-retraction citation of a
fraudulent clinical trial report, 11 years after it was retracted for falsifying data”
● Two industry conference presentations scheduled
○ NISO Plus 2021: Thursday, February 25, 11:30 to 12:45 PM Eastern
○ Society for Scholarly Publishing 2021, May 24-27 (details to come)
Dissemination To Date
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
37. Misinformation and truth: from fake news to
retractions to preprints
NISO Plus 2021
Thursday, February 25, 11:30 to
12:45 PM Eastern
https://sched.co/fMnP
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
38. ● A coherent, actionable set of recommendations for next steps
● A white paper with the recommendations & context (preprint in March)
● Backgrounder to White House Office of Science & Technology Policy
● Dissemination of recommendations to professional organizations
● More research publications (citation analysis to QSS, lit review to ASIST)
● Proposals for future research
● Collaborations to start achieving the recommended outcomes
Outputs To Come - by July 2021
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
39. i. Make retraction information easy to find and use.
ii. Recommend retraction metadata and a taxonomy of retraction
statuses that can be adopted by all stakeholders.
iii. Develop best practices for coordinating the retraction process.
iv. Educate & socialize researchers and the public about retraction and
post-publication stewardship of the scientific record.
v. Develop standards, software, and databases to support sustainable
data quality.
Draft Recommendations v1.5
Reducing the Inadvertent Spread of Retracted Science
https://infoqualitylab.org/projects/risrs2020/
40. Bibliography
Cheng, Yi-Yun, Nikolaus Parulian, Tzu-Kun Hsiao, Ly Dinh, Janina Sarol, Jodi Schneider. 2019. “ReTracker: actively and automatically matching retraction metadata in Zotero.”
In ASIS&T Annual Meeting, 56(1): 372–376. doi:10.1002/pra2.32
COPE Council. COPE Guidelines: Retraction Guidelines. Version 2 November 2019 doi:10.24318/cope.2019.1.4
Dinh, Ly, M. Janina Sarol, Yi-Yun Cheng, Tzu-Kun Hsiao, Nikolaus Parulian, Jodi Schneider. “Systematic examination of Pre- and Post-Retraction Citations.” In ASIS&T Annual
Meeting, 56(1): 390–394. doi:10.1002/pra2.35
Fu, Yuanxi, Jodi Schneider. 2020. “Towards knowledge maintenance in scholarly digital libraries with keystone citations.” In JCDL 2020, 217–226.
doi:10.1145/3383583.3398514
Harris, Richard. 2020 March 26. “In Defense Of Coronavirus Testing Strategy, Administration Cited Retracted Study.” NPR. https://www.npr.org/sections/health-
shots/2020/03/26/822084429/in-defense-of-coronavirus-testing-strategy-administration-cited-retracted-study
Piller, C. (2021). Many scientists citing two scandalous COVID-19 papers ignore their retractions. Science.
Price, Gary. 2019 June 12. InfoDocket. Zotero and Retraction Watch Collaborate on New Service (Beta) That Notifies Users of Article Retractions in Their Personal Zotero
Libraries. https://www.infodocket.com/2019/06/12/zotero-and-retraction-watch-collaborate-on-new-service-beta-that-notifies-users-of-article-retractions-in-their-
personal-zotero-libraries/
Proescholdt, Randi & Jodi Schneider. 2020. Retracted Papers with Inconsistent Document Type Labeling in PubMed, Scopus, and Web of Science METRICS 2020 at ASIS&T. Poster presented
October 22, 2020 at the SIGMET METRICS 2020 workshop at ASIS&T 2020. http://jodischneider.com/pubs/sigmet2020.pdf
Reducing the Inadvertent Spread of Retracted Science: Shaping a Research and Implementation Agenda workshop: https://infoqualitylab.org/projects/risrs2020/
Retraction Watch: https://retractionwatch.com/2020/12/09/authors-of-meta-analysis-on-heart-disease-retract-it-when-they-realize-a-nejm-reference-had-been-retracted/
Schneider, Jodi, Di Ye, Alison M Hill, Ashley S Whitehorn. December 2020. “Continued post-retraction citation of a fraudulent clinical trial report, eleven years after it was
retracted for falsifying data.” In Scientometrics, 125(3):2877–2913. doi:10.1007/s11192-020-03631-1 Preprint:
http://jodischneider.com/pubs/scientometrics2020.pdf
44. Randi Proescholdt & Jodi Schneider. 2020. Retracted Papers with Inconsistent Document
Type Labeling in PubMed, Scopus, and Web of Science METRICS 2020 at ASIS&T
Many retracted articles are not indexed properly.
46. How big is retraction & citation to retraction?
• 800,000 articles directly cite a retracted paper.
(Rough estimate from Fu & Schneider 2020)
• The Retraction Watch Database lists over 19,000 retracted
papers. PubMed: nearly 8,000.
• In biomedicine 94% of retracted papers have received at least
one citation, with an average citation count of 35 (Dinh, …, Schneider 2019)
Dinh, …, Schneider 2019: “Systematic examination of Pre- and Post-Retraction Citations.” doi:10.1002/pra2.35
Fu & Schneider 2020: “Towards knowledge maintenance in scholarly digital libraries with keystone citations.”
doi:10.1145/3383583.3398514