Credibility Signals

Abstract

This document specifies various types of information, called credibility signals, which are considered potentially useful in assessing credibility of online information.

1. Introduction

1.1 Purpose

This document is intended to support an ecosystem of interoperable credibility tools. These software tools, which may be components of familiar existing systems, will gather, process, and use relevant data to help people more accurately decide what information they can trust online and protect themselves from being misled. We expect that an open data-sharing architecture will facilitate efficient research and development, as well as an overall system which is more visibly trustworthy.

The document has three primary audiences:

Software developers and computer science researchers wanting to build systems which work with credibility data. For them, the document aims to be a precise technical specification, stating what they need for their software to interoperate with any other software which conforms to this specification.
People who work in journalism and want to review and contribute to this technology sphere, to help make sure it is beneficial and practical.
Non-computer-science researchers, interested in helping develop and improve the science behind this work.

In general, we intend for this document to be:

Welcoming for implementers of systems using credibility data
Easy for non-tech folks to understand the proposed signals & contribute
Practical to maintain by the editors
Practical to contribute to, for a wide audience
A source of accurate guidance about signal quality and adoption

1.2 Credibility Data

The document builds on concepts and terminology explained in Technological Approaches to Improving Credibility Assessment on the Web. Our basic model is that an entity (human and/or machine) is attempting to make a credibility assessment — to predict whether something will mislead them or others — by carefully examining many different observable features of that thing and things connected with it, as well as information provided by various related or trusted sources.

To simplify and unify this complex situation, with its many different roles, we model the situation as a set of observers, each using imperfect instruments to learn about the situation and then recording their observations using simple declarative statements agreed upon in advance. Because those statements are inputs to a credibility assessment process, we call them credibility signals. (The term credibility indicators is sometimes also used.)

This document, then, is a guide to these signals. It states what each observer might say and exactly how to say it, along with other relevant information to help people choose among the possible signals and understand what it means when they are used.

Because this is a new and constantly-changing field, we do not simply state which signals should be used. Instead, we list possible signals that one might reasonably consider using, along with information we expect to be helpful in making the decision.

1.3 Example

[explain]

Assessing credibility of https://news.example/article-1

Looking at title

I consider it to be clickbait

It's clickbait because it's a cliffhanger

Looking at article

It cites scientific research

Looking at provider

Established in 1974

Owned domain since 2006

1.4 Factors in Selecting Signals

When building systems which use credibility signals and trying to decide which signals to use, there are different factors to weigh. This section is aspirational; we hope this document will in time provide guidance on all these factors.

1.4.1 Measurement Challenges

There are factors about how difficult it is to get an accurate value^[a]^[b] for the signal^[c]^[d]:

Do people independently observing it get approximately the same value?
Do observations vary with the culture, location, language, age, beliefs, etc, of the people doing the observation?
Would the same people make the same observation in future months or years?
How much time and effort does it take people to make the observation?
Do people need to be trained to make this specific observation?
What kind of general training do people need (eg a journalism degree) to do it?
How do machines compare to humans in making this observation, in terms of cost, quality, types of errors, and susceptibility to being tricked.

Many of these factors can be measured using inter-rater reliability (IRR) techniques. When studies have made such measurements, our intent is to include that data in this document.

Here is a table of the data we have. Excerpts are listed with the relevant signals.

Study

Signal Called

IRR

Rel Expts

p <

# obsr

obsv per

Zhang18

Title Representativeness

0.367

0.234

Zhang18

“Clickbait” Title

0.581

-0.709

0.001

Zhang18

Quotes from Outside Experts

0.673

0.327

Zhang18

Citation of Organizations

0.283

0.145

Zhang18

Citatation of Studies

0.763

0.107

Zhang18

Single Study Article

0.877

0.031

Zhang18

Confidence - Extent Claims Justified

-0.093

0.690

0.001

Zhang18

Confidence - Acknowledge Uncertainty

0.534

-0.247

Zhang18

Logical Fallacies - Straw Man

-0.096

-0.402

0.050

Zhang18

Logical Fallacies - False Dilemma

0.102

-0.303

Zhang18

Logical Fallacies - Slippery Slope

0.478

0.374

0.050

Zhang18

Logical Fallacies - Appeal to Fear

0.314

-0.424

0.050

Zhang18

Logical Fallacies - Naturalistic

0.377

-0.533

0.010

Zhang18

Tone - Emotionally Charged

0.098

0.611

0.001

Zhang18

Tone - Exaggerated Claims

0.235

0.606

0.001

Zhang18

Inference - Type of Claims

0.154

0.029

Zhang18

Inference - Convincing Evidence

0.540

0.764

0.001

Zhang18

Originality

0.346

Zhang18

Fact-checked

0.303

0.050

Zhang18

Representative Citations

0.312

0.001

1.4.2 Value in Credibility Assessment

Another important set of factors relates to how useful the measurement is in assessing credibility, assuming the observation itself is accurate.

Does the signal have a strong correlation to content accuracy, itself determined by consensus among experts^[e]^[f]?
Is it particularly indicative of credibility when used in combination with other signals? (For example, as part of computing the value of a latent variable.)
Is it conceptually easy for people to understand?
Do professionals in the field think it's likely to be a useful signal?
How dependent are these characteristics on the culture or time period being considered?
How dependent are these characteristics on the subject matter of the information being assessed for credibility?

1.4.3 Feedback Risks (“Gameability”)

One should also consider how the overall ecosystem of content producers and consumers might be changed by credibility tools adopting the signal. Once attackers see it’s being used, a signal that works well today might stop working, or even be used to make things worse. See Feedback Risks.

Is it disproportionately useful for attackers (eg viral call to action) ? If so, making this a negative credibility signal should generally be beneficial
Is it disproportionately expensive for attackers (eg journalistic language) ? If so, making this a positive credibility signal should generally be beneficial.
Who might get impacted by “friendly fire”? Even if adopting a signal might — on average — harm attackers more than everyone else, certain individuals or communities who have done nothing wrong might be penalized. Tradeoffs must be carefully made, ideally in a consensus process with the impacted people.

1.4.4 Interoperability

The value of sharing signal data depends on how that signal is used by other systems.

Are others producing data using this signal?
Are there useful data sets available?
Are others consuming data, paying attention to reported observations of this signal?
Are there tools which work with it, eg running statistics?
Is the definition clear and unambiguous, so people using it mean the same thing?
Are there clear examples?
Is there an open history of commentary, with questions and answers, and issues being addressed by various implementers?
Is documentation available in multiple languages?
If the definition is under development, how can one participate?
If the definition could possibly change, who might change it, and under what circumstances?
Are there any intellectual property considerations? See W3C Patent Policy.
Is there a test suite / validation system for helping confirm that an implementation is working properly?
Are there implementation reports, confirming that tools are functioning properly, according to the testing system? (For an example, see ActivityPub).

1.5 Publishing Credibility Data

TBD, basically follow schema.org technique using JSON-LD.

1.6 Consuming Credibility Data

TBD, point to some tools and the relevant specs. Basically JSON-LD.

1.7 Organization of this document

Section 1 (“Introduction”) provides instructions for how to use and help maintain this document, along with general background information.

The rest of this document, after the introduction, is a list of signals and information about them, as discussed in the introduction. The signals are organized into related groups, in hierarchical sections. At the lower levels of the hierarchy are the signals themselves, while the higher levels provide grouping of the signals, to help people understand them.

One important level of the hierarchy identifies the subject type of the signal. This is the conceptual entity being examined, considered, or inspected, when one makes the observation being recorded in the signal data. This could be imagined in different ways: when you are observing a claim made in the 3rd paragraph of an article published in some newspaper, are you observing the claim, the paragraph, the article, the newspaper, or even the author of the article? In general, we aim for the smallest granularity that makes sense, which in this case would probably be the claim.

At times, it may not be obvious to which subject type a signal belongs, or it could sensibly belong with several different ones. In this case, it might be moved to a different section in the document as people come to understand it better. When it’s not clear, there should be links from the places a signal could reasonably be to the place it actually is.

This may require discussion, and might remain open for debate. When a signal or group of signals makes sense in two places, consider linking it from the places it isn’t, to help people find it.

In many cases, a signal could be seen as a set of similar signals which are not strictly identical. This can be handled by adding additional signal headings with the finer distinction, when necessary. In this case, template statements might appear under more than one signal.

Note that sections may be moved and renumbered. Do not rely on section numbers remaining the same. For linking to a part of the document, consider using the gdocs h.xxxxxx fragment ids, provided by the Table of Contents; those should remain stable. Also, whenever changing a heading, especially a signal heading, if someone might be referring to it by name, please move the old text into a paragraph starting “Also called:”.

1.8 Template Statements

The most important thing about a signal definition is to be clear what observation the signal data is recording. If the signal heading is “Article length”, does that mean length in words or bytes or characters or some other metric? Does it include the title? For each signal, we want an easy way to communicate its definition that is short but clear, while being as detailed as necessary.

The technique we use here is to express the semantics of the signal using plain and simple sentences in natural language which convey the same knowledge as the signal data. If you imagine people using credibility software exchanging these statements (perhaps in text messages or on Twitter), you should get the right semantics. You can assume metadata, like who sent it and when it was sent is available, so the statements can include terms like “I” and “now”.

For machine-to-machine data Interoperability, these template sentences and the signal heading are turned into a data schema, after which the JSON-LD/schema.org/sematic web/linked data technology stack can be used.

The statements we use are templates because they abstract over a variety of similar sentences which differ in specific limited ways. For example, these statements:

I have examined the article at https://example.com/alice and find it highly credible
I have examined the article at https://example.com/brian and find it highly credible
I have examined the article at https://example.com/casey and find it highly credible

are all the same, except in the URL. We convey this using a template statement, which has a variable portion in square brackets, like:

I have examined the article at [subject] and find it highly credible

Tech note

If we (automatically or manually) map this template to a property with the pname :iHaveExaminedHighlyCredible, then the sentence number 2 above would be encoded in turtle as

{ <https://example.com/brian> :iHaveExaminedHighlyCredible true }.

Alternatively, we could make it a class, but boolean valued properties may be better, so that all signals remain as properties..

The bracketed template expression “[subject]” is required in every template, to indicate what entity is being observed. Additional bracket expressions can be used when there are other elements of the statement to make variable. In particular, [string] (for text in quotes) and [number].

(For now, try to just use those three. Software and documentation is being developed to allow more features. If you find this too restrictive, go ahead and write something else inside the square brackets and we'll deal with it later, but include a question mark so it's clear you knew you were making it up.)

An example needing multiple variables:

https://example.com/alice took 4.75 seconds to load, just now.
https://example.com/brian took 5.9 seconds to load, just now.

could be matched by:

[subject] took [number] seconds to load, just now.

1.9 Instructions for editing this document

As an experiment, this document is currently set so everyone can edit it, like Wikipedia. It is the Google docs version that is editable. We suggest you change the “Editing Mode” to “Suggesting” (using the pencil icon in the upper-right) until you are quite familiar with this document. You may also comment using the usual Google Docs commenting features.

If you make or suggest any edits to this document, you are agreeing to the W3C Community Contributor License Agreement which has significant copyright and patent implications.

The subsections below give some advice for how to make edits which are helpful.

1.9.1 Expand discussion

Each section should begin with a short introduction written with a neutral point of view, reflecting consensus about why the signal might be useful and what the risks might be. To enable consensus among a broad community, the intent is for this text to be developed iteratively, with each contributor adding their perspective while respecting what is already present.

Questions and minor concerns should generally be added as annotations using the “Add a Comment” function, without editing the document. If they become issues requiring back-and-forth discussion, they should be turned into github issues and linked from the most relevant place in this document with a paragraph starting “Issue:”

These discussion sections are intended to be nonnormative. That is, they do not say how software using the signal is required to behave for interoperability. The normative content of this specification is the template statements and the mapping of the statements to RDF.

1.9.2 Add new template statements

If you are confident you understand what a signal is intended to measure, and think you can provide a template statement which expresses it more clearly and simply, with little ambiguity, please add a new row to the bottom of the “Proposed template statements” table and add your entry. Please also put the next higher number in the Key field for reference, and your name in the By field. This “by” field is optional; it is intended to help simplify discussion, telling people who to talk to, and to give some credit. Listing the name of a large group in this field is not particularly useful.

After adding an entry, for a short time (perhaps a few hours, guided by any comments on it) it’s okay to edit it if you change your mind. After that, please leave it, and just add a new row for the new version. You can put new versions in the middle of the table and use keys like 1a.

1.9.3 Add new signals

Once you’re familiar with the structure of this document and all the signals in your area of interest, you may add new signal sections (with a title starting “Signal:” or even new group sections. (For heading numbering, you can use the “Table of contents” add-on from LumApps to number the headers. Or just leave the numbering for someone else using the add-on.)

When you add a new signal, please copy this table to the new section, and then fill in at least one row to clarify what the signal data conveys.

Key	Proposed Template Statement	By

1.10 Contributors

Folks who add content to this document are encouraged to add themselves in this section, potentially with some affiliation & credential information. This also allows the “By” column to stay short, as people can use short forms of names (eg only first or last name, if unique in this doc).

Entries marked as by “Credibility Coalition” are prior work by members of the Credibility Coalition. At the time, individual authorship information was not maintained. Moving forward, specific authorship detail is welcome.
(add yourself here...)

1.11 Sources

This document is assembled from multiple data sources. They provide both the overall structure of this document and the details about each signal, include definitions, example data, and implementation status.These sources are fetched started with a source list, which appears as the first entry below. In general, text in this document links back to its source with a link-out icon.

The sources used for this current view were:

Source

Time Loaded

Source List

2020-02-02T21:21:56.155Z

Root

2020-02-02T21:21:56.158Z

CSVDemo

2020-02-02T21:21:56.426Z

Zhang18

2020-02-02T21:21:56.421Z

Hawke

2020-02-02T21:21:56.427Z

cciv

2020-02-02T21:21:56.637Z

If you want to privately experiment with bookmarkable alternative views generated using a different source list, try Custom View of Credibility Signals.

2. Subject type: Claim

This section is for signals about claims.

A claim is “an assertion that is open to disagreement; equivalently, a meaningful declarative sentence which is logically either true or false (to some degree); equivalently, a proposition in propositional logic.” [credweb report]

Claims can be stated (with various decree of clarity) in some content or implied by the content (even non-textual content, like a photograph).

Claims are usually the smallest practical granularity. Credibility data about claims is largely focussed on what other sources have said about that claim, as in fact checking, but could also involve relationships between claims and textual analysis of claim text.

2.1 Claim Review

The “ClaimReview” model developed at s^[g]^[h]^[i]chema.org grows out of the tradition of independent, external fact-checking, as in PolitiFact. With this model, a fact-checker reviews a claim, typically made by a public figure, and then publishes a review of that claim, a “claim review”. Within schema.org, this parallels other reviews, like restaurant reviews.

[ Can we fit claimreview neatly into this observer/signal model? It’s a bit of a stretch. TBD. ]

2.1.1 Signal: Fact-check status of claim

From Section 7.7.1. Signal: Article has a central claim, claims in articles according to Credibility Coalition WebConf2018 and more recent studies includes the following values for fact-check results at the time of the study: false, true, unclear, mixed; not finding a fact-check is equivalent to an empty statement.

Interoperability with ClaimReview: This signal seems to relate to https://schema.org/reviewRating and bestValue of https://schema.org/ClaimReview, with bestValue in this case equal to VERIFIED; further discussion is needed with members of schema.org to confirm.

2.1.2 Signal: Fact-check status of claim — VERIFIED

Ref	Definition (Template)	Tags
0bd70468	An IFCN signatory did a fact-check and verified claim [Claim].
cbd33df5	The fact-check result by [Venue] of claim [Claim] is that it is TRUE.

2.1.3 Signal: Fact-check status of claim — REFUTED

Ref	Definition (Template)	Tags
16c7ee48	An IFCN signatory did a fact-check and refuted claim [Claim].
97e6d1a0	The fact-check result by [Venue] of claim [Claim] is that it is FALSE.

2.1.4 Signal: Fact-check status of claim — UNCLEAR

Ref	Definition (Template)	Tags
c9434942	The fact-check result by [Venue] of [Claim] is UNCLEAR.

2.1.5 Signal: Fact-check status of claim — MIXED

Ref	Definition (Template)	Tags
9a0da6b7	The fact-check result by [Venue] of [Claim] is that the claim contains elements that are TRUE and FALSE.

2.1.6 Signal: Claim - Risk of Harm

Ref	Definition (Template)	Tags
d0d44331	[Claim] is a claim that asserts a risk of harm.

Developed for CredCo Political Indicators Study 2018-19. Can be used in connection with 7.2.3. Signal: Generalization/Characterization of Group.

2.1.7 Signal: Claim - Coded Meaning

Ref	Definition (Template)	Tags
9f86e8b2	[ClaimA] is a claim that equals another claim, [ClaimB].[j]

Developed for CredCo Political Indicators Study 2018-19. Original example question: “Are there claims that contain phrases, words, or coded language that have taken on a special loaded meaning, in the understanding of the speaker and audience?”, with an example of "go to work," used as code for killing during the Rwandan genocide.

Can be used in connection with 7.2.3. Signal: Generalization/Characterization of Group.

2.2 Fact-checking Organization[k][l]

Signals below [2.2.1. Signal: Fact-checking Organization commitments — member of the IFCN, 2.2.2. Signal: Fact-checking Organization commitments — accuracy and professionalism, and 2.2.3. Signal: Fact-checking Organization commitments — unknown ] were developed in combination with those under 7.8. Claims in Articles, and originally expressed as a question:

If the publication is from a fact-checking organization, what are its commitments to accuracy and other standards?

A) IFCN Signatory

B) Not IFCN signatory but organization/institution with similar standards and commitments

C) Unknown, not discernable

2.2.1 Signal: Fact-checking Organization commitments — member of the IFCN

Ref	Definition (Template)	Tags
177559e3	[Organization] which published fact-check [Webpage] is a member of the International Fact-Checking Network at Poynter (IFCN). on [date].[n][o][p][q]

2.2.2 Signal: Fact-checking Organization commitments — accuracy and professionalism

Ref	Definition (Template)	Tags
ec6b8ebf	[Organization] has expressed commitments to accuracy and other fact-checking professional standards similar to IFCN organizations.
35a66989	[Organization] has expressed commitments to accuracy and other fact-checking professional standards.

2.2.3 Signal: Fact-checking Organization commitments — unknown

Ref	Definition (Template)	Tags
c0632f51	[Organization]’s commitments to accuracy and other professional standards are unknown.

2.3 Explicitly Unverified Claims

From CredCo Political Indicators Study 2018-2019: in some cases, articles may reference claims or pieces of information that do not contain citations or references. In some cases, within an article, an author can make explicit reference to a claim that has not been verified, using language that specifies that the claim has not been validated or proven to be true. This includes language in an article explicitly referencing that a claim has not yet been verified to date - but the claim is being mentioned in the article nonetheless.

This is used in connection with 7.7.2. Signal: Article has a claim.

Key	Proposed Template Statement	By
1	[Claim] is explicitly unverified, containing language such as “charges have not been proven true.” ^[r]	CredCo

3. Subject type: Text

Includes: phrase, sentence, paragraph, document, document fragment

A text, in this sense, is a sequence of words, with the usual punctuation, and sometimes embedded multimedia content or meaningful layout, like tables. That is, it’s a document or portion of a document. As examples, a phrase, sentence, paragraph, document section, book chapter, book, and complete book series would typically each count as a text.

Signals here concern properties of the text, itself, separate from how it might be published (eg on a Web Page, on a billboard, spoken at a rally) or where it might be published (in some Venue). The text should be considered immutable: a text (in this sense) doesn't change. If you take a text and change it, you are making a new text, which needs to be reexamined, to see which observations (and thus which signal data) applies to this other, new text.

Issue: (tech) How to represent texts in RDF? Options include annotation URL with secure hash, annotation object URL with secure hash, data: URI, etc.

3.1 Formality

Texts adopt a tone to appeal to their audience and/or attempt to convey how the text should be used. For instance, an academic study is written in formal, verbose and grammatically correct language, while a listicle is short, informal and often humorous. The academic study uses these characteristics to convey authority, while the listicle is intentionally unauthoritative.

3.1.1 Signal: Formal tone

Ref	Definition (Template)	Tags
d721df20	Text of [subject article] has a formal tone.

3.1.1.1 Signal: Correct Spelling

Ref	Definition (Template)	Tags
0255f049	Text of [subject article] has a formal tone, as measured by correct spelling.

3.1.1.2 Signal: Correct Grammar

Ref	Definition (Template)	Tags
7bd59393	Text of [subject article] has a formal tone, as measured by correct grammar.

3.1.2 Signal: Informal tone

Incorrect or colloquial grammar, slang, and humor are some indications of informal tone.

Ref	Definition (Template)	Tags
381634fc	Text of [subject article] has an informal tone.

3.1.2.1 Signal: Slang

Ref	Definition (Template)	Tags
a5304038	Text of [subject article] has an informal tone, as measured by slang.

Example sentence: "In this moment we all learned that Johnny Depp isn't a teen and has no clue what "Bae" means." (Source)

3.1.2.2 Signal: Informal grammar

Ref	Definition (Template)	Tags
275e2271	Text of [subject article] has ALL CAPS words for emphasis.
d73fb6ec	Text of [subject article] has an informal tone, as measured by incorrect, casual or colloquial grammar.
1338ba72	Text of [subject article] has consecutive exclamation points.
6fc95996	Text of [subject article] has consecutive question points.

Example sentence: "If you're a Friends fan, you probably know that Ross and Rachel's relationship was...kind of a disaster 95% of the time." (Source)

3.2 References or citations

3.2.1 Signal: Uses standardized references or citations

These standards are required and enforced by professions that demand accuracy, and are typically found in highly researched, and therefore more authoritative, texts. Examples: Legal, academic, or scientific citations, e.g., MLA, APA.

Ref	Definition (Template)	Tags
c3b7d174	Text of [subject article] uses standardized references or citations.

Example sentence: "Changes in body temperature have long been used as an indicator of injury, inflammation or infection in veterinary medicine (George et al., 2014), however, the use of

temperature devices such as rectal thermometers and thermal microchips can be both invasive

and time consuming (Johnson et al., 2011)." (Source)

3.2.2 Signal: Uses formal but not standardized references or citations

Examples: Journalism, nonfiction or explanatory material

Some texts use references extensively, even if they are not written according to a rigid structure. These texts tend to be authoritative but not as authoritative as the texts using the rigidly structured citations. The content of the references is also extremely influential.

Ref	Definition (Template)	Tags
1e9f797d	Text of [subject article] uses references or citations that are not recorded according to professional standards.

Example sentence: "Families that receive benefits are now over $2,600 worse off every year, according to an analysis by the Child Poverty Action Group, an advocacy group." (Source)

3.2.3 Signal: Few to zero references or citations

A text with no references to other materials is original content, which often means it is opinion, personal experience, or even fiction. These tend to be less authoritative than texts with references.

Ref	Definition (Template)	Tags
baa89d32	Text of [subject article] has few or no references or citations.

One exception is a first-hand account, which can become a primary document for later research. These personal accounts, however, should be vetted and cross-referenced with other sources to evaluate its accuracy.

Example sentence: "The shrine is the work of SUNY Purchase sophomore Phillip Hosang, who, like a lot of students at the school, had long heard rumors about a secret room in a men's bathroom somewhere in the visual arts building." (Source)

3.3 Pronouns

3.3.1 Signal: Many or multiple instances of the pronouns "I" or "you"

Texts that use the pronouns "I" or "you" are typically opinion, correspondence or personal account. These texts are usually not trying to be authoritative or explanatory, however, they sometimes form a primary document that is used in secondary research.

Ref	Definition (Template)	Tags
43280bee	Text of [subject article] has many instances of the words "I" or "you."

Example sentence: "After paying close attention to many of your campaigns, I believe you are united by a desire to get things done to help a lot of people who’ve been left behind." (Source)

3.3.2 Signal: Few or no instances of "I" or "you"

Texts that do not use first or second person are less likely to be opinion content. However, this is no indication of credibility.

Ref	Definition (Template)	Tags
dcbf79f6	Text of [subject article] has few or no instances of the words "I" or "you."

"President Trump said he would not overrule his acting attorney general, Matthew G. Whitaker, if he decides to curtail the special counsel probe being led by Robert S. Mueller III into Russian interference in the 2016 election campaign." (Source)

3.4 Signal: Vocabulary or reading level

Texts with a wide and varied vocabulary, which may include jargon or uncommon words, is an indicator of formal tone.

3.5 Incivility and impoliteness

3.5.1 Signal: Incivility

Ref	Definition (Template)	Tags
fb2e81c3	Text of [subject article] contains stereotypes, such as calling a person a “faggot,” “terrorist,” or “backward” (e.g. “Muslims are terrorist sympathizers”)
ca34c8ce	Text of [subject article] contains threats to people’s individual rights, such as freedom of speech or personal freedom (e.g. “You foolish Republicans better shut up”)
929fa567	Text of [subject article] contains verbalized threat to democracy, such as a proposal to overthrow democratic government by force or undemocratic way (e.g. “Obama is a Muslim Agent with Brotherhood Ties. American people must take him down.”)

Source: Oz, M., Zheng, P., Chen, G. M., & Park, R. H. (2018). Twitter versus Facebook: Comparing incivility, impoliteness, and deliberative attributes. New Media & Society, 20(9), 3400–3419. http://doi.org/10.1177/1461444817749516

3.5.2 Signal: Impoliteness

Ref	Definition (Template)	Tags
700e99be	Text of [subject article] contain insults or name-calling (e.g. “stupid” or “moron”)
5491ed97	Text of [subject article] contains profanity (e.g. “hell” and “damn”)
ec40d14f	Text of [subject article] contains words in all capital letters (e.g. “Who flew the planes into the towers on 9/11? ILLEGAL IMMIGRANTS!”)

3.6 Text type

Editor note: This should probably be abstracted to all different types of contents.

3.6.1 Signal: Text type is news

Ref	Definition (Template)	Tags
18da5ac5	[Text] appears to be news.

3.6.2 Signal: Text type is opinion

Ref	Definition (Template)	Tags
e1f7bea2	[Text] appears to be an opinion piece
76519eb9	[subject article] URL contains directory name or file name indicating opinion
639cf2d6	[subject article] is self-labeled opinion

Examples: #2: Opinion, Perspective, Editorial, Commentary, etc.
#3: https://www.nytimes.com/2019/02/28/opinion/alexandria-ocasio-cortez-cohen-hearing.html

3.6.3 Signal: Text type is satire

Ref	Definition (Template)	Tags
0cc29b00	[Text] appears to be a satire piece
a151c9c3	[source] is self-described satire site
f6fe3e31	[subject article] URL contains directory name or file name indicating satire
4cc17948	[subject article] is self-labeled satire

Examples: #2: Satire, humor, etc.
#3: https://www.newyorker.com/humor/borowitz-report/mueller-says-he-has-obtained-trumps-sat-scores
#4: http://www.thedailyrash.com/about “The Daily Rash is satire! Merely a parody of the life that we watch around us daily. We spoof the famous and not so famous people who fill our lives with beauty and who bring us so much joy. Any similarities between our stories and real life are coincidental. Nothing here is very true.”

3.6.4 (Section with no title?)

Ref	Definition (Template)	Tags
71a19757	[Photo] is mostly likely original.

Ref	Definition (Template)	Tags
c5f9cdf0	[Photo] appears to be a copy of one or more image, with some portions modified or photoshopped

Ref	Definition (Template)	Tags
290357c3	[Photo][has been previously published by another source and its origins are not attributed.

Ref	Definition (Template)	Tags
8b9e10a9	[Photo] has been extensively altered from its original form in a way that changes the meaning.

5. Subject type: Audio

Also called: Audio Clip, Sound Clip, Audio Recording

5.1 Audio type

Editor note: This should probably be abstracted to all different types of contents.

5.1.1 Signal: Audio type is news

Ref	Definition (Template)	Tags
18da5ac5	[Audio] appears to be news.

5.1.2 Signal: Audio type is opinion

Ref	Definition (Template)	Tags
e1f7bea2	[Audio] appears to be an opinion piece

5.1.3 Signal: Audio type is advertising or marketing.

Ref	Definition (Template)	Tags
25efaf85	[Audio] appears to be advertising or marketing.

5.1.4 Signal: Audio roles - host

Ref	Definition (Template)	Tags
bdde72c3	[Audio] has an in-studio host.

5.1.5 Signal: Audio roles - reporter

Ref	Definition (Template)	Tags
941fef55	[Audio] has a reporter.

5.1.6 Signal: Audio roles - members of the public

Ref	Definition (Template)	Tags
2afde61a	[Audio] has interviews with members of the public.

5.1.7 Signal: Audio roles - experts and/or officials

Ref	Definition (Template)	Tags
4696bbb1	[Audio] has interviews with expert and/or official sources

5.1.8 Signal: Studio conversation

Ref	Definition (Template)	Tags
6b6d067f	[Audio] has conversation between host and interviewee who is not a reporter.

5.1.9 Signal: Call-ins

Ref	Definition (Template)	Tags
28c8729a	[Audio] has call-ins from members of the public.

5.1.10 Signal: Studio

Ref	Definition (Template)	Tags
8fe1900f	[Audio] sounds like it was at least partially recorded in a studio.

5.1.11 Signal: Outside

Ref	Definition (Template)	Tags
ea839d7b	[Audio] sounds like it was at least partially recorded outdoors.

5.2 Signal: Station/company identification

Ref	Definition (Template)	Tags
2e3c943a	Station or company that produced the [audio] is identified.

5.3 Signal: Host/reporter identification

Ref	Definition (Template)	Tags
2d3a576f	Host of [audio] identifies themselves.
52e6532c	Reporter of [audio] identifies themselves

5.4 Signal: Quoted individuals are identified.

Ref	Definition (Template)	Tags
4b5d600a	Individuals quoted in [audio] are identified by affiliation, if being quoted in a professional capacity.
dc425827	Individuals quoted in [audio] are identified by name.

5.5 Signal: Attribution

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.

5.6 Rhetoric

5.6.1 Signal: Proportional rhetoric

Editor: These should go to some category that includes both text and audio and video. Linguistic content.

Ref	Definition (Template)	Tags
fe3b53d1	The rhetoric used in [audio] is proportional to the event or situation described.

5.6.2 Signal: Extreme Exaggerating Rhetoric

Ref	Definition (Template)	Tags
853a3706	The rhetoric used in [audio] is an extreme exaggeration of the event or situation described.

5.6.3 Signal: Extreme Minimizing Rhetoric

5.6.3.1 (Section with no title?)

Key	Proposed Template Statement	By
1	The rhetoric used in [audio] is an extreme minimization of the event or situation described.	Tamar Wilner, adapting Credibility Coalition

5.7 Emotional valence

5.7.1 Signal: Extremely negative valence

Ref	Definition (Template)	Tags
c11d9c4c	The language of the reporter or main speaker in the [audio] is extremely negative.

5.7.2 Signal: Extremely positive valence

Ref	Definition (Template)	Tags
af0af713	The language of the reporter or main speaker in the [audio] is extremely positive.

5.7.3 (Section with no title?)

5.7.4 Signal: Neutral valence

Ref	Definition (Template)	Tags
88d6b211	The language of the reporter or main speaker in the [audio] is neutral.

5.8 Sound quality

5.8.1 Signal: Clear speech

Ref	Definition (Template)	Tags
854f4591	There is audio distortion making speech in the [audio] difficult to understand.

5.9 Music

5.9.1 Signal: Emotional music

Ref	Definition (Template)	Tags
72ecf4dd	The [audio] contains music that appears designed to manipulate listener emotions.

5.9.2 Signal: Loud music

Ref	Definition (Template)	Tags
1e4e78a0	The [audio] contains music loud enough to make speech difficult to hear.

6. Subject type: Video

Also called: Video Clip, Video Recording, Movie

6.1 Signal: Video type is news

Ref	Definition (Template)	Tags
18da5ac5	[Video] appears to be news.

6.1.1 Signal: Video type is opinion

Ref	Definition (Template)	Tags
e1f7bea2	[Video] appears to be an opinion piece

6.1.2 Signal: Video type is advertising or marketing.

Ref	Definition (Template)	Tags
25efaf85	[Video] appears to be advertising or marketing.

6.2 Signal: Station/company identification

Ref	Definition (Template)	Tags
2e3c943a	Station or company that produced the [audio] is identified.
2e3c943a	Station or company that produced the [video] is identified.

Ref	Definition (Template)	Tags
2e3c943a	Station or company that produced the [audio] is identified.
2e3c943a	Station or company that produced the [video] is identified.

6.3 Signal: Host/reporter identification

Ref	Definition (Template)	Tags
2d3a576f	Host of [audio] identifies themselves.
2d3a576f	Host of [video] identifies themselves.
52e6532c	Reporter of [audio] identifies themselves
52e6532c	Reporter of [video] identifies themselves

Ref	Definition (Template)	Tags
2d3a576f	Host of [audio] identifies themselves.
2d3a576f	Host of [video] identifies themselves.
52e6532c	Reporter of [audio] identifies themselves
52e6532c	Reporter of [video] identifies themselves

6.4 Signal: Quoted individuals are identified.

Ref	Definition (Template)	Tags
4b5d600a	Individuals quoted in [audio] are identified by affiliation, if being quoted in a professional capacity.
dc425827	Individuals quoted in [audio] are identified by name.
4b5d600a	Individuals quoted in [video] are identified by affiliation, if being quoted in a professional capacity.
dc425827	Individuals quoted in [video] are identified by name.

Ref	Definition (Template)	Tags
4b5d600a	Individuals quoted in [audio] are identified by affiliation, if being quoted in a professional capacity.
dc425827	Individuals quoted in [audio] are identified by name.
4b5d600a	Individuals quoted in [video] are identified by affiliation, if being quoted in a professional capacity.
dc425827	Individuals quoted in [video] are identified by name.

6.5 Signal: Attribution

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
31755cbb	[Video] does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
31755cbb	[Video] does not include attribution.

6.6 Rhetoric

6.6.1 Signal: Proportional rhetoric

Editor: These should go to some category that includes both text and audio and video. Linguistic content.

Ref	Definition (Template)	Tags
fe3b53d1	The rhetoric used in [audio] is proportional to the event or situation described.
fe3b53d1	The rhetoric used in [video] is proportional to the event or situation described.

Ref	Definition (Template)	Tags
fe3b53d1	The rhetoric used in [audio] is proportional to the event or situation described.
fe3b53d1	The rhetoric used in [video] is proportional to the event or situation described.

6.6.2 Signal: Extreme Exaggerating Rhetoric

Ref	Definition (Template)	Tags
853a3706	The rhetoric used in [audio] is an extreme exaggeration of the event or situation described.

6.6.2.1 (Section with no title?)

Key	Proposed Template Statement	By
1	The rhetoric used in [video] is an extreme exaggeration of the event or situation described.	Tamar Wilner, adapting Credibility Coalition

6.6.3 Signal: Extreme Minimizing Rhetoric

6.6.3.1 (Section with no title?)

Key	Proposed Template Statement	By
1	The rhetoric used in [video] is an extreme minimization of the event or situation described.	Tamar Wilner, adapting Credibility Coalition

6.7 Emotional valence

6.7.1 Signal: Extremely negative valence

Ref	Definition (Template)	Tags
c11d9c4c	The language of the reporter or main speaker in the [audio] is extremely negative.
c11d9c4c	The language of the reporter or main speaker in the [video] is extremely negative.

Ref	Definition (Template)	Tags
c11d9c4c	The language of the reporter or main speaker in the [audio] is extremely negative.
c11d9c4c	The language of the reporter or main speaker in the [video] is extremely negative.

6.7.2 Signal: Extremely positive valence

Ref	Definition (Template)	Tags
af0af713	The language of the reporter or main speaker in the [audio] is extremely positive.
af0af713	The language of the reporter or main speaker in the [video] is extremely positive.

Ref	Definition (Template)	Tags
af0af713	The language of the reporter or main speaker in the [audio] is extremely positive.
af0af713	The language of the reporter or main speaker in the [video] is extremely positive.

6.7.3 (Section with no title?)

6.7.4 Signal: Neutral valence

Ref	Definition (Template)	Tags
88d6b211	The language of the reporter or main speaker in the [audio] is neutral.
88d6b211	The language of the reporter or main speaker in the [video] is neutral.

Ref	Definition (Template)	Tags
88d6b211	The language of the reporter or main speaker in the [audio] is neutral.
88d6b211	The language of the reporter or main speaker in the [video] is neutral.

6.8 On-screen text

Relates to all on-screen text including chyrons, attributions…?

Signals for identification of quoted individuals; rhetoric; valence; what else?

CHYRON DISAGREEMENT WITH AUDIO

6.9 Images

6.9.1 Moving images

Video or film.

Signals for valence, what else?

6.9.2 Data graphics

6.9.2.1 Signal: Attribution

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.

6.9.2.2 (Section with no title?)

6.9.2.3 Signal: Graph Y-axis does not start at zero.

Issue: Should graphic display of data get their own subject-category? Charts? Cf Tufte

Ref	Definition (Template)	Tags
5c73f2de	[Video] data graphic y-axis starts at a number other than zero..

6.9.3 Still photography

6.9.3.1 Signal: Attribution

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.
364044b4	[Video] still photography does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.
364044b4	[Video] still photography does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.
364044b4	[Video] still photography does not include attribution.

Ref	Definition (Template)	Tags
bf3c76e5	[Audio] does not include attribution for the claims made.
384bbffc	[Video] data graphic does not include attribution.
31755cbb	[Video] does not include attribution.
364044b4	[Video] still photography does not include attribution.

6.9.4 Other graphics

Signals for rhetoric, valence, what else?

6.10 Music

Signals for valence and drama/exaggeration?

7. Subject type: Article

Includes: News Story, News Article, Scientific Paper, Blog Post

An article is a collection of information intended to convey some information, usually factual, usually created by one or more identifier people, and usually released at a specific point in time in some venue. It consists of elements like a body, a title, a publication date, and an author list. Unlike Texts, where any change makes it a different Text, an Article may be revised over time and still be considered the same Article (albeit a different version). Usually only minor changes are socially appropriate, however. Consumers of credibility data may need to be cautious of which version an observation applies to.

If an article appears on a web page, or in a portion of a web page, we can use its URL to identify the article.

Differentiation between Article and Text. Consider whether the signal data would be the same if the text were moved to a different article, perhaps published in a different venue, with a different title, at a different time, and with other text before or after it in some article. If the observation would be the same, then the signal is a property of the text, not the article. In that case it be in 3. Subject type: Text not here.

7.1 Originality

7.1.1 Originality Types

7.1.1.1 Signal: Most Likely Original

Ref	Definition (Template)	Tags
e595c4b2	Text of [subject article] is mostly likely original.
71a19757	[Photo] is mostly likely original.

Ref	Definition (Template)	Tags
e595c4b2	Text of [subject article] is mostly likely original.
71a19757	[Photo] is mostly likely original.

7.1.1.2 Signal: Appears to be a Copy, with Some Different Portions

Ref	Definition (Template)	Tags
6acabe52	Text of [subject article] appears to be a copy of one or more articles, with some portions different or remixed

7.1.1.3 Signal: Quotes Extensively From Another Source

Ref	Definition (Template)	Tags
8001f76c	Text of [subject article] quotes extensively from another source, with some original content

7.1.1.4 Signal: Wholesale Duplicate

Ref	Definition (Template)	Tags
69857b93	Text of [subject article] is a wholesale duplicate of another article

7.1.2 Attribution of Non-Original Content

These signals assume that the content has already been flagged as not original.

7.1.2.1 Signal: Attribution Given and Accurate[t]

Ref	Definition (Template)	Tags
9ce28cf4	[subject article] includes accurate attribution, pointing to the original.

7.1.2.2 Signal: Attribution Given and Inaccurate

Ref	Definition (Template)	Tags
334e1ec7	[subject article] includes inaccurate attribution.

7.1.2.3 Signal: Attribution Not Given

Ref	Definition (Template)	Tags
31755cbb	[subject article] does not include attribution.

7.1.2.4 Signal: Unclear Which is Original

Ref	Definition (Template)	Tags
b1794319	[subject article] is a copy, but it is unclear which is the original.

7.1.3 (Section with no title?)

7.1.4 Personal Perspective

These signals help parse author perspective on the content of the article.

7.1.4.1 Signal: Article contains personal perspective on lived experience

Ref	Definition (Template)	Tags
9f948295	[subject article] includes “I” statements AND recounts personal lived experience
f275cebe	[subject article] includes “I” statements and does NOT recount personal lived experience

7.1.4.2 “I” statements can signal author conjecture or personal experience. The former may be more likely to contain misinformation while the latter is necessary to recount first-hand research.

7.2 Language and Rhetoric

To-do: Move Rhetoric to a different bucket, not Article.

7.2.1 Rhetorical Proportionality

7.2.1.1 Signal: Proportional Rhetoric

Ref	Definition (Template)	Tags
fe3b53d1	The rhetoric used in [Text] is proportional to the event or situation described.

7.2.1.2 Signal: Extreme Exaggerating Rhetoric

Ref	Definition (Template)	Tags
853a3706	The rhetoric used in [Text] is an extreme exaggeration of the event or situation described.
853a3706	The rhetoric used in [audio] is an extreme exaggeration of the event or situation described.

Ref	Definition (Template)	Tags
853a3706	The rhetoric used in [Text] is an extreme exaggeration of the event or situation described.
853a3706	The rhetoric used in [audio] is an extreme exaggeration of the event or situation described.

7.2.1.3 Signal: Extreme Minimizing Rhetoric

Ref	Definition (Template)	Tags
853a3706	The rhetoric used in [Text] is an extreme exaggeration of the event or situation described.

7.2.2 Signal: Emotional Valence

Could be measured by VADER (Valence Aware Dictionary and sEntiment Reasoner) Natural Language Processing library

Ref	Definition (Template)	Tags
d3f90e88	Is the language extremely negative, extremely positive, or somewhere in the middle? ***	idea(cciv)

7.2.2.1 Signal: Extremely Negative Valence

Ref	Definition (Template)	Tags
7a192974	The language in [Text] is extremely negative.

7.2.2.2 Signal: Extremely Positive Valence

Ref	Definition (Template)	Tags
c07b1b5d	The language in [Text] is extremely positive.

7.2.2.3 Signal: Neutral Valence

Ref	Definition (Template)	Tags
d8c8293c	The language in [Text] is neutral.

7.2.3 Signal: Polarizing Language

Ref	Definition (Template)	Tags
d8e3739c	[Text] uses language such as “pro” and “anti,” signaling a division into two sharply contrasting groups or sets of opinions or beliefs.

Developed for CredCo Political Indicators Study 2018-19. Taken from the Oxford Living Dictionary’s definition of polarization as the “division into two sharply contrasting groups or sets of opinions or beliefs.” Can be used in combination with 7.8. Claims in Articles.

7.2.4 Signal: Generalization/Characterization of Group

Ref	Definition (Template)	Tags
1a6fe948	[Text] in [Content-Object] [u][v][w]characterizes a group or groups of people along lines that explicitly differentiate them from others.

Developed for CredCo Political Indicators Study 2018-19.This can apply to situations in which the author is associated with the defined group or defining an external group. Can be used in combination with 7.8. Claims in Articles and other “Content-Objects.”

7.2.5 Signal: Dehumanization

Ref	Definition (Template)	Tags
7fcf4da7	[Text] equates a human individual or group(s) as insects, bacteria, despised animals, cancer — less than human beings.

Developed for CredCo Political Indicators Study 2018-19. See https://dangerousspeech.org/about-dangerous-speech/.

7.2.6 Signal: Exhortation

This signal is meant to capture exhortations, or “an address or communication emphatically urging someone to do something.”

Ref	Definition (Template)	Tags
9a77b9cf	[Text] is an address that exhorts, or urges someone to do something.

7.2.7 Signal: Call to Violence

This signal is meant to capture a call to violence. Perhaps also expressed as part of ‘Dangerous Speech’: “ any form of expression (speech, text, or images) that can increase the risk that its audience will condone or participate in violence against members of another group” (see https://dangerousspeech.org/about-dangerous-speech/).

Ref	Definition (Template)	Tags
b051c9b2	[Text] contains language that can be understood as a call to violence or seems harmful.

Developed for CredCo Political Indicators Study 2018-19.

7.2.8 Signal: Call to Action (Political)

This signal is meant to capture a textual call to action, not to be confused with a marketing call to action https://en.wikipedia.org/wiki/Call_to_action_(marketing). Sometimes, these calls to action are also associated with requests for enacting/executing an action as an expression of one’s loyalty, identity, or affiliation.

Ref	Definition (Template)	Tags
70dd369e	[Text] contains language that can be understood as a political call to action, which requests readers to follow-through with a particular task, or tells readers what to do such as: signing online petitions, joining a mailing list, giving donations, voting, protesting, boycotting.

Developed for CredCo Political Indicators Study 2018-19.

7.3 Logic/Reasoning

7.3.1 Types of Bias

7.3.1.1 Signal: Confirmation Bias

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)
5b19acfa	Text of [article title] contains examples of …..

7.4 Outbound References

7.4.1 Source Types

7.4.1.1 Signal: No Source Type Cited

Ref	Definition (Template)	Tags
bf2e6507	There is no source cited in [subject article].

7.4.1.2 Signal: Domain Expert Cited

Ref	Definition (Template)	Tags
af7ab75e	There is an expert cited in [subject article].

7.4.1.3 Signal: Study Cited

Ref	Definition (Template)	Tags
bfe3b411	There is a study cited in [subject article].

7.4.1.4 Signal: Unaffiliated Expert Cited about Study

Ref	Definition (Template)	Tags
a6ee73ba	There is an unaffiliated expert cited about [study] in [subject article].

This I believe is the best practice in reporting on scientific studies.

7.4.1.5 Signal: Organization Cited

Ref	Definition (Template)	Tags
56ecd8ab	There is an organization cited in [subject article].

7.4.1.6 Signal: Other Type of Source Cited

Ref	Definition (Template)	Tags
ccb6072c	There is another type of sourced cited in [subject article].

7.4.1.7 Signal: Anonymous Sources Cited

Ref	Definition (Template)	Tags
5ab55dfc	One or more anonymous sources are cited in [subject article].

7.4.1.8 Signal: Single Anonymous Sources Materially Cited

Ref	Definition (Template)	Tags
db18b042	A single anonymous source is materially cited in [subject article].

Would the interpretation of the article be substantively different without the single anonymous source.

7.4.1.9 Signal: Anonymous Sources Materially Cited

Ref	Definition (Template)	Tags
27096b57	One or more anonymous sources are materially cited in [subject article].

7.4.1.10 Signal: Multiple Anonymous Sources Materially Cited

Ref	Definition (Template)	Tags
70fff801	More than one anonymous sources are materially cited in [subject article].

7.4.1.11 Signal: Multiple Anonymous Sources are Cited in Corroboration for Information

Ref	Definition (Template)	Tags
d0c94f6a	More than one anonymous sources are corroboratively cited in providing information for [subject article].

Is more than one source cited for information?

7.4.1.12 Signal: Motivation of Anonymous Source Wanted Anonymity is Given

Ref	Definition (Template)	Tags
69a1b877	The motivation of the anonymous sources to be anonymous is given in [subject article].

7.4.1.13 Signal: Documents are Cited in the Article

Ref	Definition (Template)	Tags
457405d2	Documents are cited in [subject article].

7.4.1.14 Signal: Documents Cited in the Article Are Made Available in Publication

Ref	Definition (Template)	Tags
5da755f8	Documents cited in [subject article] are also made available in publication.

7.4.2 Signal: Contains Link to Scientific Journals

Ref

Definition (Template)

Tags

fd96f7bf

The text includes a link to original content

idea(cciv) st(cciv)

Notes (not normative):

cciv: A simple link to the a scientific journal article that backs up the assertion made. This may also be paired with a URL to the specific article.

2a85d78f

There is a link provided in [subject article] to where the original content came from.

7.4.3 Signal: Accuracy of representation of source article

Also called: Representative Citations

Ref	Definition (Template)	Tags
268b0193	The text properly characterizes the methods and conclusions of its sources	idea(cciv) st(cciv)
Notes (not normative): cciv: This article properly characterizes the methods and conclusions of the cited or quoted source. In addition to a Likert measure, two other options are possible: (A) Unable to find source, (B) Source is behind a paywall
890a77ac	This article properly characterizes the methods and conclusions of the cited or quoted source (Source 1).
4bce9ba4	This article properly characterizes the methods and conclusions of the cited or quoted source (Source 2).
e60e0259	This article properly characterizes the methods and conclusions of the cited or quoted source (Source 3).
f2e110d3	[subject article] properly characterizes the methods and conclusions of the original source.

7.4.4 Signal: Academic Journal Impact Factor

Ref	Definition (Template)	Tags
18db0556	The impact factor of the journal or conference cited is [number].
059e6c34	What is the impact factor of the journal or conference cited? *** From Wikipedia: The impact factor (IF) or journal impact factor (JIF) of an academic journal is a measure reflecting the yearly average number of citations to recent articles published in that journal. It is frequently used as a proxy for the relative importance of a journal within its field; journals with higher impact factors are often deemed to be more important than those with lower ones. https://en.wikipedia.org/wiki/Impact_factor	idea(cciv)

7.4.4.1 Signal: Academic Journal Impact Factor Cannot Be Found

Ref	Definition (Template)	Tags
8c312a70	The impact factor of the journal or conference cited cannot be found.

7.5 Article/Site Metadata

7.5.1 Signal: Subhed/Dek

Ref	Definition (Template)	Tags
a675ee6e	A “dek” is a subhed in journalism that appears below the headline of an article, usually in a smaller font (but in a larger font than the main body of the article). It typically summarizes the article or highlights a main point from the article.

7.6 Claims in Articles

Although there is a separate section for Claims [2. Subject type: Claim], this section deals with the case when the analysis of one or more claims within an article is made to signify something about the article itself. [Probably could use an introductory paragraph on different levels/objects once those are clarified, since this translation is taking place for a number of projects, consider articles to domains/publishers.]

In the following signals, an assumption is made on the existence of a central claim of the article that is recognizable.

7.6.1 Signal: Article has a central claim

Ref	Definition (Template)	Tags
c871d2dc	The central claim in [Article] is [Claim].[x]
001c97cb	There is a central claim in [Article].

The first version of this signal was used in Credibility Coalition’s WebConf2018 study in which it was expressed as a question with multiple choice answers as follows:

Has the central claim in this article been fact-checked by an IFCN Verified Signatory?

A) Most likely not fact-checked by an IFCN Verified Signatory

B) Most likely not fact-checked by an approved source

C) Fact-checked and determined false

D) Fact-checked and determined true

E) Fact-checked with unclear results

F) Fact-checked with mixed results

It was initially deprecated due to the recognition of a number of valuable fact-checking efforts that are not IFCN Signatories, but then has remained with a change to its options as follows:

Does the article rely on a claim that has been fact-checked by a member of the International Fact Checking Network (IFCN)? If so, has it been debunked?

A) The article was fact-checked and determined false

B) The article was fact-checked and determined true

C) The article was fact-checked with unclear results

D) The article was fact-checked with mixed results

E) The article was most likely not fact-checked by an IFCN member

To express these questions as signals, combine with the signals related to fact-checking organization, see section 2.2. Fact-checking Organization and 2.1. Claim Review above. ^[y]^[z]^[aa]^[ab]

7.6.2 Signal: Article has a claim

Ref	Definition (Template)	Tags
b7f2d683	[Claim] is a claim in [Article].

18. To Be Categorized

Comparative indicators (as per BitPress): “[article1] and [article2] describe the same event in a significantly different way”. Something like that. For example:

[article1] includes an important claim [claim] that [article2] omits when describing the same topic

But that could be done as “[article1] describing event [event] makes claims [claim]”

18.1 Signal: HTML Usage

Ref	Definition (Template)	Tags
a65e3e73	[website] front page uses syntactically valid HTML	sometag(CSVDemo)

18.2 Signal: Attribution of Non-Original Content

Ref	Definition (Template)	Tags
038b0a36	If the content of the article is not original, was attribution given and if so, was the attribution accurate? *** (A) Attribution was not given (B) Attribution was given but was inaccurate (C) Attribution was given and was accurate (D) Unclear which is the original	idea(cciv)

18.3 Signal: Is Original

Ref	Definition (Template)	Tags
9721ce9a	Has the text of this article appeared in exactly the same words or very similar words in another publication? *** (A) Most likely original (B) Appears to be a copy of one or more articles, with some portions different or remixed (C) Extensive quoting from another source, with some original content (D) A wholesale duplicate of another article	idea(cciv)

18.4 Signal: Publication (site)

Ref	Definition (Template)	Tags
07de125b	*** Parent of the Article.	idea(cciv)

18.5 Signal: Dateline (Date)

Ref	Definition (Template)	Tags
656673f8	When does the article claim it was published? *** From An: "i think technically this states both the date and location of the article’s publication. in which case it needs two forms of operationalization	idea(cciv)

18.6 Signal: Author

Ref

Definition (Template)

Tags

d397cb76

The article has a byline identifying an author or authors.

idea-GL(cciv)

Notes (not normative):

cciv: Articles 1:M authorship. Note that not all articles have bylines, even in traditional news sources. Bylines also don't always start with 'by' (https://medium.com/@rchang/advice-for-new-and-junior-data-scientists-2ab02396cf5b)

18.7 Signal: Length

Ref

Definition (Template)

Tags

5b04d907

[The article] contains [#] words.

idea-GL(cciv)

Notes (not normative):

cciv: Need to count the words in the article.

18.8 Signal: Language

Ref

Definition (Template)

Tags

09798548

What language(s) does the publication publish in? ***

idea(cciv)

14087209

[The article] language is [language].

idea-GL(cciv)

Notes (not normative):

cciv: Needs a tool that identifies the language.

18.9 Signal: Correction/Redaction

Ref	Definition (Template)	Tags
1d49cd64	The article contains a stated correction or redaction.	idea-GL(cciv)

18.10 Signal: Article Awards

Ref

Definition (Template)

Tags

c56851d7

The article identifies awards it or the website has received.

idea-GL(cciv)

Notes (not normative):

cciv: *** Awards are also assigned to specific Articles (but rarely)

18.11 Signal: Article Locator

Ref	Definition (Template)	Tags
38d79b22	*** URL, DOI,	idea(cciv)

18.12 Signal: Genre

Ref	Definition (Template)	Tags
ce682f01	What is the state genre, if available? *** Opinion, Feature, Biography -- but it will not always be labeled and i am not sure this will be defined consistently	idea(cciv)

18.13 Signal: Dateline (Location)

Ref	Definition (Template)	Tags
7459cd99	Where does the article claim it was published? ***	idea(cciv)

18.14 Signal: Is Translation

Ref	Definition (Template)	Tags
5874c92d	Is the article a translation? ***	idea(cciv)

18.15 Signal: Source Language (if translation)

Ref	Definition (Template)	Tags
28eb8320	If the article is a translation, what is the source language? ***	idea(cciv)

18.16 Signal: Factual assertions

Ref	Definition (Template)	Tags
1562b9d1	Does the article contain factual, verifiable assertions or is entirely opinion based? *** Complements the Genre (is the content actually opinion piece? or verifiable information?)	idea(cciv)

18.17 Signal: Subject Area

Ref	Definition (Template)	Tags
0ffe6939	What is the article's genre? *** Indicates a subject of an item: Sports, Entertainment	idea(cciv)

18.18 Signal: Shows versions and changes

Ref	Definition (Template)	Tags
269464a3	Does this article show revisions/diffs? (Most places do not) *** Shows versions, changes, diffs of an article.	idea(cciv)

18.19 Signal: Subhead/Dek

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.20 Signal: Publication Domain Registration Date

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.21 Signal: Publication Domain Registration Location

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.22 Signal: Article Rights

Ref	Definition (Template)	Tags
c400e7d6	What are the rights for this article? *** Explicit Copyright, Creative Commons, Unstated, etc.	idea(cciv)

18.23 Signal: Photo/Video Geotags

Ref	Definition (Template)	Tags
1a6a039b	What are the geolocations for the photos and videos on the article? ***	idea(cciv)

18.24 Signal: Occupation

Ref	Definition (Template)	Tags
89df0693	What is the occupation of the author? *** What is the occupation of the author. Are they a full-time journalist or something else	idea(cciv)

18.25 Signal: Education Credentials

Ref	Definition (Template)	Tags
ce8ba291	What is the educational background of the author? *** What is the educational background of the author. Do they have a degree in journalism? Do they have any post-grad education?	idea(cciv)

18.26 Signal: Track record

Ref	Definition (Template)	Tags
0bbd02f7	Has the author already published articles containing misleading or credible information? ***	idea(cciv)

18.27 Signal: Number of publications

Ref	Definition (Template)	Tags
5203fc59	How many publications does this author have and in which venues? *** How many publications does this author have and in which venues.	idea(cciv)

18.28 Signal: Public Accessibility

Ref	Definition (Template)	Tags
e6d13230	How accessible/responsive is this author to the public? *** How accessible is this author to the public. Do they have a website? Does this author have a publicly available email address? Are they on Twitter or Facebook? Do they often respond to readers?	idea(cciv)

18.29 Signal: Followers/Listeners

Ref	Definition (Template)	Tags
34b38463	What degree of attention does this author command from other individuals? *** How many people follow this author (on social media). How many other journalists follow this author? What speaking engagements does this author command?	idea(cciv)

18.30 Signal: Has Author Bio

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.31 Signal: Claim contains logical fallacy

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.32 Signal: Claim contains false assertion

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.33 Signal: Claim contains false and misleading assertion

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.34 Signal: Claim contains misleading assertion

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.35 Signal: Claim contains bad data

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.36 Signal: Links from other news sites

Ref	Definition (Template)	Tags
6c269cbf	What are the 5 most popular (Alexa rating) news sites linking to this article? *** What other news sites are linking to this URL? Does it include credible sites or not?	idea(cciv)

18.37 Signal: Facebook shares

Ref	Definition (Template)	Tags
aca0b2b3	How many facebook shares? *** Number of times shared on facebook. Important to estimate the reach of the news article.	idea(cciv)

18.38 Signal: Facebook engagement

Ref	Definition (Template)	Tags
73dbfb79	How many people engaged on facebook? *** Number of facebook engagement. Important to estimate the reach of articles.	idea(cciv)

18.39 Signal: Ratio of comments to likes <10%

Ref	Definition (Template)	Tags
21c9abae	Does it have many more likes than comments? *** Ratio of comments to likes on Facebook. Indicative of bot-promoted content.	idea(cciv)

Ref	Definition (Template)	Tags
915c563f	What are the 5 most liked (Twitter, Facebook), upvoted (Reddit) social accounts that shared this link *** What social media accounts have shared the URL? Do they include credible accounts?	idea(cciv)

18.41 Signal: Facebook comment quality

Ref

Definition (Template)

Tags

0113f86a

All or nearly all the Facebook comments are mono-sentence.

idea(cciv) j8l(cciv)

Notes (not normative):

cciv: *** Number and content of facebook comments. Important to estimate the reach and sentiment of articles.

18.42 Signal: Facebook comment quantity

Ref	Definition (Template)	Tags
1fa2c4b8	There are 10 or fewer Facebook comments	idea(cciv) j8l(cciv) new addition(cciv)

18.43 Signal: Facebook comment sentiment

Ref	Definition (Template)	Tags
4104d376	What is the sentiment of the facebook comments? *** Sentiment analysis on Facebook comments. Important to gauge the attitudes towards the news, and types of reactions they invoke.	idea(cciv)

18.44 Signal: Number of links in Wikipedia

Ref

Definition (Template)

Tags

c9af8b4d

Multiple Wikipedia articles point to the article

idea(cciv) st(cciv)

Notes (not normative):

cciv: Number of inbound links from Wikipedia main namespace.

18.45 Signal: Represents scientific literature

Ref	Definition (Template)	Tags
e7776710	The content fairly represents scientific literature on the issue at hand	idea(cciv) st(cciv)

18.46 Signal: Represents scientific process

Ref	Definition (Template)	Tags
f9fc7eb3	The content rigorously represents the scientific process	idea(cciv) st(cciv)

18.47 Signal: Presents multiple perspectives

Ref	Definition (Template)	Tags
7f4d0899	The content fairly represent multiple perspectives on an issue	idea(cciv) st(cciv)

18.48 Signal: Cites wire services

Ref	Definition (Template)	Tags
b4fafa09	The article clearly states a known wire service as a source	idea(cciv) st(cciv)

18.49 Signal: Dependency on anonymous sources

Ref

Definition (Template)

Tags

2c269b10

THe premise of this article is based on anonymous sources.

idea(cciv) ng(cciv)

Notes (not normative):

cciv: This can be made more nuanced depending on how many sources, and how critical it is to the main premise of the article.

18.50 Signal: Gives motivation of anonymous sources for revealing information

Ref	Definition (Template)	Tags
1bc9cc79	This article explains the motivation for all of the anonymous sources revealing information.	idea(cciv) og(cciv)

18.51 Signal: Shares any documents cited in piece

Ref

Definition (Template)

Tags

3ea072fe

This article publishes (or links to?) any important documents cited within it.

idea(cciv) ng(cciv)

Notes (not normative):

cciv: Does this refer to inlinks within the article? or if the creators of the article only cites articles they themselves publish

18.52 Signal: Primary subjects of article have opportunity to respond

Ref

Definition (Template)

Tags

50dbc854

The main subjects of the article have a chance to respond (to the main points of the artice? in a direct or indirect quote after being interviewed?)

idea(cciv) ng(cciv)

Notes (not normative):

cciv: Not sure if the responses metric refers to responses within the article, or responses via commenting

18.53 Signal: Premise of article is disputed by primary subjects

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.54 Signal: Corroboration?

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.55 Signal: Straw Man Argument

Ref	Definition (Template)	Tags
7e90dbc5	The [text] presents a counterargument as a weaker, more foolish version of the real counterargument (uses a Straw Man Argument).	idea(cciv) sh(cciv)

18.56 Signal: False Dilemma

Ref	Definition (Template)	Tags
e0958f51	The [text] presents a complicated choice as if it were binary (constructs a false dilemma).	idea(cciv) og(cciv)

18.57 Signal: Slippery Slope Argument

Ref

Definition (Template)

Tags

aec147ca

The text says that one small change will lead to a major change

idea(cciv) st(cciv)

Notes (not normative):

cciv: Slippery slope argument.

18.58 Signal: Naturalistic Fallacy

Ref	Definition (Template)	Tags
534e79cd	The [text] suggests that something is good because it is natural, or bad because it is not natural (the naturalistic fallacy).	idea(cciv) og(cciv)

18.59 Signal: Calibrating Confidence - Justification

Ref	Definition (Template)	Tags
98495085	The author's confidence in their claims is well justified	idea(cciv) og(cciv)

18.60 Signal: Appeal to Fear Fallacy

Ref	Definition (Template)	Tags
a5e90858	The [text] exaggerates the dangers of a situation and uses scare tactics to persuade (uses the appeal to fear fallacy).	DK(cciv) idea(cciv)

18.61 Signal: Calibrating Confidence - Level of Confidence

Ref	Definition (Template)	Tags
c2385f39	The [text] acknowledges uncertainty or the possibility that things might be otherwise (expresses level of confidence in a claim).	DK(cciv) idea(cciv)

18.62 Signal: Causal Claim Types

Ref	Definition (Template)	Tags
527f60d7	Is a general or singular causal claim made? Highlight the section(s) that supports your answer. *** General Causal Claim Singular Causal Claim No Causal Claim	idea(cciv)

18.63 Signal: Draws sound conclusions from available evidence

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.64 Signal: Begging the Question

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.65 Signal: Mistaking Noise for Signal

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.66 Signal: Orders of Understanding

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.67 Signal: Number of Enthymemes (Arguments with Missing Premises)

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.68 Signal: Numbers of Argument Components

Ref	Definition (Template)	Tags
6d3d2db5	The argumentation and logic in the article is complex	idea(cciv) st(cciv)

18.69 Signal: Number of Claims

Ref	Definition (Template)	Tags
ffc5d663	How far could this article go wrong? ***	idea(cciv)

18.70 Signal: Number of Arguments Against

Ref	Definition (Template)	Tags
5f5ead5b	the text is biased against one set of premises	idea(cciv) st(cciv)

18.71 Signal: Number of Supporting Premises

Ref	Definition (Template)	Tags
8c6efeeb	The article backs up arguments with clear hypotheses	idea(cciv) st(cciv)

18.72 Signal: Average Number of Premises Per Claim

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.73 Signal: Number of Arguments For

Ref	Definition (Template)	Tags
a1edbdbe	the text is biased towards one set of premises	idea(cciv) st(cciv)

18.74 Signal: Number of Attacking Premises

Ref	Definition (Template)	Tags
f2ee8607	The text strongly contradicts itself	idea(cciv) st(cciv)

18.75 Signal: Use of conspiratorial thinking

Ref	Definition (Template)	Tags
a857394f	The article suggests a conspiracy	idea(cciv) st(cciv)

18.76 Signal: Identifiable Victim Effect

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.77 Signal: Just World Fallacy

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.78 Signal: Supporting Claim Types

Ref	Definition (Template)	Tags
0728c3c4	What evidence is given for the primary claim? Select all that apply. *** Types of evidence for a claim: Correlation Cause precedes effect The correlation appears across multiple independent contexts A plausible mechanism is proposed An experimental study was conducted (natural experiments OK) Experts are cited Other kind of evidence No evidence given	idea(cciv)

18.79 Signal: Source Types

Ref	Definition (Template)	Tags
5fa3c065	Which of the following types of sources are cited in the article? Check all that apply. If Other, please highlight. *** None Experts Studies Organizations Other	idea(cciv)

18.80 Signal: Quotes reputable scientists

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.81 Signal: Databases

Ref	Definition (Template)	Tags
1e255af1	*** Databases as a 'Publication' owned by a Publisher	idea(cciv)

18.82 Signal: Agencies for Authority

Ref	Definition (Template)	Tags
db742e34	*** I think that Agencies could also be considered Publishers. But Publisher captures I think the relationship to the Article	idea(cciv)

18.83 Signal: Quotes outside experts

Ref	Definition (Template)	Tags
c6a45cc1	Does the article quote experts who are not part of the study but are part of the field? ***	idea(cciv)

18.84 Signal: Contains original quotes

Ref	Definition (Template)	Tags
aec59fc7	The article contains original quotes that appear to be sourced directly by the reporter.	DK(cciv) idea(cciv)

18.85 Signal: Contains Image Macros

Ref	Definition (Template)	Tags
3648b6fc	The article contains content with image macros.	DK(cciv) idea(cciv)

18.86 Signal: Number of links

Ref	Definition (Template)	Tags
4f7edf95	How many URLs does the article link out to? ***	idea(cciv)

18.87 Signal: Number of quoted sources

Ref	Definition (Template)	Tags
7113e6b1	How many sources does the article quote? ***	idea(cciv)

18.88 Signal: Contain Original Images

Ref	Definition (Template)	Tags
8f88a1f5	The [content] contains original images.	DK(cciv) idea(cciv)

18.89 Signal: Contains Attributed images

Ref	Definition (Template)	Tags
ed994261	The [content] contains images attributed to a photographer or other source.	DK(cciv) idea(cciv)

18.90 Signal: Contains Video Embeds

Ref	Definition (Template)	Tags
686014f0	Does the article embed content from video sites? *** List of embeddable video sites (YouTube, Vimeo, etc.)	idea(cciv)

18.91 Signal: Trust Project Metadata

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.92 Signal: Clear Editorial Policy

Ref	Definition (Template)	Tags
c30585d3	The website includes a clear editorial policy	idea(cciv) st(cciv)

Ref	Definition (Template)	Tags
b385555d	The publication includes a masthead or nameplate	idea(cciv) st(cciv)

18.94 Signal: Awards

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.95 Signal: Publication End Date

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.96 Signal: Publication Start Date

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.97 Signal: Publication Type

Ref	Definition (Template)	Tags
de8b889e	*** One of the following: Media Outlet, Governmental Agency, Non-profit, Private Corporation	idea(cciv)

18.98 Signal: Publication Identifier

Ref	Definition (Template)	Tags
3ef5b5e0	*** Publishers may have 1:M Publications	idea(cciv)

18.99 Signal: Publisher (Publication Owner)

Ref	Definition (Template)	Tags
920ca1c1	What is the name of the person or organization that published the publication under review? *** Publishers: Individuals or group entities -- see Publisher Table for more related attributes	idea(cciv)

18.100 Signal: Publication Name

Ref	Definition (Template)	Tags
4fc9bd5f	What is the name of the magazine, newspaper, journal, book that the article appeared within? *** Publications are parents of articles. Publishers may have 1:M Publications. Publications newsline magazines, series, newspapers, etc.	idea(cciv)

18.101 Signal: Publication CMS

Ref	Definition (Template)	Tags
a0a8fb0a	What is the CMS that the publication uses? ***	idea(cciv)

18.102 Signal: Niche Topic

Ref	Definition (Template)	Tags
c24fc749	Is the publication focused on a niche topic? If so, what is the topic? ***	idea(cciv)

18.103 Signal: Publication Domain

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

Ref	Definition (Template)	Tags
f6387d51	Does the publication have a clear and logical masthead/imprint? ***	idea(cciv)

18.105 Signal: Links to Relevant Articles

Ref	Definition (Template)	Tags
402ca152	Does the publication regularly link to other relevant articles on its own site? ***	idea(cciv)

18.106 Signal: Site Analytics

Ref	Definition (Template)	Tags
a32bb352	Does the publication use an analytics platform? Which one? ***	idea(cciv)

18.107 Signal: Has a Wikipedia Entry

Ref	Definition (Template)	Tags
86edc186	Does the publication have an entry in Wikipedia? ***	idea(cciv)

18.108 Signal: Working Phone Number

Ref	Definition (Template)	Tags
d91ed958	Is there a phone number and does calling the phone number lead someone to a representative of the publication? ***	idea(cciv)

18.109 Signal: Is Part of Press Corps

Ref	Definition (Template)	Tags
43056b49	Is the publication part of the press corps? ***	idea(cciv)

18.110 Signal: Average time spent on page

Ref

Definition (Template)

Tags

d8b18fd6

Average user spends [number] seconds on [webpage].

idea(cciv)

Notes (not normative):

cciv: (needs to go under Website) Dwell Time: Time a user spends on a page? Objectively captured by web logs. Sum of times / number of users. Note, this cannot identify the time spent actually reading the article.

18.111 Signal: Common Referrers

Ref	Definition (Template)	Tags
d158ef87	What is driving traffic to this article? *** From where did readers come from to read this article. Social media/unknown/news org page/search result page.	idea(cciv)

18.112 Signal: Volume of readership over time

Ref	Definition (Template)	Tags
3ca839fe	What is the nature of the volume of readership over time? *** What is the nature of the volume of readership over time. Is it spiky? How many spikes are there? Is there a long tail?	idea(cciv)

18.113 Signal: Emotional comment shared in response

Ref	Definition (Template)	Tags
62f91575	Did the article provoke emotional -- positive or negative -- comments and response (in the discussion space around its content)? *** The direction and weight of reader interaction with this article.	idea(cciv)

Ref	Definition (Template)	Tags
08f317b3	How strongly do you agree or disagree that the page of the article has aggressive advertisements? *** The page of the article has aggressive advertisements. This is limited to a subjective assessment at this time.	idea(cciv)

18.115 Signal: Presence of Donors

Ref	Definition (Template)	Tags
2a7d1df2	The site donors are clearly stated and reputable	idea(cciv) st(cciv)

18.116 Signal: Presence of Paywall or Subscription

Ref

Definition (Template)

Tags

3453d15b

The site contains a paywall or subscription

idea(cciv) st(cciv)

Notes (not normative):

cciv: which type?

18.117 Signal: Presence of Sponsors

Ref	Definition (Template)	Tags
6b2f930b	The article sponsors are clearly stated and reputable	idea(cciv) st(cciv)

18.118 Signal: Presence of Freemium Content

Ref	Definition (Template)	Tags
c90707a8	The site contains selectively free content outside of its paywall	idea(cciv) st(cciv)

18.119 Signal: Top Call to Action for Donations

Ref	Definition (Template)	Tags
ea4cfc89	The site or article has a topline call to action for donations	idea(cciv) st(cciv)

Ref	Definition (Template)	Tags
f4b0f415	How strongly do you agree or disagree that the page of the article has aggressive social shares? *** The page of the article has aggressive social shares, which may include calls to share the article within the text. This is limited to a subjective assessment at this time.	idea(cciv)

18.121 Signal: Emotionally Charged Tone

Ref	Definition (Template)	Tags
15fff94d	Does the article have an emotionally charged tone? (i.e, outrage, snark, celebration, horror, etc.). If so, highlight the relevant section(s). *** Article has an emotionally charged tone	idea(cciv)

18.122 Signal: Clickbait Headline

Ref	Definition (Template)	Tags
1aa19828	Is the headline clickbaity? *** A measure of how much the title of the article conforms to a predetermined set of clickbait genres.	idea(cciv)

18.123 Signal: Title Representativeness Types

Ref	Definition (Template)	Tags
c11c62b6	How is the title unrepresentative of the content of the article? (Select all that apply). *** Types of title representativeness (A) Title is on a different topic than the body (B) Title emphasizes different information than the body (C) Title carries little information about the body (D) Title takes a different stance than the body (E) Title overstates claims or conclusions in the body (F) Title understates claims or conclusions in the body	idea(cciv)

18.124 Signal: Clickbait Genres

Ref	Definition (Template)	Tags
c43d0138	What clickbait techniques does this headline employ (select all that apply)? *** A typology of clickbait headlines: Listicle (“6 Tips on …”) Cliffhanger to a story (“You Won’t Believe What Happens Next”, “Man Divorces His Wife After Overhearing This Conversation”) Provoking emotions, such as shock or surprise (“...Shocking Result”, “...Leave You in Tears”) Hidden secret or trick (“Fitness Companies Hate Him...”, “Experts are Dying to Know Their Secret”) Challenges to the ego (“Only People with IQ Above 160 Can Solve This”) Defying convention (“Think Orange Juice is Good for you? Think Again!”, “Here are 5 Foods You Never Thought Would Kill You”) Inducing fear (“Is Your Boyfriend Cheating on You?”) Other	idea(cciv)

18.125 Signal: Exaggerated Claims

Ref	Definition (Template)	Tags
43d71ea6	1.21 mc Does the author exaggerate any claims? If so, highlight the relevant section(s). *** Claims are exaggerated, as indicated by the tone	idea(cciv)

18.126 Signal: Reading Level

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.127 Signal: Politicizing Tone

Ref	Definition (Template)	Tags
8297892a	Does the content of the article politicize an issue unrelated to politics? ***	idea(cciv)

18.128 Signal: Contains Profanity

Ref	Definition (Template)	Tags
8b9ac8e1	Does the article contain profanity? ***	idea(cciv)

18.129 Signal: Grammatical Rules

Ref	Definition (Template)	Tags
9f83797e	Does the article follow rules of US or UK English grammar? ***	idea(cciv)

18.130 Signal: Spelling Errors

Ref	Definition (Template)	Tags
3bd39eaf	How many spelling errors does the article have? ***	idea(cciv)

18.131 Signal: Number of Exclamation Points

Ref	Definition (Template)	Tags
74b754a2	How many exclamation marks appear in the article? ***	idea(cciv)

18.132 Signal: Contains Hyperbolic Language

Ref	Definition (Template)	Tags
d0513609	Does the article contain hyperbolic language? ***	idea(cciv)

18.133 Signal: Hyperpartisanship / Political bias

Ref	Definition (Template)	Tags
23018043	*** Extreme political bias, e.g. unconstructive political discourse	idea(cciv)

18.134 Signal: Astroturfing

Ref	Definition (Template)	Tags
17f1269a	*** the practice of the masking sponsors of a message	idea(cciv)

18.135 Signal: Overly Emotional Language

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.136 Signal: Exaggeration

Ref	Definition (Template)	Tags
596f4162	***	idea(cciv)

18.137 Signal: Hate speech

Ref	Definition (Template)	Tags
0a580d75	*** Usage of abusive or toxic language, e.g. racism, sexism, etc.	idea(cciv)

18.138 Signal: Apophasis

Ref	Definition (Template)	Tags
50bb076a	*** A rhetorical device wherein the writer brings up a subject by either denying it, or denying it should be brought up	idea(cciv)

18.139 Signal: Cognitive Distortion

Ref	Definition (Template)	Tags
88d4a6cf	*** (the representation of something in an excessive manner)	idea(cciv)

18.140 Signal: Proportion (Exaggeration - Minimization Spectrum)

Ref	Definition (Template)	Tags
e3876cec	Is the description an extreme exaggeration, an extreme minimization, or proportional to the event or situation described? *** The extent to which language in the text is proportional to the situation, or exaggerates or minimizes events. Exaggeration as defined in Webster 1913, "...the act of doing or representing in an excessive manner;..." and Minimization as defined in Oxford Living Dictionary accessed June 2018, "1.1 Represent or estimate at less than the true value or importance"	idea(cciv)

Ref	Definition (Template)	Tags
7b21549b	[subject] has a prominent top or side menu structure or buttons or links, taking user to other parts of site
b0b323b3	[subject] has obvious navigation elements at one or more edges of the content, providing a way to reach other content on the same website

Ref	Definition (Template)	Tags
1c858afa	Does the title of the article accurately reflect the content of the article? *** A measure of how representative the content of the title is with the content of the body copy.	idea(cciv)
63bdbf87	Title is representative of the content of the article.

Ref	Definition (Template)	Tags
74758d68	Does an ads.txt file exist on the domain? *** ads.txt (Authorized Digital Sellers) is an Interactive Advertising Bureau initiative. It specifies a text file that companies can host on their web servers, listing the other companies authorized to sell their products or services. This is designed to allow online buyers to check the validity of the sellers from whom they buy, for the purposes of internet fraud prevention. (from https://en.wikipedia.org/wiki/Ads.txt)	idea(cciv)
363d7780	The domain contains an ads.txt file.

Ref	Definition (Template)	Tags
45f891af	How strongly do you agree or disagree that the page of the article has spammy or clickbaity advertisements? *** The page of the article has spammy or clickbaity advertisements. This is limited to a subjective assessment at this time.	idea(cciv)
37896453	The page of the article has spammy or clickbaity advertisements. This is limited to a subjective assessment at this time.

Ref	Definition (Template)	Tags
507e52c1	How many ads appears on the article page? *** There are multiple types of ads to look for. (1) Display ads. These are boxes that are clearly advertisements, typically in the form of a graphic image or, in the case of Google Adwords, a box with text. (2) Content recommendation engines, specifically, Taboola, Outbrain, Tivo, RevContent. A box of content recommendations on a page counts as one. (3) Sponsored content. This is content recommended on the site with a clear label: “Sponsored.” (4) Call for social sharing. (5) Call to subscribe to a mailing list	idea(cciv)
4168b252	The number of ads that appear on [subject article] is [number]. This includes display ads, content recommendation engines, sponsored content and call for social sharing

Ref	Definition (Template)	Tags
94cd8f08	How strongly do you agree or disagree that the page of the article has aggressive advertisements, including calls to join a mailing list? *** The page of the article has aggressive advertisements. This is limited to a subjective assessment at this time.	idea(cciv)
efcfda72	The page of the article has aggressive advertisements. This is limited to a subjective assessment at this time.
8d058562	The text of the article links to products.

Credibility Signals

Unofficial Draft 26 November 2019

Abstract

Status of This Document

1. Introduction

1.1 Purpose

1.2 Credibility Data

1.3 Example

1.4 Factors in Selecting Signals

1.4.1 Measurement Challenges

1.4.2 Value in Credibility Assessment

1.4.3 Feedback Risks (“Gameability”)

1.4.4 Interoperability

1.5 Publishing Credibility Data

1.6 Consuming Credibility Data

1.7 Organization of this document

1.8 Template Statements

1.9 Instructions for editing this document

1.9.1 Expand discussion

1.9.2 Add new template statements

1.9.3 Add new signals

1.10 Contributors

1.11 Sources

2. Subject type: Claim

2.1 Claim Review

2.1.1 Signal: Fact-check status of claim

2.1.2 Signal: Fact-check status of claim — VERIFIED

2.1.3 Signal: Fact-check status of claim — REFUTED

2.1.4 Signal: Fact-check status of claim — UNCLEAR

2.1.5 Signal: Fact-check status of claim — MIXED

2.1.6 Signal: Claim - Risk of Harm

2.1.7 Signal: Claim - Coded Meaning

2.2 Fact-checking Organization[k][l]

2.2.1 Signal: Fact-checking Organization commitments — member of the IFCN

2.2.2 Signal: Fact-checking Organization commitments — accuracy and professionalism

2.2.3 Signal: Fact-checking Organization commitments — unknown

2.3 Explicitly Unverified Claims

3. Subject type: Text

3.1 Formality

3.1.1 Signal: Formal tone

3.1.1.1 Signal: Correct Spelling

3.1.1.2 Signal: Correct Grammar

3.1.2 Signal: Informal tone

3.1.2.1 Signal: Slang

3.1.2.2 Signal: Informal grammar

3.2 References or citations

3.2.1 Signal: Uses standardized references or citations

3.2.2 Signal: Uses formal but not standardized references or citations

3.2.3 Signal: Few to zero references or citations

3.3 Pronouns

3.3.1 Signal: Many or multiple instances of the pronouns "I" or "you"

3.3.2 Signal: Few or no instances of "I" or "you"

3.4 Signal: Vocabulary or reading level

3.5 Incivility and impoliteness

3.5.1 Signal: Incivility

3.5.2 Signal: Impoliteness

3.6 Text type

3.6.1 Signal: Text type is news

3.6.2 Signal: Text type is opinion

3.6.3 Signal: Text type is satire

3.6.4 (Section with no title?)

4. Subject type: Image

4.1 Implied association or tone

4.1.1 Signal: Flattering image

4.1.2 Signal: Unflattering image

4.2 Originality of Photo Used in an Article

4.2.1 Originality Types

4.2.1.1 Signal: Most Likely Original

4.2.1.2 Signal: Appears to be a Copy, with Some Modifications

4.2.1.3 Signal: Is a copy of a previously published image

4.2.1.4 Signal: Is extensively modified from a previously published image

4.2.2 (Section with no title?)

4.2.3 Attribution of Non-Original Photo

5. Subject type: Audio

5.1 Audio type

5.1.1 Signal: Audio type is news

5.1.2 Signal: Audio type is opinion

5.1.3 Signal: Audio type is advertising or marketing.

5.1.4 Signal: Audio roles - host

5.1.5 Signal: Audio roles - reporter