Introduction
Welcome! In this project you will read simplified sentences generated by Artificial Intelligence and rate their quality.
This qualification HIT will train you to perform this task. You must be able to:
- Find the changes our AI made (i.e. "selecting spans")
- Evaluate the quality of each change
- Identify errors in each change
Here's a big picture of what we're doing:
Selecting Alignments
We'll begin by understanding the ways our AI simplifies sentences.
Each independent piece of information in a sentence is called a phrase. Our AI may add, delete or modify the wording within a phrase. It may also re-order phrases or split a complex sentence into two sentences. Each individual change to the sentence is called an edit. Your task is to align the edits between the complex and simplified sentences.
Types of Edits
We see two classes of edits: phrase edits, which modify a single piece of information, and syntax edits, which modify the structure of the sentence. We begin by introducing the phrase edits:
- Deletion Edits - The AI attempted to simplify by deleting unnecessary information.
- Insertion Edits - The AI attempted to add clarity by adding information which didn't previously exist.
- Substitution Edits - The AI attempted to reword a complicated phrase or concept.
The syntax edits will be less common, and will look very different from phrase edits:
- Splitting Edits - The AI attempted to split a complex sentence into two simpler sentences.
- Reordering Edits - The AI attempted to re-order the words within a phrase or phrases within a sentence.
- Structural Edits - The AI attempted to modify attributes (like the tense, structure or voice) of the sentence to present information more clearly.
NOTE: Phrase edits and syntax edits are independent of each other! A single word can be a part of both a phrase edit and a syntax edit.
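One way to picture the six edit types and the phrase/syntax distinction is as a small data model. The sketch below is illustrative only: the names `EditType`, `Edit` and `edits_covering`, and the choice to store spans as character offsets, are assumptions made for this example and are not part of the annotation interface.

```python
from dataclasses import dataclass
from enum import Enum


class EditType(Enum):
    # Phrase edits: change a single piece of information.
    DELETION = "deletion"
    INSERTION = "insertion"
    SUBSTITUTION = "substitution"
    # Syntax edits: change the structure of the sentence.
    SPLIT = "split"
    REORDER = "reorder"
    STRUCTURE = "structure"


PHRASE_EDITS = {EditType.DELETION, EditType.INSERTION, EditType.SUBSTITUTION}


@dataclass(frozen=True)
class Edit:
    """One edit, stored as a (start, end) character span; end is exclusive."""
    edit_type: EditType
    start: int
    end: int

    @property
    def is_phrase_edit(self) -> bool:
        return self.edit_type in PHRASE_EDITS


def edits_covering(edits, position):
    """Return every edit whose span contains the given character position."""
    return [e for e in edits if e.start <= position < e.end]


# Hypothetical spans: a substitution nested inside a larger reorder.
edits = [Edit(EditType.SUBSTITUTION, 4, 12), Edit(EditType.REORDER, 0, 20)]
overlapping = edits_covering(edits, 5)
print([e.edit_type.value for e in overlapping])
```

Because the two hypothetical spans overlap, a character inside the substitution belongs to both a phrase edit and a syntax edit, which is the situation the note describes.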
Here's an example of what selecting alignments looks like for one sentence:
Examples
This tutorial will primarily demonstrate how to annotate through examples. When you read through the examples, think about how you would annotate the spans:
A straightforward solution for foggy car windows is to run the heater.
An easy fix for foggy car windows is to run the heater.
Contrary to popular belief, the moon is made of rocks, not cheese.
The moon is made of rocks, not cheese.
The S&P 500 closed after heavy trading.
The S&P 500, an index fund of the largest 500 companies, closed after heavy trading.
Although1 the subject lacked character development, he1 grew in other ways.
The subject lacked character development and1 grew in other ways.
How much do we select?
Generally, you want to select the smallest amount of text possible that contains a single piece of information. Sometimes there can be two different types of edits next to each other. If an edit seems too large, try to highlight multiple edits instead.
Examples
A straightforward solution, which can sometimes take a while, for foggy car windows is to run the heater
An easy fix for foggy car windows is to run the heater
The tallest timber-frame structure in the U.S., by all accounts, is Carbon12, an eight-story condominium tower in Portland OR, completed in 2018.
The tallest timber-frame building in the U.S., using only wood pegs instead of nails and screws, is Carbon12 in Portland OR.
Remember! You may be tasked with identifying phrase and syntax edits at the same time.
The award-winning chef prepares1 each meal1 with loving care2.
Each meal1 is1 prepared1 with loving care2 by2 the award-winning chef.
The task find a model that best fits some observed data and prior information is known as data fitting 1.
Data fitting is 1 the task to1 find a model that best fits some information .
Capitalization & Ending Period
Some of our AI models automatically lowercase every word in their output, add spacing around punctuation, or remove the sentence's ending period. Please ignore these changes.
Georgia Tech stunned No. 24-ranked and defending Atlantic Coast Conference champion Pitt, 26-21, in Brent Key’s debut.
georgia tech stunned no . 24 - ranked and defending atlantic coast conference champion pitt , 26-21 , in brent key’s debut
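To see why these changes carry no information, it can help to normalize both sentences before comparing them. The following is a minimal sketch, assuming a hypothetical helper named `normalize`; it is not how the annotation interface works, and the sentence pair is a shortened version of the example above.

```python
import re


def normalize(sentence: str) -> str:
    """Collapse formatting-only differences so they are not mistaken for edits."""
    text = sentence.lower()
    # Drop any spaces that surround common punctuation marks.
    text = re.sub(r"\s*([.,;:!?-])\s*", r"\1", text)
    # Ignore a missing or present ending period.
    return text.strip().rstrip(".")


original = "Georgia Tech stunned No. 24-ranked Pitt, 26-21."
output = "georgia tech stunned no . 24 - ranked pitt , 26-21"
print(normalize(original) == normalize(output))
```

After normalization the two strings are identical, so nothing in this pair should be selected as an edit.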
Quiz
Now it's your turn! Take a look at these pairs of sentences and try to select the edits. You do not need to annotate your edits; just select the alignment!
1.1:
1.2:
In the real HIT, we have trained another AI to provide the alignment for you. However, this AI is frequently wrong (it will be wrong for almost every HIT), so please also fix its alignments. Go ahead and try it!
1.3:
Annotating Deletions
Now that we've selected the alignments, let's look at how to annotate them. We'll look at each edit type, starting with deletions.
A deletion is an attempt to simplify by deleting unnecessary, irrelevant or complicated information or concepts.
Examples
The earliest available history shows that in 1831 the entire Heckscherville valley contained only two villages.
In 1831 the Heckscherville valley contained two villages.
If unsaturated air is passed through a spray of continuously recirculated water, the specific humidity will increase while the dry bulb temperature decreases.
If air is passed through recirculated water, the humidity will increase temperature decreases.
Rating by Severity
Deleted content always carries some amount of information. We ask that you rate how significant the deleted content is to the main idea of the original sentence. You will rate the significance on a 4-point scale.
Examples
Observe this original sentence:
Original Sentence (Human Written):
Born into slavery in Virginia in 1856, Booker T. Washington became an influential African American leader at the outset of the Progressive Era.
This sentence communicates many different facts. Here are just a few:
- Booker T. Washington was born in Virginia
- Booker T. Washington was born in 1856
- Booker T. Washington was born into slavery
- Booker T. Washington was an influential leader
- Booker T. Washington was an African American leader
- Booker T. Washington was an influential African American leader as a result of being born into slavery
- Booker T. Washington was a leader at the beginning of the Progressive Era
Hover over each piece of information to see which part of the sentence could be deleted to remove that information from the sentence. As you can see, when a sentence communicates many ideas, it can be hard to decide whether a span is significant.
In this case, the main idea of the sentence is that Booker T. Washington was an influential African American leader. Deletions are necessary for text simplification; we just want to ensure this main idea is still being communicated.
Let’s put it together with a few other sentences:
Like so many hyped books before it, The Midnight Library excited me and gave me pause.
The Midnight Library excited me and gave me pause.
Two security flaws, dubbed Meltdown and Spectre by researchers, were made public on 29 January 2018.
Two security flaws, dubbed Meltdown and Spectre by researchers, were made public.
Though part of the Purbeck Hills, Creech Barrow stands out, detached.
Creech Barrow stands out.
If glycolysis evolved relatively late, it likely would not be as universal in organisms as it is.
It likely would not be as universal in organisms as it is.
Coreference Errors
Deletions may also make a sentence unreadable by deleting a very specific piece of information. If the referent of a pronoun is deleted and does not appear elsewhere in the sentence, this is a coreference error.
He, Euler, was also the first practitioner of graph theory.
He was also the first practitioner of graph theory.
As Euler looked for a solution to the Seven Bridges of Königsberg, he, Euler, was also the first practitioner of graph theory.
As Euler looked for a solution to the Seven Bridges of Königsberg, he was also the first practitioner of graph theory.
E. coli and salmonella can both cause food poisoning, while it (salmonella) exclusively comes from raw meat or unwashed produce.
E. coli and salmonella can both cause food poisoning, while it exclusively comes from raw meat or unwashed produce.
Grammar & Fluency Errors
Deletions may also introduce errors in fluency or grammar. Fluency refers to the quality or flow of a sentence and grammar refers to the basic conventions.
In January, the Watergate burglars were convicted, along with Hunt and Liddy.
In January, the Watergate burglars along with Hunt and Liddy.
Let’s eat, grandpa!
Let’s eat grandpa!
NOTE: Sometimes you will see spacing errors. These are typically a formatting problem with our data and you do not need to annotate these errors.
Quiz
Now it’s your turn! Take a look at these sentences and try to identify and categorize the deletions.
2.1:
2.2:
2.3:
Annotating Insertions
An insertion is an attempt to add clarity by adding information which didn't previously exist. While deletions are straightforward, insertions may exhibit many more types of errors. We’ll get to the errors later, but for now, let’s talk about good insertions:
- Elaboration - Added meaningful and correct extra information
A second non-traditional way to enter the M&A stream is through strategic board enhancements.
A second alternative, non-traditional way to enter the M&A (mergers & acquisitions) stream is through strategic board enhancements.
Simplifications may also introduce words which do not add new information. These trivial edits must be annotated, because even minor changes can still have major effects on the quality of a simplification.
- Trivial Insertion - Added minor wording (the, a, etc.)
The FZD1 transcript is expressed in various tissues.
The FZD1 protein transcript is expressed in the various tissues.
Annotating Errors
Similar to deletions, insertions have their own types of errors.
- Hallucination - New information is introduced but does not add clarity
- Irrelevant - New information is introduced which is unrelated to the main idea
- Contradiction - Phrase added but clearly contradicts information in the original sentence
- Redundant - Phrase added but fails to contain new information
Let’s look at some examples of each:
Only certain organisms, called photoautotrophs, can perform photosynthesis.
Only certain organisms, called photoautotrophs, can perform photosynthesis when they decide to.
Only certain organisms, called photoautotrophs, can perform photosynthesis.
Only certain organisms can perform photosynthesis, called photoautotrophs, can perform photosynthesis.
In January, the Watergate burglars were convicted, along with Hunt and Liddy.
In January, the Watergate burglars were not convicted, along with Hunt and Liddy.
How big is the family you cook for?
How big is the family that you cook for?
Evaluating Factuality
Sometimes an insertion adds new information that you cannot immediately confirm is correct. In this case, try to do basic research to see if the insertion is easily, verifiably correct. If there's any ambiguity, feel free to leave a comment on that sentence.
Hillary Clinton was born in the fall of 1947.
Hillary Clinton was born in the fall of 1947 in Chicago.
Hillary Clinton was born in the fall of 1947.
Hillary Clinton was born in the fall of 1947 outside the United States.
Rating by Severity
Insertions also have a severity, meaning you will rate them on a scale of 1-3 by how helpful or harmful the insertion is. Here are some examples of elaboration edits with different severities:
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals, which harm our environment, are increasing in abundance in the lower troposphere.
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals, which are bad, are increasing in abundance in the lower troposphere.
Here are some examples of errors with different severities:
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals, which are chemicals, are increasing in abundance in the lower troposphere.
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals, which are decreasing, are increasing in abundance in the lower troposphere.
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals of varying size are increasing in abundance in the lower troposphere.
Many volatile organic chemicals are increasing in abundance in the lower troposphere.
Many volatile organic chemicals organic chemicals organic chemicals are increasing in abundance in the lower troposphere.
Putting it Together
Similar to deletions, insertions can introduce fluency and grammar errors. Remember! Grammar and fluency errors are independent of your other rating. You could have a helpful elaboration insertion which introduces an error or a high impact hallucination which does not introduce an error. Take a look at these examples with many insertions:
Atmospheric nitrogen is the largest pool of available nitrogen in terrestrial ecosystems.
Atmospheric nitrogen, in our air on earth, is the largest pool of an available nitrogen in terrestrial ecosystems our scientists had access to.
Éric Gauthier is also a novella author specialising in science fiction and fantasy.
Éric Gauthier, famous for his soloist dancing career, is also a novella author specialising in the science fiction and fantasy genres.
Quiz
Now it’s your turn! Take a look at these sentences and rate the insertions for type and quality.
3.1:
3.2:
Annotating Substitutions
Now that you have learned what deletions and insertions are, let's look at a more complicated edit type: the substitution.
An edit is a substitution any time information is retained from the complex to the simple sentence but the wording is changed. Parts of the “information” in the original phrase may be added or removed, but the core meaning of the phrase is transferred from the original to the simplified sentence. Sometimes, a substitution even entirely rewrites the same phrase to a completely different meaning.
Distinguish between a substitution and a deletion + insertion pair
Sometimes an edit is a substitution, and sometimes it is an insertion and a deletion. This can be hard to differentiate, but here’s a general guideline:
- Substitution - if both spans are about the same type of content (actions, locations, ...)
- Deletion + Insertion - otherwise
John was traveling in the summer in 2020.
John was traveling in the summer in Paris.
John was traveling in the summer in 2020.
John was skiing in the winter in 2021.
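The guideline above can be written as a one-line rule. This is only a sketch: the function name `classify_replacement` and the content-type labels ("time", "location", "action") are assumptions for illustration, and in practice deciding the content type of a span is your own judgment as an annotator.

```python
def classify_replacement(original_kind: str, simplified_kind: str) -> str:
    """Apply the guideline: spans with the same content type form a substitution."""
    if original_kind == simplified_kind:
        return "substitution"
    return "deletion + insertion"


# Content-type labels below are hand-assigned for the two example pairs.
# "in 2020" (a time) was replaced by "in Paris" (a location).
first = classify_replacement("time", "location")
# "traveling" (an action) was replaced by "skiing" (an action).
second = classify_replacement("action", "action")
print(first)   # deletion + insertion
print(second)  # substitution
```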
Types of Substitution
As mentioned before, substitution is categorized into four types:
- Same meaning - Also called a paraphrase; the AI attempted to replace complicated words while retaining the meaning
- Less information - Similar to a deletion, the AI attempted to modify the phrase to remove unnecessary information from the phrase
- More information - Similar to an insertion, the AI attempted to modify the phrase to add clarity to the sentence
- Different meaning - The AI removed all information and replaced it with new information
Let’s look at some examples of how to determine what kind of information change is in a substitution:
Multinomial logistic regression uses the softmax function to compute probabilities.
Logit regression uses a function to convert arbitrary numbers to probability to compute likelihood.
Product hops to albuterol inhalers containing hydrofluoroalkane rather than chlorofluorocarbons cost remunerators and outpatients billions of dollars.
Product transitions by companies to new products containing different chemicals cost payers and patients billions of dollars.
Paraphrasing
Likely the most common edit you will encounter is a paraphrase. The AI isn’t modifying the information; it is simply replacing complex words with simpler words. The substitution can be more simple, unchanged, or less simple.
In the best cases, these bonds transcend any simplistic dynamic of a caregiver and convalescent, instead embodying a profound reciprocity.
In the superlative cases, these bonds are more important than any simplistic relationship between a caregiver and a person being cared for, instead embodying a profound reciprocation.
Similar to past annotations, you will rate the severity as well:
At the international level sport is frankly mimic warfare.
At the multi-national level sport is frankly impressionist warfare.
The researchers conducted an assay.
The researchers conducted an investigation.
Meaning Transformation Error
Following other error types, we simply rate the severity for its impact on the clarity of the sentence. As we mentioned earlier, it may be tough to distinguish between an insertion+deletion and a substitution (specifically a substitution with a totally different meaning). Here are some examples:
Levels of adherence for each risk factor differed by aortic disease subtype, as did composite adherence.
Levels of adherence for each event differed by aortic disease subtype, as did the amount of exercise each participant could do.
Information Change
The last two types of substitution, less information and more information, ask the exact same questions as deletion and insertion! Please refer back to those steps for in-depth information on these annotations. Here are a few examples:
This study used in-person, semi-structured interviews with collegiate esports players to explore how players conceptualized their competitive gameplay through the serious leisure framework.
This study interviewed college-level esports players to explore how they saw their competitive gameplay through the “serious leisure framework”.
Herbert Spencer’s book makes the first thorough analysis of this agrarian society.
His book makes the first in-depth analysis of this rural society society.
Quiz
For this quiz, there may be edits other than substitutions. Please only select those other edits; you do not need to annotate them.
4.1:
4.2:
4.3:
Annotating Splits
A sentence split is an attempt to simplify the complex original sentence by splitting it into two or more simpler sentences. Although split edits are easy to spot and are automatically marked with a || symbol, they are usually accompanied by other edits like deletion, substitution, or insertion, which you need to mark as well.
For example, changing only the punctuation (such as a comma to a period) and capitalizing the first letter of the second sentence is usually not enough to make the new sentences grammatically correct.
Splits can either be good or unnecessary. A good split is one that makes the sentence simpler and easier to understand. An unnecessary split is one that does not make the sentence simpler.
A split can also introduce a grammar / fluency error, like any other edit.
Examples
Now let's look at some examples of split edits.
Split edits by only changing the punctuation:
A typical afternoon in Iceland can feel like standing in a wind tunnel loaded with seawater, and yet Icelandic horses trot around calmly.
A typical afternoon in Iceland can feel like standing in a wind tunnel loaded with seawater. || And yet Icelandic horses trot around calmly.
Split edits associated with a deletion edit:
Today is Jack's 21st birthday, and he and his family go to Florida to celebrate!
Today is Jack's 21st birthday. || He and his family go to Florida to celebrate!
Split edits associated with an insertion edit:
Animals — our sharp, loud, restless, dangerous, inconvenient planetary roommates — were pushed to the margins.
Animals are our sharp, loud, restless, dangerous, inconvenient planetary roommates. || But they were pushed to the margins.
Split edits associated with a substitution edit:
“People may be less likely to notice days with a modest increase in fine particulate matter from smoke, but those days can still have an impact on people’s health,” said Marissa Childs, who led the research while getting her Ph.D from Stanford.
“People may be less likely to notice days with a modest increase in fine particulate matter from smoke, but those days can still have an impact on people’s health,” said Marissa Childs. || She led the research while getting her Ph.D from Stanford.
Quiz
One of our annotators actually found this tutorial window doesn't work! In the real interface, we'll highlight the split for you, and you'll select the spans corresponding to the split. Just take a glance at the answer for this. Thank you.
5.1:
Annotating Ordering Changes
In a re-order edit, the AI attempted to simplify by re-ordering words within a phrase or phrases within a sentence.
Because of his marked propensity toward procrastination and sloth, the old man was sympathetic and apathetic.
The old man was sympathetic and apathetic because of his marked propensity toward procrastination and sloth.
Because of his marked propensity toward procrastination1 and sloth, the old man was sympathetic2 and apathetic.
Because of his marked propensity toward sloth and procrastination1, the old man was apathetic and sympathetic2.
Rating by Severity
Re-order edits do not modify the information in the sentence. Therefore, we only rate for how the edit adds clarity. Similar to a paraphrase (i.e. a substitution which adds no new information), we rate whether the edit is helpful and its efficacy/severity.
The emergence of huge, dominant radio conglomerates like Clear Channel1 and Infinity is a direct consequence of the '96 Act32.
The '96 Act3 had a direct consequence of2 the emergence of huge, dominant radio conglomerates like Infinity and Clear Channel1.
Phrase vs. Syntax Edits
Typically, a simplification doesn’t just move phrases around; it also modifies the wording or information in those phrases. Because of this, an edit can be both a phrase edit and a syntax edit.
The fact of the matter is that it is not John Ziegler's job to be responsible,1 or nuanced, or to think about whether his on-air comments are productive or dangerous, or cogent, or even defensible2.
Whether what he says during his show is helpful or not,2 being responsible1 is not John Ziegler's job.
Quiz
6.1:
Annotating Structure Changes
In a structural edit, the AI attempted to modify the tense, structure or voice of the sentence to present the information more clearly. Unlike other edits, structural edits are a composite edit, meaning they are only seen as a combination of other edits. Let’s start with some examples of structural edits:
Donaldson attempted to speak clearly and he was successful.
Donaldson attempted to speak clearly and successfully.
We compute1 the Pearson correlation to assess2 annotation quality.
We computed1 the Pearson correlation when we assessed2 annotation quality.
Her book makes the first thorough analysis of this rural society.
The first thorough analysis of this rural society is made by her book.
Her book makes the first thorough analysis of this rural society, and one that draws on all available sources.
Drawing on all available sources, her book makes the first thorough analysis of this rural society.
As we have seen, changes in voice, tense or clause structure are all types of structural changes.
Reorder vs. Structural Edits
A structural edit is independent of a re-order edit! Remember, a re-order edit simply marks that the order in which information is presented has changed, while a structural edit requires some attribute of the sentence to be modified.
Although I found structural annotation difficult, with practice the annotation became easier.
With practice the annotation became easier, although I found structural annotation difficult.
Although I found structural annotation difficult, with practice the annotation became easier.
The annotation became easier with practice, although I found structural annotation difficult.
Although1 I found structural annotation difficult, with2 practice the annotation became3 easier.
I found structural annotation difficult because2 practice made3 the annotation easier.
Rating by Severity
We rate severity similar to the re-order edit. Because no information is changed, we only annotate for how the change adds clarity.
Like an actor repeating his part in an old play, Prince Vassily always spoke languidly.
Like an actor repeating his part in an old play, Prince Vassily had1 always spoke languidly.
Like an actor repeating his part in an old play1, Prince Vassily always spoke languidly.
Prince Vassily always spoke languidly, like an actor repeating his part in an old play1.
Like an actor repeating his part in an old play, Prince Vassily always spoke languidly1.
Like an actor repeating his part in an old play, languidly was how1 Prince Vassily always spoke.
Phrase vs. Syntax Edits
As you can see, phrase edits can only capture how individual pieces of information or a small set of words change. When the AI creates an edit which modifies the sentence as a whole, this must be captured by a structural edit. Here are some examples of overlapping phrase and syntax edits:
The amount that Phoenix spends on criminal enforcement is difficult to quantify1, in part because2 the City does not classify arrests by housing status.
Because2 the City of Phoenix, in part, refuses to use an individual’s housing to organize arrests, it has become hard to find out how much it spends on policing1.
Only Select 1 Reorder
One particular case you may find is a structural change which re-orders a phrase. In this case, only select the phrase which you think best captures the re-order, not every phrase which is moved in the sentence.
Because ice cream sales have increased, we observe1 more reported drownings.
More reported drownings are observed1 because ice cream sales have increased.
Putting it Together
Structural edits are the least common and most complicated edit type, so if there is ambiguity in your answer, we encourage you to provide a comment explaining why you made your decision. Here’s an example which may prove difficult to annotate:
Anna Pavlovna Scherer, in spite of her forty years1, was on the contrary2 brimming over with excitement and impulsiveness.
In spite of her many years1, Anna Pavlovna Scherer overflows with impulsiveness and excitement, on the contrary2 of her age.
Quiz
7.1:
Conclusion
Congratulations on completing the tutorial! You should be able to understand all types of phrase-level and sentence-level edits and the nuances in rating these edits. In the 'Quiz', you will use the onboarding data we provided to test your ability to rate real outputs from our AI models. If you have any questions, feel free to reach out.
When annotating, if you come across a difficult decision, please refer to this tutorial and leave a comment explaining your thought process. You can also refer to the below summary of definitions and reminders.
Phrase-level Edits
- Deletion Edits - The AI attempted to simplify by deleting unnecessary information.
- Insertion Edits - The AI attempted to add clarity by adding information which didn't previously exist.
- Substitution Edits - The AI attempted to reword a complicated phrase or concept.
Identifying Change in Information
Information change can be broadly organized into the following categories. While insertions primarily add information and deletions primarily remove information, substitutions can be vague:
- Less information - Modify the phrase to remove unnecessary information from the phrase
- Same meaning (Paraphrase) - Replace complicated words, while retaining the meaning
- More information - Modify the phrase to add clarity to the sentence
- Different meaning - The AI removed all information and replaced it with new information
Phrase-level Errors
Information addition (either through an insertion or substitution) may take the following forms:
- Elaboration - Added meaningful and correct extra information
- Trivial Insertion - Added minor wording (the, a, etc.)
- Hallucination - New information is introduced but does not add clarity
- Irrelevant - New information is introduced which is unrelated to the main idea
- Contradiction - Phrase added but clearly contradicts information in the original sentence
- Redundant - Phrase added but fails to contain new information
Information removal (either through a deletion or substitution) may exhibit errors as well:
- Coreference Error - The referent of a pronoun is deleted and does not appear elsewhere in the sentence
Lastly, any edit (syntactic or phrasal) may exhibit:
- Grammar & Fluency Error - A basic error in sentence grammar or fluency. Fluency refers to the quality or flow of a sentence and grammar refers to the basic conventions
Sentence-level Edits
- Splitting Edits - The AI attempted to split a complex sentence into two simpler sentences.
- Reordering Edits - The AI attempted to re-order the words within a phrase or phrases within a sentence.
- Structural Edits - The AI attempted to modify attributes (like the tense, structure or voice) of the sentence to present information more clearly.
Remember, structural edits are a combination of other edits! You can have up to three overlapping edits.
Decision Tree
This graphic outlines the different decisions you may make about edits in a sentence:
This concludes the tutorial of the interface.