Tips For Requesters On Mechanical Turk: Academic Requesters: How to prevent retakes on your surveys without risking worker accounts

Thursday, September 27, 2012

Academic Requesters: How to prevent retakes on your surveys without risking worker accounts

EDIT: Current information is on THIS POST. The information below is dated and might not work.

Preventing retakes on your survey is the most common issue that requesters have with mTurk. This guide explains an easy way to prevent retakes just by using the Amazon mTurk Command Line Tools (CLT) that work on Windows, OS X and GNU/Linux.

Note about Amazon's Block Method

A Google search will yield this forum post where an mTurk representative tells requesters to use the block worker function to prevent survey retakes. Unfortunately, Amazon's system is not perfect, so even if you do this a worker may still get the following form email:

Greetings from Mechanical Turk.
We regret to inform you that you were blocked from working on HITs by the following requester(s):
Example
Requesters typically block Workers who submit poor quality work.
Requesters rely on Mechanical Turk for high quality work results. In order to maintain quality, we continuously monitor service activity. Should additional Requesters block you in the future, we may suspend your account. Please ensure your work quality is at a high standard. We encourage you to read the HIT instructions and follow them carefully.
We realize that this block may be a onetime occurrence for you. Should you maintain high work quality with no further complaints for the next few months we will dismiss this event.
Regards,
The Mechanical Turk Team

Not only that, if you want to give our multiple surveys of the same type workers who took them before won't be able to take a different kind of survey from you.
IsaacM says this should never happen but experience begs to differ as requesters who use this method still get these emails sent out to workers. (There are many posts on the mTurk worker forum to this effect.) If there is a "glitch" in mTurk workers are going to be emailing you pleading for an unblock and you will get a bad reputation which may make responses from future surveys come more slowly or not at all.

Preferred Method: Qualifications

Instead of potentially causing a lot of trouble, you can use mTurk's qualifications to keep workers out of surveys. Basically, how it works is this: Your survey has a qualification pre-attached to it called "Did my survey" and it requires a value of 0 from a worker, a worker requests the qualification and it is auto-granted to them at a value of 0, and once they take the survey the qualification is incremented by 1.
First, create a file that looks like this, and name it something like no_retakes.properties:

name:No retakes please!
description:Prevent retakes on my survey
keywords:prevent, retakes
autogranted:true
autograntedvalue:0

To make the qualification, execute this command with the mTurk Command Line Tools (note: In this post, I've used the Windows syntax for all command line examples. For OSX/Linux, prepend the characters ./ to the beginning of the command and append .sh to the end of the first word, e.g. createQualificationType.sh):

createQualificationType -properties noretakes.properties

You will get a QualTypeID printed to standard output as well as to a file called noretakes.success. You need this ID to change the values later.
Make sure to add this qualification to your hit.properties file if you are making a new HIT with the CLT or to the necessary qualifications in the Hosted Requester GUI. Remember that the value should be 0.
Once the first run of the survey is complete, you can now raise everyone's values.
Whether you used the getResults command line tool or the web UI to get your results file, you should still have all of the work ID's who submitted work to your survey. Create a tab-delimited text file with the columns workerid and score. A few programs can create these files, such as Microsoft Excel and LibreOffice Calc. In Excel, they are called .tsv (tab separated values) files while in Calc they are saved as .csv but with a different delimiter (namely, tab or \t).
Your file should look something like this, with the symbol → representing a tab:

workid→score
A1EXAMPLE→1
A2EXAMPLE→1
A3EXAMPLE→1

Then, run the following command to update everyone's qual score (note, the -qualtypeid parameter takes the QualTypeID generated earlier and stored in no_retakes.success)

updateQualificationScore -qualtypeid TPREVENTRETAKESQUALIDEXAMPLE -input noretakes.tsv

Harder method: Internal lists

A harder method to prevent retakes is to ask for a worker's work ID when they begin your survey, and if it's found on a list, tell them to return the HIT. Doing that is far out of the scope of this article, though, and this method is better as it will prevent any workers from even accepting a survey they cannot do, which will leave them on the site longer for people who can do them!
I hope this was useful for you and I hope your survey gets lots of replies. :) Remember to pay workers fairly! (at least 12 cents per minute)

15 comments:

tylerJanuary 25, 2013 at 8:16 PM
I've recently outlined the "Harder method" at my own blog: http://www.tylerjohnburleigh.com/mturk/2013/01/simplescreening-using-workerids/

It's not really all that hard. ;-)
ReplyDelete
Replies
Agapi Thaleia FytrakiFebruary 13, 2013 at 2:54 AM
Great, straighforward outline of the method! Thanks!
I implemented this yesterday.
However, I noticed that the response rate/ hour dropped dramatically!! Could it be that most people are screening on "HITs I am qualified for", so they don't see the ones using auto qualification in their search results?
ReplyDelete
Replies
AlMarch 22, 2013 at 12:28 AM
Hi,

I need help understanding the qualification method. I am an academic and plan on publishing a lot of surveys on MTurk, however, I do not want to use a method that causes inconvenience to workers. I am not allowed access to turkernation as I am currently not in the US.

Can you please help me?
ReplyDelete
Replies
AdminMarch 31, 2013 at 8:03 PM
Amazon only accepts requesters from the US.
ReplyDelete
Replies
JDM StudiesApril 3, 2013 at 5:14 AM
I used this auto-grant method earlier and it worked perfectly fine. But of late it does not seem to be working. Any idea what the problem could be. I tried finding solutions - it is not because of low payment or higher time or lack of worker population.
ReplyDelete
Replies
UnknownFebruary 27, 2014 at 4:42 PM
I've tried creating the "no retakes" qualification and while everything seems to work fine in practice (I'm able to accept the HIT from another account after publishing a batch), no one has completed my HIT. I've run nearly the exact same HIT before and received an excellent response rate. Any thoughts on why the response rate has dropped to zero? Thank you in advance for any suggestions.
ReplyDelete
Replies
AdminMarch 10, 2014 at 9:46 AM
It could be because of the default "masters" qualification that Amazon uses now. You may have published your HIT in the past and had it available to the entire workforce, but when you publish using "masters" qualification, you will have workers who are very picky about who they work for and how much they are paid.
ReplyDelete
Replies
DeeptiJune 19, 2014 at 1:28 PM
One thing that is not clear to me in this post is, until I run the "update qualification score" method, a is the worker allowed to pick up my task multiple times and do it? Or is the worker automatically blocked after the first time he or she picks up our task?
ReplyDelete
Replies
UnknownNovember 13, 2015 at 9:41 PM
I am a requester and I’m haveing alot of problems with retakers I pay around $5 I’m getting ready to pull my survey. I need a legitimate step by step how to stop that from happening. please help
ReplyDelete
Replies
theloveandlightstoreOctober 20, 2019 at 4:11 AM
This comment has been removed by a blog administrator.
ReplyDelete
Replies

Add comment