Referee data collected for Peer Review Week 2018 from Wellcome Open Research Dataset description: Author and referee gender data collected from Wellcome Open Research for Peer Review Week 2018 (September 10-15, 2018). All data has been de-identified. Description of each file Wellcome Open Research author gender data.csv CSV file containing the gender of authors of articles published on Wellcome Open Research. This data was collected using Gender API (https://gender-api.com/), which returns the probability of whether a first name belongs to someone who is male or female. To prevent coding more gender-neutral names to one gender or another, we chose a cut-off of 70% certainty that a name was male or female. Where certainty was less than 70%, gender determination was attempted manually using pronouns on bibliography pages. Names for which the API returned no result and which could not be determined manually are coded as -1. Names which were determined manually because the API returned no result were coded as -2. U, unknown; M, male; F, female. Data gathered in August 2018. This dataset contains the following elements: Unique ID: Serial number assigned to uniquely identify each author in the dataset. Probability male: The probability that the first name of the author is male, according to Gender API. Where the probability was less than 0.70, gender determination was attempted manually using pronouns on bibliography page. Names for which the API returned no result and could not be determined manually are coded as -1. Names which were determined manually are coded as -2. Probability female: The probability that the first name of the author is female, according to Gender API. Where the probability was less than 0.70, gender determination was attempted manually using pronouns on bibliography page. Names for which the API returned no result and could not be determined manually are coded as -1. Names which were determined manually are coded as -2. Determined gender: The gender of the author determined using the probability that the first name is male or female according to Gender API and manual identification of pronouns on bibliography pages. Authors were determined to be male (M) or female (F) where the API calculated gender certainty for their name was greater than 0.70 or where gender-specifying pronouns were manually identified on bibliography pages. For names that did not meet the Gender API cut off, where the API returned no result and where manual identification was not possible the gender was determined to be unknown (U). Wellcome Open Research referee gender data.csv CSV file containing the gender of referees (not including co-referees) who were suggested to write a peer review report for Wellcome Open Research. This data was collected using Gender API (https://gender-api.com/), which returns the probability of whether a first name belongs to someone who is male or female. To prevent coding more gender-neutral names to one gender or another, we chose a cut-off of 70% certainty that a name was male or female. Where certainty was less than 70%, gender determination was attempted manually using pronouns on bibliography pages. Names for which the API returned no result and which could not be determined manually are coded as -1. Names which were determined manually because the API returned no result were coded as -2. U, unknown; M, male; F, female. Three months after a paper has completed peer review if a suggested/invited referee doesn’t provide a review then identifying information (name, affiliation, email address) is deleted. Data gathered August 2018. This dataset contains the following elements: Unique ID: Serial number assigned to uniquely identify each referee in the dataset. Report published: Indicates whether the suggested referee published a peer review report for the paper. Y, the referee published a peer review report; N, the referee did not publish a peer review report. Article referee status: Indicates whether the suggested referee was approved by the submitting author of the article. AUTHOR_APPROVED, the referee was approved; NOT_APPROPRIATE, the referee was not deemed suitable to be invited to review; AWAITING_AUTHOR, the referee is currently pending approval to be invited to review; NONE, the referee’s invitation process has been put on hold. Article referee source: Indicates whether the referee was suggested by the author (SUGGEST BY AUTHOR), another referee (SUGGEST BY REFEREE), a member of the Wellcome Open Research editorial team (SUGGEST BY EDITOR). Algorithm suggestion: Indicates whether the suggested referee was generated using the peer review selector tool offered by Wellcome Open Research. Y, the tool was used; N, the tool was not used. Probability male: The probability that the first name of the referee is male, according to Gender API. Where the probability was less than 0.70, gender determination was attempted manually using pronouns on bibliography page. Names for which the API returned no result and could not be determined manually are coded as -1. Names which were determined manually are coded as -2. Probability female: The probability that the first name of the referee is female, according to Gender API. Where the probability was less than 0.70, gender determination was attempted manually using pronouns on bibliography page. Names for which the API returned no result and could not be determined manually are coded as -1. Names which were determined manually are coded as -2. Determined gender: The gender of the referee determined using the probability that the first name is male or female according to Gender API and manual identification of pronouns on bibliography pages. Referees were determined to be male (M) or female (F) where the API calculated gender certainty for their name was greater than 0.70 or where gender-specifying pronouns were manually identified on bibliography pages. For names that did not meet the Gender API cut off, where the API returned no result and where manual identification was not possible the gender was determined to be unknown (U). Determined author gender: The gender of the submitting author determined using the probability that the first name is male or female according to Gender API and manual identification of pronouns on bibliography pages. Submitting authors were determined to be male (M) or female (F) where the API calculated gender certainty for their name was greater than 0.70 or where gender-specifying pronouns were manually identified on bibliography pages. For names that did not meet the Gender API cut off, where the API returned no result, or where manual identification was not possible the gender was determined to be unknown (U).