Variants in Papers Mastermind Variants Track Settings
 
Genomenon Mastermind Variants extracted from full text publications

Track collection: Genetic Variants mentioned in scientific publications

+  Description
+  All tracks in this collection (2)

Display mode:      Duplicate track

Minimum MMCNT3 - articles with same protein change:
Minimum MMCNT2 - articles with exact cDNA and/or protein change:
Minimum MMCNT1 - articles with exact cDNA change:


Display data as a density graph:
Data schema/format description and download
Source data version: 2024.01.03
Assembly: Human Dec. 2013 (GRCh38/hg38)
Data last updated at UCSC: 2024-03-09 01:14:30

Description

This track shows most variants found in the full text of scientific publications gathered by Genomenon Mastermind. Mastermind uses a software that searches for disease-gene-variant associations in the scientific literature. The genome browser track shows only if a variant has been indexed by the search engine.

To get details on a variant (bibliographic references, disease, etc) click it and follow the "Protein change and link to details" at the top of the details page. Mouse over an item to show the gene and amino acid change and the scores MMCNT1, MMCNT2 and MMCNT3, explained below.

Genomenon Mastermind Genomic Search Engine is a commercial database of variants likely to be mentioned in full text scientific articles. A limited number of queries per week is free for healthcare professionals and researchers, if they register on the signup page page. Advanced features require a license for the Mastermind Professional Edition, which contains the same content but allows more comprehensive searches.

Display Conventions and Configuration

Genomic locations of variants are labeled with the nucleotide change. Hover over the features to see the gene, the amino acid change and the scores MMCNT1, MMCNT2 and MMCNT3, described below. All other information is shown on the respective Mastermind variant detail page, accessible via the "Protein change and link to details" at the top of the details page. The features are colored based on their evidence:

As suggested by Genomenom, we added a filter on all variants, so the data are not exactly identical to their website. We skip variants with more than one nucleotide and a MMCNT of 0 and where the variant is not an indel. This means that for longer variants, only variants are shown that are explicitly mentioned in the papers. This makes the data more specific.

Color Level of support
High: at least one paper mentions this exact cDNA change
Medium: at least two papers mention a variant that leads to the same amino acid change
Low: only a single paper mentions a variant that leads to the same amino acid change

The three numbers that are shown on the mouse-over and the details page have the following meaning (MM=Mastermind):

  • MMCNT1: cDNA-level exact matches. This is the number of articles that mention the variant at the nucleotide level in either the title/abstract or the full-text.
  • MMCNT2: cDNA-level possible matches. This is the number of articles with nucleotide-level matches (from 1) plus articles with protein-level matches in which the publication did not specify the cDNA-level change, meaning they could be referring to this nucleotide-level variant but there is insufficient data in these articles to determine conclusively.
  • MMCNT3: This is the number of articles citing any variant resulting in the same biological effect as this variant. This includes the articles from MMCNT1 and MMCNT2 plus articles with alternative cDNA-level variants that result in the same protein effect.
On the track settings page one can filter on these scores under the display mode section by entering a minimum number of articles for each kind of evidence.

Data access

The raw data can be explored interactively with the Table Browser or the Data Integrator. The data can be accessed from scripts through our API, the track name is "mastermind".

For automated download and analysis, the genome annotation is stored in a bigBed file that can be downloaded from our download server. The file for this track is called mastermind.bb. Individual regions or the whole genome annotation can be obtained using our tool bigBedToBed which can be compiled from the source code or downloaded as a precompiled binary for your system. Instructions for downloading source code and binaries can be found here. The tool can also be used to obtain only features within a given range, e.g. bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/hg19/bbi/mastermind.bb -chrom=chr21 -start=0 -end=100000000 stdout

Previous versions of this track can be found on our archive download server.

Methods

The Mastermind Cited Variants file was downloaded, converted to BED format with scripts that are available in our Git repository and converted to a bigBed file with the UCSC genome browser tool bedToBigBed.

This track is automatically updated two weeks after every Mastermind CVR release, which happens every three months.

Credits

Thanks to Mark Kiel, Steve Schwartz and Clayton Wheeler from Genomenon for making these data available.

References

Chunn LM, Nefcy DC, Scouten RW, Tarpey RP, Chauhan G, Lim MS, Elenitoba-Johnson KSJ, Schwartz SA, Kiel MJ. Mastermind: A Comprehensive Genomic Association Search Engine for Empirical Evidence Curation and Genetic Variant Interpretation. Front Genet. 2020 Nov 13;11:577152. doi: 10.3389/fgene.2020.577152. PMID: 33281875; PMC: PMC7691534