How to Curate a Gene Page

From Compendium of Cancer Genome Aberrations

CCGA Gene curation guidelines: How to create a page in CCGA

Summary:

Introduction:

Thank you for volunteering to help curate the Compendium of Cancer Genome Aberrations (CCGA)! Your help will make this resource a valuable tool for its users, including researchers, clinicians and others. This short written description will help you get started and serves as a collection of best practices and content style guidance as you curate. Before starting your curation, please sign, date and return your “Honor Agreement”, which is available here: . (TO DO: make link to Honor Agreement, downloadable as a PDF, if possible). There is also a tutorial video that shows you how to curate; watching it is required before you start to curate. It can be seen here:

TO DO: create a tutorial video based on this script

The basic logic for CCGA pages is that basic molecular biology information about the gene, protein and mutations goes on the “Gene Pages”, and that disease and clinical information goes on the “Disease Pages”. The CCGA is especially interested in curating the fusion genes and mutations that arise in disease, especially in the hematological cancers. However, the dividing line between "Gene/Protein/Mutation" information and "Disease" information is sometimes hard to determine. These guidelines will help you determine what information goes where. Please note that you should plan to spend 4 to 8 hours curating the information for a single gene page, and more if you are unfamiliar with the gene.

Editor:

Your curation of the gene and protein information will be aided, edited and reviewed by an “EDITOR”. Your assigned Editor will be your go-to person for help while you curate, and will review your curated information after you are done. Please note that this written document and the curation tutorial cannot cover all the questions and decisions you will face as you curate. Therefore, PLEASE feel free to contact your Editor with any questions. You will need to contact your assigned Editor to get “write” permissions to the CCGA so that you can create and edit pages in the wiki.

Wiki Pages:

The CCGA web site is based on wiki pages and can be accessed here: http://www.ccga.io/index.php/Main_Page. Please ask your Editor for write permissions.

The functionality of the wiki pages is described in brief at MediaWiki; see here: https://www.mediawiki.org/wiki/MediaWiki.

There are also short videos available on YouTube that describe MediaWiki functionality here: https://www.youtube.com/watch?v=F8irbbwNo2E&list=PLAagofQWV6pf0xFyUw7gJg2yYYB-nCS4l. Others can be found by searching YouTube for "MediaWiki tutorial".

Gene Pages:

You will be asked to choose from a number of gene pages that you would like to curate. See the list here: http://www.ccga.io/index.php/List_of_Gene_Pages. A “Gene specific Template” has been produced, which provides you with a MediaWiki template to fill in with the information you have curated. It is shown here: http://www.ccga.io/index.php/Gene-Specific_Template.

Note that the Template already has the markup you will need for headers, links, and other syntax used in the Gene Pages. Please do not change the syntax of the Template, because users of the CCGA will want to see the same format from page to page.

NOTE: When you make large edits on a wiki page, the wiki's security checks that you are a human by showing text at the top of the page with a simple arithmetic problem that you must complete, as below. YOU MUST answer the arithmetic problem for your edits to be saved.

Your edit includes new external links. To protect the wiki against automated spam, we kindly ask you to solve the following task below and enter the answer in the box in order to save your edit (more info): 45−1 =


Gene curation using the “Gene specific Template” MediaWiki Template in CCGA

Copying the Template to a new gene page.

In most cases, the Gene you will curate has already been created, based on the Gene specific template. However, if not, you will need to copy the Gene specific template to a new page.

  1. Go to the Gene specific template (http://www.ccga.io/index.php/Gene-Specific_Template)
  2. Click the “edit source” link at the top center-right of the page
  3. Copy the entire page (e.g. on a Mac, “command (cmd) a”, and then “cmd c”)
  4. Open the gene page you will curate in another browser window
  5. Paste in the template (cmd v) and then save (at the bottom of the page)
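
The copy steps above depend on opening the right pages. As a minimal sketch, assuming the CCGA wiki follows stock MediaWiki URL conventions (the `title` and `action` query parameters of `index.php`, with `action=edit` opening the editor and `action=raw` returning the plain wikitext), the relevant URLs could be built as follows. The helper name `page_url` and the gene page title `RUNX1` are hypothetical and used only for illustration.

```python
from urllib.parse import urlencode

# Base URL taken from the links in this guide.
CCGA_INDEX = "http://www.ccga.io/index.php"
TEMPLATE_TITLE = "Gene-Specific_Template"


def page_url(title: str, action: str = "") -> str:
    """Build a MediaWiki URL for a page title (hypothetical helper).

    Assumes stock MediaWiki conventions: spaces in titles become
    underscores, and `action` may be e.g. "edit" or "raw".
    """
    params = {"title": title.strip().replace(" ", "_")}
    if action:
        params["action"] = action
    return f"{CCGA_INDEX}?{urlencode(params)}"


# Wikitext of the template (the text to copy) and the editor of the target page.
print(page_url(TEMPLATE_TITLE, "raw"))
print(page_url("RUNX1", "edit"))  # "RUNX1" is an assumed page title
```

Opening the first URL in a browser should show the same text as the “edit source” view, if the wiki permits raw access; the manual steps above remain the reference procedure.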

Curating the Template with Gene Information

The Template provides an easy "fill in the blanks" MediaWiki page which already has formatted markup (for headers) and examples for you to follow. The Template is described briefly below.

Sections:

  1. Primary Authors
  2. Synonyms
  3. Genomic Location
  4. Cancer Category/Type
  5. Gene Overview
  6. Common Alteration Types
  7. Internal Pages
  8. External Links
  9. References

The sections of the Template are ordered for ease of reading by the USER. HOWEVER, as a curator, you will want to curate and load information into the wiki in a non-linear fashion. The suggested workflow follows; the numbers below refer to the Template sections listed above.

1. PRIMARY AUTHORS section.

Adding your name to the PRIMARY AUTHORS section signals to others that this gene is already being curated, and ensures that you receive credit from your peers for curating this information.

8. EXTERNAL LINKS Section.

To familiarize yourself with the latest information on the gene you are curating, “Your Gene of Interest” (designated YGI throughout), please start with one of the last sections of the Template, the EXTERNAL LINKS section. The listed resources will provide exhaustive (but not necessarily recent) molecular information about YGI, its mutations, and its place in disease, cancer and treatment. The resources suggested as EXTERNAL LINKS as of Dec 2018 are as follows:

  1. Atlas of Genetics and Cytogenetics in Oncology and Haematology
  2. COSMIC
  3. CIViC
  4. St. Jude ProteinPaint
  5. Precision Medicine Knowledgebase (Weill Cornell)
  6. Cancer Index
  7. OncoKB
  8. NCBI Gene
  9. My Cancer Genome
  10. UniProt
  11. Pfam
  12. GeneCards
  13. OMIM
  14. LOVD(3)
  15. TICdb

Please note that some genes may have special resources devoted to them. An example is the International Agency for Research on Cancer page devoted to TP53 (see http://p53.iarc.fr/). This is unusual, but TP53 is one of the most studied genes and proteins, so in this case a specialized resource is justified. To find out whether there are any special resources for YGI, perform a Google search and/or

Please note that not all the external resources have an entry for YGI. When a resource has no entry, please write "No Entry for YGI at Resource" in place of the hyperlink to that site. For example, the gene MEMCOM is not present in the OncoKB site, nor in the CIViC site. Please check all links after you have curated them, as some of the link syntax can change as you curate.

As you read through each of the EXTERNAL LINKS resources, note information that you can use. For example, the NCBI Gene site and the GeneCards site are especially rich in synonym information. Record the information either directly in the wiki page for YGI, or in a text document that you keep on your computer (and can cut and paste easily into the wiki page later). You can also take notes by hand and re-type them into the wiki page later, but this is not suggested because it is very error-prone.
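
If you keep your notes in a text document, the placeholder rule can be applied mechanically. The following is a minimal sketch, not part of the CCGA tooling: it assumes the EXTERNAL LINKS section is a bulleted list (`*`) of standard MediaWiki external links (`[URL label]`), which may differ from the actual Template. The function name and the `example.org` URL are placeholders invented for illustration.

```python
from typing import Dict, Optional

# Placeholder text required by these guidelines when a resource has no entry.
PLACEHOLDER = "No Entry for YGI at Resource"


def external_links_wikitext(links: Dict[str, Optional[str]]) -> str:
    """Render resource names and URLs as a MediaWiki bullet list.

    A URL of None (or an empty string) means the resource has no
    entry for the gene, so the placeholder is written instead.
    """
    lines = []
    for resource, url in links.items():
        if url:
            # MediaWiki external link syntax: [URL label]
            lines.append(f"* [{url} {resource}]")
        else:
            lines.append(f"* {resource}: {PLACEHOLDER}")
    return "\n".join(lines)


example = {
    "NCBI Gene": "https://example.org/ncbi-gene/YGI",  # illustrative URL only
    "OncoKB": None,  # no entry found at this resource
}
print(external_links_wikitext(example))
```

Keeping the resource names in one dictionary also makes it easy to see at a glance which resources you have not yet checked.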

PLEASE NOTE: DO NOT PLAGIARIZE. You can borrow phrases here or there without attribution. You can even use a sentence here and there if you attribute the quote directly (e.g. "From NCBI"). However, you CANNOT cut and paste whole paragraphs from other resources into the CCGA pages.

2. SYNONYMS Section

As you read through the EXTERNAL LINKS resources, you will find that many of them have lists of synonyms, especially NCBI Gene and GeneCards. Please italicize the alternative gene names (synonyms).
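
In MediaWiki markup, italics are written by wrapping text in two apostrophes on each side. As a minimal sketch, assuming synonyms are entered as a comma-separated line (the Template may use a different layout), a collected list can be formatted like this. The function name is hypothetical; AML1 and CBFA2 are used as illustrative synonyms of RUNX1.

```python
def italicize_synonyms(synonyms):
    """Return synonyms as a comma-separated line of MediaWiki italics.

    Blank entries and exact duplicates (common when merging lists
    from several resources) are dropped; input order is preserved.
    """
    seen = set()
    formatted = []
    for name in synonyms:
        name = name.strip()
        if name and name not in seen:
            seen.add(name)
            formatted.append(f"''{name}''")  # ''text'' renders as italics
    return ", ".join(formatted)


# Lists merged from two resources, with a duplicate and stray whitespace.
print(italicize_synonyms(["AML1", " CBFA2 ", "AML1"]))
```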

3. GENOMIC LOCATION Section.

This information is most easily obtained from GeneCards, in the GeneCards section "Genomics", subsection "Genomic Locations for YGI", subsection "Genomic View for YGI", and sub-subsection "Cytogenetic band".
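
Because the cytogenetic band is usually copied by hand, a quick format check can catch transcription errors. This is a minimal sketch that assumes a single band written as chromosome, arm and band number (for example 21q22.12, the band usually cited for RUNX1); it deliberately rejects ranges and other notations, and the function name is hypothetical.

```python
import re

# Chromosome (1-22, X or Y), arm (p or q), then band with optional sub-band.
BAND_PATTERN = re.compile(
    r"^(?P<chromosome>[1-9]|1[0-9]|2[0-2]|X|Y)"
    r"(?P<arm>[pq])"
    r"(?P<band>\d+(?:\.\d+)?)$"
)


def parse_cytogenetic_band(text):
    """Split a band such as '21q22.12' into its parts.

    Returns a dict with keys 'chromosome', 'arm' and 'band',
    or None if the text does not match the simple single-band form.
    """
    match = BAND_PATTERN.match(text.strip())
    if match is None:
        return None
    return match.groupdict()


print(parse_cytogenetic_band("21q22.12"))
print(parse_cytogenetic_band("q22"))  # chromosome missing, so this is rejected
```

A result of None does not necessarily mean the value is wrong, only that it should be checked against GeneCards again.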

5. GENE OVERVIEW Section

This is where you will need to find and read journal articles and book text. You may have found some relevant articles listed at some of the EXTERNAL LINKS sites (especially UniProt and NCBI Gene), but you will also have to search for new, relevant articles in PubMed and in the hematological cancer resource "WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues" (download from http://publications.iarc.fr/_publications/media/download/1511/700ac655d7f248cf1044efd985275086ed4f341f.pdf). Note that you will also find information that fits in other sections of the gene template, such as CANCER CATEGORY/TYPE and COMMON ALTERATION TYPES. This section should describe in 1 paragraph (approx. 5-7 sentences):

  1. the molecular function of the gene/protein
  2. its normal cellular role
  3. its role in diseases, esp. cancer diseases in the CCGA
  4. interesting or frequent mutations and their effects on specific cancers
  5. its role in drug resistance, if any

A few examples are shown below.

"The protein encoded by RUNX1 can bind the protein encoded by CBFB to form "Core Binding Factor", a hetero-dimeric transcription factor which regulates a number of genes responsible for hematopoiesis and osteogenesis [2]. The RUNX1 protein can bind to DNA as a monomer through its Runt domain. RUNX1 is the most frequent target for chromosomal translocation in leukemia [1]. Alterations of RUNX1 are typically loss-of-function or decreased function, and are considered "secondary driver mutations" (disease progression) in sporadic leukemias [2]; however, germline RUNX1 mutations contribute to a lifetime risk for myeloid malignancy of about 44% [2]. RUNX1 mutations (loss-of-function or decreased function) have been associated with decreased P53 activity, increased DNA repair defects and increased inflammation [2]. RUNX1 mutations are associated with gene mutations in ASXL1, MLL-PTD, and IDH1/IDH2, and are mutually exclusive with NPM1 mutations [3]. Non-complex RUNX1 mutations were found to be associated with resistance to chemotherapy and with decreased disease-free survival (DFS), event-free survival (EFS) and overall survival (OS) [3]."


The ABL1 gene encodes a non-receptor tyrosine kinase that is ubiquitously expressed and involved in a large number of cellular processes (see "NCBI Gene"). By far the most prevalent ABL1 alterations associated with cancer are fusions of the ABL1 gene with a number of partners, especially with the BCR gene in CML [1,2] and, to a lesser extent, in B-ALL and T-ALL. The head-to-tail arrangement of the BCR-ABL1 fusion gene results in activated tyrosine kinase activity [6]. It appears that the N-terminal domain of BCR can cause oligomerization of the BCR-ABL1 protein product, thus activating the ABL1 tyrosine kinase domain of the fusion protein [6,10,11]. The ABL1 and ABL2 genes encode tyrosine kinases which share overlapping physiological roles, and ABL2 somatic or amplification mutations are more common than similar mutations in ABL1 [6].

See the "BCR gene" for additional details of the BCR-ABL1 gene fusion.

For molecular information and some disease and mutation information, this section is most informed by reading the EXTERNAL LINKS resources of:

  1. UniProt
  2. GeneCards
  3. NCBI Gene
  4. CIViC
  5. Cancer Index
  6. My Cancer Genome
  7. Pfam

For mutational information, this section is most informed by:

  1. Atlas of Genetics and Cytogenetics in Oncology and Haematology
  2. COSMIC
  3. LOVD(3)
  4. TICdb
  5. St. Jude ProteinPaint
  6. Precision Medicine Knowledgebase (Weill Cornell)
  7. OncoKB

For Disease information, the best resources are:

  1. OMIM
  2. WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues (download from http://publications.iarc.fr/_publications/media/download/1511/700ac655d7f248cf1044efd985275086ed4f341f.pdf)
  3. PubMed (search for YGI, esp. in the context of "cancer" or specific cancers)

Note that in this section, you mauch

Appendix

Resources:

  1. Gene-Specific-Template: http://www.ccga.io/index.php/Gene-Specific_Template
  2. MediaWiki Help: https://www.mediawiki.org/wiki/MediaWiki
  3. YouTube videos for how to use MediaWiki: https://www.youtube.com/watch?v=F8irbbwNo2E&list=PLAagofQWV6pf0xFyUw7gJg2yYYB-nCS4l
  4. The CCGA web site is here: http://www.ccga.io/index.php/Main_Page
  5. “WHO Classification of Tumours of Haematopoietic and Lymphoid Tissues.pdf” Download from http://publications.iarc.fr/_publications/media/download/1511/700ac655d7f248cf1044efd985275086ed4f341f.pdf


FAQ (Frequently Asked Questions)