Gene summary


The top of this page lists the gene symbol, name, source of the name, and location (chromosome and base-pair coordinates). The gene name is the symbol extracted from publications or submitted directly by the community.

VectorBase IDs use a four-letter code to represent the species and strain information, e.g. 'AGAP' for Anopheles gambiae. The 6-digit number following this code is unique to the gene and stable (unchanging) unless the gene model changes drastically.
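
As a rough illustration of the ID layout described above, the following Python sketch splits an ID into its species/strain code and its number. It assumes an ID is exactly four uppercase letters followed by six digits; the function name split_gene_id and the ID 'AGAP000001' are hypothetical and used for illustration only, and real IDs (for example transcript IDs) may carry extra suffixes that this pattern does not handle.

```python
import re

# Assumption: four uppercase letters (species/strain code) followed by six digits.
ID_PATTERN = re.compile(r"([A-Z]{4})(\d{6})")


def split_gene_id(gene_id):
    """Return (code, number) for an ID of the assumed form, or None otherwise."""
    match = ID_PATTERN.fullmatch(gene_id)
    if match is None:
        return None
    return match.group(1), int(match.group(2))


# 'AGAP000001' is a made-up ID, not a reference to a specific gene.
print(split_gene_id("AGAP000001"))
```

Anything that does not fit the assumed layout returns None rather than raising, so a caller can decide how to treat unexpected identifiers.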


The transcript table shows all protein-coding transcripts (splice variants), translations, and non-coding transcripts annotated within the gene. For more information about the transcript table and/or biotypes, see the transcript summary help page or the Ensembl glossary.


Below the transcript table is more information about the gene. This information includes:

  • Name - The HGNC name (for human), or the best match to a known gene name in a public database.
  • Synonyms - Other gene names used for this particular gene.
  • Gene type - The gene type includes both status and biotype. These properties are explained further down this page.
  • Prediction method - Indicates if automatic annotation and/or manual curation was used to determine transcripts belonging to this gene.


All transcripts for a gene (including protein-coding splice-variants and non-coding transcripts) are listed in the table, and drawn in the diagram. Click on any VectorBase transcript ID in the transcript table, or click any transcript in the diagram, to select one particular transcript.

Individual transcripts for a gene are drawn as boxes for exons and connecting lines for introns. Filled or darkened boxes show coding sequence, and empty boxes show UTR (Untranslated Region). Transcripts drawn above the chromosome (blue bar) are on the forward strand, while transcripts below are on the reverse strand.
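
The relationship between the exon boxes and the connecting intron lines can be sketched in Python. This is a minimal illustration, not how the diagram itself is produced: it assumes exons are given as (start, end) pairs in 1-based, inclusive base-pair coordinates, and treats each gap between consecutive exons as an intron. The function name and the coordinates are hypothetical.

```python
def introns_from_exons(exons):
    """Return intron (start, end) pairs for the gaps between consecutive exons.

    Assumes 1-based, inclusive coordinates; the input may be in any order.
    """
    ordered = sorted(exons)
    introns = []
    for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:]):
        # Only a real gap between two exons counts as an intron.
        if next_start > prev_end + 1:
            introns.append((prev_end + 1, next_start - 1))
    return introns


# Made-up exon coordinates for a three-exon transcript.
example_exons = [(100, 200), (300, 400), (500, 650)]
print(introns_from_exons(example_exons))
```

Because the exons are sorted by coordinate first, the same function works for transcripts on either strand; only the reading direction differs.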

Red or black transcripts are protein coding.
Please see the general identifiers link at the left of the transcript tab. This will show you matching IDs in other databases, and may help you decide on a transcript.

Grey transcripts are non-coding. Go to the transcript summary help page for more information.


The gene status is one of the following:

  • Known - The gene has at least one transcript with a sequence match, for the same species, in a sequence repository external to VectorBase.
  • Known by Projection - The gene is homologous, based on VectorBase comparative analysis, to a gene with Known status in another species (usually a human gene).
  • Novel - The gene contains only transcripts whose sequence matches outside VectorBase come from a different species. (This can be read as a gene or transcript that is novel for this species.)
  • Merged - The gene has at least one merged (gold) transcript.

Note: Links at the left of the page are for gene-related information only. For more specific information at the transcript level, such as the cDNA or protein sequence, click on a transcript (either an identifier in the table, or a transcript in the diagram). This will open the transcript tab.