pax original

Paired box protein Pax-1

Annotation score:5 out of 5

The annotation score provides a heuristic measure of the annotation content of a UniProtKB entry or proteome. This score cannot be used as a measure of the accuracy of the annotation as we cannot define the ‘correct annotation’ for any given protein.

– Experimental evidence at protein level i

This indicates the type of evidence that supports the existence of the protein. Note that the ‘protein existence’ evidence does not give information on the accuracy or correctness of the sequence(s) displayed.

Select a section on the left to see content.

This section provides any useful information about the protein, mostly biological knowledge.


This subsection of the Function section specifies the position and type of each DNA-binding domain present within the protein.

Manual validated information which has been generated by the UniProtKB automatic annotation system.

Manual assertion according to rules i

The Gene Ontology (GO) project provides a set of hierarchical controlled vocabulary split into 3 categories:

GO – Molecular function i

  • DNA-binding transcription activator activity, RNA polymerase II-specific Source: Ensembl
  • DNA-binding transcription factor activity, RNA polymerase II-specific Source: NTNU_SB
  • RNA polymerase II cis-regulatory region sequence-specific DNA binding Source: Ensembl
  • sequence-specific double-stranded DNA binding Source: ARUK-UCL

Inferred from Direct Assay

Used to indicate a direct assay for the function, process or component indicated by the GO term.

More information in the GO evidence code guide

Inferred from direct assay i

GO – Biological process i
  • bone morphogenesis Source: Ensembl
  • CD4-positive, alpha-beta T cell differentiation Source: Ensembl
  • CD8-positive, alpha-beta T cell differentiation Source: Ensembl
  • cell population proliferation Source: Ensembl
  • parathyroid gland development Source: Ensembl
  • sclerotome development Source: Ensembl
  • skeletal system development Source: ProtInc

Traceable Author Statement

Used for information from review articles where the original experiments are traceable through that article and also for information from text books or dictionaries.

More information in the GO evidence code guide

Traceable author statement i

UniProtKB Keywords constitute a controlled vocabulary with a hierarchical structure. Keywords summarise the content of a UniProtKB entry and facilitate the search for proteins of interest.

Molecular function Activator, Developmental protein, DNA-binding
Biological process Transcription, Transcription regulation
Enzyme and pathway databases

Pathway Commons web resource for biological pathway data

SIGNOR Signaling Network Open Resource

This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.

Names & Taxonomy i

This subsection of the Names and taxonomy section provides an exhaustive list of all names of the protein, from commonly used to obsolete, to allow unambiguous identification of a protein.

This subsection of the Names and taxonomy section indicates the name(s) of the gene(s) that code for the protein sequence(s) described in the entry. Four distinct tokens exist: ‘Name’, ‘Synonyms’, ‘Ordered locus names’ and ‘ORF names’.

This subsection of the Names and taxonomy section provides information on the name(s) of the organism that is the source of the protein sequence.

This subsection of the Names and taxonomy section shows the unique identifier assigned by the NCBI to the source organism of the protein. This is known as the ‘taxonomic identifier’ or ‘taxid’.

This subsection of the Names and taxonomy section contains the taxonomic hierarchical classification lineage of the source organism. It lists the nodes as they appear top-down in the taxonomic tree, with the more general grouping listed first.

This subsection of the Names and taxonomy section is present for entries that are part of a proteome, i.e. of a set of proteins thought to be expressed by organisms whose genomes have been completely sequenced.

A UniProt proteome can consist of several components.
The component name refers to the genomic component encoding a set of proteins.

Organism-specific databases

Eukaryotic Pathogen Database Resources

Human Gene Nomenclature Database

Online Mendelian Inheritance in Man (OMIM)

neXtProt; the human protein knowledge platform

This section provides information on the location and the topology of the mature protein in the cell.

Subcellular location i

Extracellular region or secreted

Automatic computational assertion

Graphics by Christian Stolte & Seán O’Donoghue; Source:

  • UniProt annotation
  • GO – Cellular component
  • Nucleus
  • nuclear chromatin Source: NTNU_SB
Keywords – Cellular component i

This section provides information on the disease(s) and phenotype(s) associated with a protein.

Pathology & Biotech i

This subsection of the ‘Pathology and Biotech’ section provides information on the disease(s) associated with genetic variations in a given protein. The information is extracted from the scientific literature and diseases that are also described in the OMIM database are represented with a controlled vocabulary in the following way:

Involvement in disease i

Otofaciocervical syndrome 2 (OTFCS2) 1 Publication

Manually curated information for which there is published experimental evidence.

Manual assertion based on experiment in i

This subsection of the ‘Sequence’ section describes natural variant(s) of the protein sequence.

Manual assertion based on experiment in i

Keywords – Disease i
Organism-specific databases

MalaCards human disease database

Orphanet; a database dedicated to information on rare diseases and orphan drugs

The Pharmacogenetics and Pharmacogenomics Knowledge Base

Miscellaneous databases

Pharos NIH Druggable Genome Knowledgebase

Polymorphism and mutation databases

BioMuta curated single-nucleotide variation and disease association database

Domain mapping of disease mutations (DMDM)

This section describes post-translational modifications (PTMs) and/or processing events.

PTM / Processing i

Molecule processing

This subsection of the ‘PTM / Processing’ section describes the extent of a polypeptide chain in the mature protein following processing or proteolytic cleavage.

Proteomic databases

MassIVE – Mass Spectrometry Interactive Virtual Environment

PaxDb, a database of protein abundance averages across all three domains of life

PRoteomics IDEntifications database

ProteomicsDB: a multi-organism proteome resource

PTM databases

iPTMnet integrated resource for PTMs in systems biology context

Comprehensive resource for the study of protein post-translational modifications (PTMs) in human, mouse and rat.

This section provides information on the expression of a gene at the mRNA or protein level in cells or in tissues of multicellular organisms.

Gene expression databases

Bgee dataBase for Gene Expression Evolution

ExpressionAtlas, Differential and Baseline Expression

Genevisible search portal to normalized and curated expression data from Genevestigator

Organism-specific databases

Human Protein Atlas

This section provides information on the quaternary structure of a protein and on interaction(s) with other proteins or protein complexes.

Protein-protein interaction databases

The Biological General Repository for Interaction Datasets (BioGRID)

Molecular INTeraction database

STRING: functional protein association networks

Miscellaneous databases

RNAct, Protein-RNA interaction predictions for model organisms.

This section provides information on the tertiary and secondary structure of a protein.

3D structure databases

SWISS-MODEL Repository – a database of annotated 3D protein structure models

Database of comparative protein structure models

This section provides information on sequence similarities with other proteins and the domain(s) present in a protein.

Family & Domains i


This subsection of the ‘Family and Domains’ section describes a region of interest that cannot be described in other subsections.

Manual assertion according to rules i

Manual assertion according to rules i

Keywords – Domain i
Phylogenomic databases

evolutionary genealogy of genes: Non-supervised Orthologous Groups

The HOGENOM Database of Homologous Genes from Fully Sequenced Organisms

InParanoid: Eukaryotic Ortholog Groups

KEGG Orthology (KO)

Identification of Orthologs from Complete Genome Data

Database of Orthologous Groups

Database for complete collections of gene phylogenies

TreeFam database of animal gene trees

Family and domain databases

Conserved Domains Database

Gene3D Structural and Functional Annotation of Protein Families

Integrated resource of protein families, domains and functional sites

The PANTHER Classification System

Pfam protein domain database

Protein Motif fingerprint database; a protein domain database

Simple Modular Architecture Research Tool; a protein domain database

Superfamily database of structural and functional annotation

PROSITE; a protein domain and family database

This section displays by default the canonical protein sequence and upon request all isoforms described in the entry. It also includes information pertinent to the sequence(s), including length and molecular weight. The information is filed in different subsections. The current subsections and their content are listed below:

This subsection of the Sequence section indicates if the canonical sequence displayed by default in the entry is complete or not.

Sequence status i : Complete.

This entry describes 3

This subsection of the ‘Sequence’ section lists the alternative protein sequences (isoforms) that can be generated from the same gene by a single or by the combination of up to four biological events (alternative promoter usage, alternative splicing, alternative initiation and ribosomal frameshifting). Additionally, this section gives relevant information on each alternative protein isoform. This section is only present in reviewed entries, i.e. in UniProtKB/Swiss-Prot.

isoforms i produced by alternative splicing . AlignAdd to basketAdded to basket

This entry has 3 described isoforms and 1 potential isoform that is computationally mapped.Show allAlign All

This isoform has been chosen as the

What is the canonical sequence?

canonical i sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.

The checksum is a form of redundancy check that is calculated from the sequence. It is useful for tracking sequence updates.

It should be noted that while, in theory, two different sequences could have the same checksum value, the likelihood that this would happen is extremely low.

However UniProtKB may contain entries with identical sequences in case of multiple genes (paralogs).

The checksum is computed as the sequence 64-bit Cyclic Redundancy Check value (CRC64) using the generator polynomial: x 64 + x 4 + x 3 + x + 1. The algorithm is described in the ISO 3309 standard.

Press W.H., Flannery B.P., Teukolsky S.A. and Vetterling W.T.
Cyclic redundancy and other checksums
Numerical recipes in C 2nd ed., pp896-902, Cambridge University Press (1993))

Checksum: i 6B06A2AD302FDEEA

The sequence of this isoform differs from the canonical sequence as follows:

This protein is a transcriptional activator. It may play a role in the formation of segmented structures of the embryo. May play an important role in the normal development of the vertebral column (By similarity).

Edward Shearmur ‎– K-Pax Original Motion Picture Soundtrack


1 Grand Central 4:41
2 Good Morning Bess 2:47
3 Taxi Ride 3:51
4 Constellation Lyra 2:41
5 Bluebird 3:52
6 4th Of July 4:14
7 Prot Missing 2:30
8 Sarah 3:03
9 New Mexico 6:24
10 Powell’s Return 1:11
11 July 27th 4:40
12 Coda 3:21

Над релизом работали

  • Art Direction – Penny Bennett
  • Artwork [Package Coordination] – Laura Johnson (4), Thanne Tangel
  • Bass – Mike Elizondo
  • Composed By, Conductor – Edward Shearmur
  • Contractor [Orchestration] – Sandy Decrescen*
  • Coordinator [Soundtrack Coordination] – Leah M. Panlilio, Meredith Friedman
  • Copyist – Joann Kane
  • Design – Skouras Design
  • Directed By [A&R] – Randy Dry
  • Edited By [Music] – Daryl Kell
  • Executive-Producer – Iain Softley
  • Guitar – Lyle Workman
  • Mastered By – Patricia Sullivan
  • Mixed By – Chris Fogel
  • Orchestrated By – Brad Warnaar, Robert Elhai
  • Percussion [Alien] – Chas Smith
  • Photography By – Ken Regan, Suzanne Tenner
  • Producer – Edward Shearmur
  • Producer [Assistant] – Jeff Toyne
  • Programmed By – Bryan Carrigan, Herwig Maure*, Jeff McIlwain
  • Recorded By – Chris Fogel, Joel Iwataki
  • Voice [Solo] – Melissa Kaplan


Recorded at Music Grinder, Hollywood, CA and Sony Pictures Scoring, Culver City, CA.
Mixed at Front Page Recorder, Glendale, CA.
Mastered at Bernie Grundman Mastering.

Motion Picture Artwork, Artwork Title and Photos: TM © 2001 Universal Studios. All rights reserved. ℗ & © 2001 Universal Classics Group, a Division of UMG Recordings, Inc., Distributed by Decca Music Group Limited, London, England.

Посмотреть сведения об участниках альбома, рецензии, композиции и приобрести альбом 2001 CD от K-Pax Original Motion Picture Soundtrack на Discogs.