introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
20 rows where transcript_id = 15198758
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 82238813 | GT-AG | 0 | 2.9725137083548205e-05 | 342 | rna-XM_005415640.2 15198758 | 2 | 12381571 | 12381912 | Geospiza fortis 48883 | CTG|GTAATGTTTC...ATTGCTTTAAAC/ATGTTACTAATT...TAAAG|ATG | 1 | 1 | 12.832 | 
| 82238814 | GT-AG | 0 | 3.078936707665029e-05 | 2079 | rna-XM_005415640.2 15198758 | 3 | 12379382 | 12381460 | Geospiza fortis 48883 | GCT|GTAAGTACTT...TGACACTTGATG/AGAAACCTCATT...TTTAG|GTC | 0 | 1 | 16.947 | 
| 82238815 | GT-AG | 0 | 1.000000099473604e-05 | 1407 | rna-XM_005415640.2 15198758 | 4 | 12377841 | 12379247 | Geospiza fortis 48883 | ATG|GTGAGCTCCT...TTTGTTTTCACT/TTTGTTTTCACT...TTTAG|TCC | 2 | 1 | 21.96 | 
| 82238816 | GT-AG | 0 | 1.000000099473604e-05 | 167 | rna-XM_005415640.2 15198758 | 5 | 12377495 | 12377661 | Geospiza fortis 48883 | CAG|GTAAATCCTT...TGGTCATTAACA/CATTTGCTGAAT...ATTAG|GAC | 1 | 1 | 28.657 | 
| 82238817 | GT-AG | 0 | 1.1925176358519715e-05 | 105 | rna-XM_005415640.2 15198758 | 6 | 12377228 | 12377332 | Geospiza fortis 48883 | CAG|GTAGACCAAC...GTCACTTTGAAA/CCTACTCTCAGT...TACAG|GAG | 1 | 1 | 34.718 | 
| 82238818 | GT-AG | 0 | 0.0008149676427697 | 1204 | rna-XM_005415640.2 15198758 | 7 | 12375923 | 12377126 | Geospiza fortis 48883 | CAG|GTATGTTGAA...TTTCTCTGAATT/ATTGCATTCACT...CTCAG|GTA | 0 | 1 | 38.496 | 
| 82238819 | GT-AG | 0 | 0.0004129009186602 | 305 | rna-XM_005415640.2 15198758 | 8 | 12375503 | 12375807 | Geospiza fortis 48883 | CAA|GTAAGCTACA...GTTCCCTAAATG/TAAATGTTTACT...TCTAG|AGT | 1 | 1 | 42.798 | 
| 82238820 | GT-AG | 0 | 1.000000099473604e-05 | 849 | rna-XM_005415640.2 15198758 | 9 | 12374529 | 12375377 | Geospiza fortis 48883 | CAG|GTTAGTCTTG...TTTCTCTTATAT/TTTTCTCTTATA...CCCAG|GAG | 0 | 1 | 47.475 | 
| 82238821 | GT-AG | 0 | 1.000000099473604e-05 | 1307 | rna-XM_005415640.2 15198758 | 10 | 12373099 | 12374405 | Geospiza fortis 48883 | AAG|GTAAGAGCCC...TGATTTTTTTTA/CCTCCAGTTATT...AATAG|TGC | 0 | 1 | 52.076 | 
| 82238822 | GT-AG | 0 | 1.000000099473604e-05 | 3432 | rna-XM_005415640.2 15198758 | 11 | 12369576 | 12373007 | Geospiza fortis 48883 | GAG|GTAAGAATAG...TTATTTCTGACT/TTATTTCTGACT...TTTAG|GTA | 1 | 1 | 55.481 | 
| 82238823 | GT-AG | 0 | 1.000000099473604e-05 | 643 | rna-XM_005415640.2 15198758 | 12 | 12368812 | 12369454 | Geospiza fortis 48883 | CAG|GTCAGTATAG...CATTCTTGGATT/TATCGATTCATT...TTCAG|TCG | 2 | 1 | 60.007 | 
| 82238824 | GT-AG | 0 | 1.000000099473604e-05 | 698 | rna-XM_005415640.2 15198758 | 13 | 12367957 | 12368654 | Geospiza fortis 48883 | TCA|GTAAGTAAAC...GTGTCTTTTGCT/TTTGTATTCAGC...CTCAG|CAT | 0 | 1 | 65.881 | 
| 82238825 | GT-AG | 0 | 9.947488861293564e-05 | 449 | rna-XM_005415640.2 15198758 | 14 | 12367416 | 12367864 | Geospiza fortis 48883 | AGT|GTAAGTATTA...ATTTTCTAAATC/CATTTTCTAAAT...AACAG|GGA | 2 | 1 | 69.323 | 
| 82238826 | GT-AG | 0 | 1.000000099473604e-05 | 5930 | rna-XM_005415640.2 15198758 | 15 | 12361464 | 12367393 | Geospiza fortis 48883 | GAG|GTGAGTCATG...TTATTTTTAACT/TTATTTTTAACT...TTTAG|GTT | 0 | 1 | 70.146 | 
| 82238827 | GT-AG | 0 | 1.000000099473604e-05 | 2456 | rna-XM_005415640.2 15198758 | 16 | 12358867 | 12361322 | Geospiza fortis 48883 | AAG|GTAGGGTAAT...TTTTCTTTAGTA/TTTGTTTTAATT...GCTAG|GCG | 0 | 1 | 75.421 | 
| 82238828 | GT-AG | 0 | 0.0002005430620376 | 758 | rna-XM_005415640.2 15198758 | 17 | 12357942 | 12358699 | Geospiza fortis 48883 | CTT|GTAAGTGTCA...TTTTTTTTAACA/TTTTTTTTAACA...TACAG|AAA | 2 | 1 | 81.669 | 
| 82238829 | GT-AG | 0 | 1.452578848371484e-05 | 6895 | rna-XM_005415640.2 15198758 | 18 | 12350977 | 12357871 | Geospiza fortis 48883 | GAG|GTAAGCCATA...AATGCTTTATTG/CAATGCTTTATT...TACAG|CTG | 0 | 1 | 84.287 | 
| 82238830 | GT-AG | 0 | 2.4222620314911392e-05 | 1208 | rna-XM_005415640.2 15198758 | 19 | 12349664 | 12350871 | Geospiza fortis 48883 | AAG|GTAAATTTAC...AAAATCTTATAT/TAAAATCTTATA...AACAG|GAA | 0 | 1 | 88.215 | 
| 82238831 | GT-AG | 0 | 1.000000099473604e-05 | 2951 | rna-XM_005415640.2 15198758 | 20 | 12346626 | 12349576 | Geospiza fortis 48883 | GAG|GTAATGTAAC...TTTATTATAACT/TTTATTATAACT...GACAG|ATA | 0 | 1 | 91.47 | 
| 82238832 | GT-AG | 0 | 5.088171450740824e-05 | 4884 | rna-XM_005415640.2 15198758 | 21 | 12341688 | 12346571 | Geospiza fortis 48883 | CAG|GTATTAGCCT...TTGTTTTTAATT/TTGTTTTTAATT...TTTAG|GCT | 0 | 1 | 93.49 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);