introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
19 rows where transcript_id = 32672040
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position | 
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182504805 | GT-AG | 0 | 2.7838176808452685e-05 | 1242 | rna-XM_030232675.1 32672040 | 2 | 43153778 | 43155019 | Serinus canaria 9135 | CAG|GTACTTGGAG...ACATCTTTGACT/ACATCTTTGACT...CCCAG|GCT | 1 | 1 | 19.113 | 
| 182504806 | GT-AG | 0 | 1.000000099473604e-05 | 174 | rna-XM_030232675.1 32672040 | 3 | 43153470 | 43153643 | Serinus canaria 9135 | CTG|GTAAGGATGG...AAACCTTTGATC/AAACCTTTGATC...TCAAG|GTC | 0 | 1 | 22.343 | 
| 182504807 | GT-AG | 0 | 1.000000099473604e-05 | 1316 | rna-XM_030232675.1 32672040 | 4 | 43151571 | 43152886 | Serinus canaria 9135 | CTG|GTAAGAGGAC...CAATTCTTAACC/CAATTCTTAACC...TGCAG|AAA | 1 | 1 | 36.394 | 
| 182504808 | GT-AG | 0 | 1.000000099473604e-05 | 1891 | rna-XM_030232675.1 32672040 | 5 | 43149545 | 43151435 | Serinus canaria 9135 | CAG|GTAATGTCAC...TTCTCCTCAGCC/AAAATGTTCATC...TGCAG|AGT | 1 | 1 | 39.648 | 
| 182504809 | GT-AG | 0 | 1.000000099473604e-05 | 1363 | rna-XM_030232675.1 32672040 | 6 | 43147960 | 43149322 | Serinus canaria 9135 | TTG|GTAAGTGTTT...ATTTCTATATTA/CTGGTGTTTATC...TTCAG|GTG | 1 | 1 | 44.999 | 
| 182504810 | GT-AG | 0 | 1.1205203321776204e-05 | 1144 | rna-XM_030232675.1 32672040 | 7 | 43146642 | 43147785 | Serinus canaria 9135 | TAG|GTAAGTTGGA...TATATTTTAACA/TATATTTTAACA...TTCAG|CAC | 1 | 1 | 49.193 | 
| 182504811 | GT-AG | 0 | 1.000000099473604e-05 | 1113 | rna-XM_030232675.1 32672040 | 8 | 43145383 | 43146495 | Serinus canaria 9135 | CAG|GTAATGCTTC...GTTGTTTTGAAT/GTTGTTTTGAAT...TGCAG|GTA | 0 | 1 | 52.711 | 
| 182504812 | GT-AG | 0 | 0.0004906464762532 | 772 | rna-XM_030232675.1 32672040 | 9 | 43144407 | 43145178 | Serinus canaria 9135 | CAA|GTAACTGGGT...TTTCTTTTATTC/TTTTCTTTTATT...TGTAG|GGG | 0 | 1 | 57.628 | 
| 182504813 | GT-AG | 0 | 1.000000099473604e-05 | 1576 | rna-XM_030232675.1 32672040 | 10 | 43142629 | 43144204 | Serinus canaria 9135 | GTG|GTTAGTATAT...TTTTTCTTCTCC/TCTTCTCCCACC...TACAG|CTA | 1 | 1 | 62.497 | 
| 182504814 | GT-AG | 0 | 0.0014400065764644 | 955 | rna-XM_030232675.1 32672040 | 11 | 43141474 | 43142428 | Serinus canaria 9135 | CAG|GTATACCTGA...ATATGTTTAAGG/TAAGGGCTGATT...TACAG|GCA | 0 | 1 | 67.317 | 
| 182504815 | GT-AG | 0 | 1.000000099473604e-05 | 402 | rna-XM_030232675.1 32672040 | 12 | 43140861 | 43141262 | Serinus canaria 9135 | CAG|GTACATCACA...TGTTCCTGATTA/ATGTTCCTGATT...TATAG|GAT | 1 | 1 | 72.403 | 
| 182504816 | GT-AG | 0 | 0.0035731777445959 | 317 | rna-XM_030232675.1 32672040 | 13 | 43140378 | 43140694 | Serinus canaria 9135 | CAG|GTAGCTTCAA...CTCCCCGTATTT/CCAAAAATAACC...TTTAG|ATC | 2 | 1 | 76.404 | 
| 182504817 | GT-AG | 0 | 0.0003276270781996 | 1067 | rna-XM_030232675.1 32672040 | 14 | 43139146 | 43140212 | Serinus canaria 9135 | CAG|GTATGTACTT...TGTGCCTAAATT/AATTAGTTTATT...CCCAG|GTC | 2 | 1 | 80.381 | 
| 182504818 | GT-AG | 0 | 2.7049165751883432e-05 | 283 | rna-XM_030232675.1 32672040 | 15 | 43138651 | 43138933 | Serinus canaria 9135 | TTG|GTAAGCCTGT...ATCCCTCTGACT/CATATATTTATC...ACCAG|GAA | 1 | 1 | 85.49 | 
| 182504819 | GT-AG | 0 | 1.000000099473604e-05 | 1189 | rna-XM_030232675.1 32672040 | 16 | 43137330 | 43138518 | Serinus canaria 9135 | CAG|GTAAAAGATG...CTGGCTTTATCT/ACTGGCTTTATC...TGTAG|CTG | 1 | 1 | 88.672 | 
| 182504820 | GT-AG | 0 | 1.000000099473604e-05 | 939 | rna-XM_030232675.1 32672040 | 17 | 43136290 | 43137228 | Serinus canaria 9135 | CAG|GTCTGGTCTC...TTATTTTTATAT/GTTATTTTTATA...GCTAG|CCT | 0 | 1 | 91.106 | 
| 182504821 | GT-AG | 0 | 3.53546724324867e-05 | 455 | rna-XM_030232675.1 32672040 | 18 | 43135746 | 43136200 | Serinus canaria 9135 | AAG|GTAATCAAAT...CTCACCTTGATT/TGTTTCTTCATG...TTTAG|GTT | 2 | 1 | 93.251 | 
| 182504822 | GT-AG | 0 | 0.0020979539937577 | 978 | rna-XM_030232675.1 32672040 | 19 | 43134659 | 43135636 | Serinus canaria 9135 | CAG|GTATTTTGCT...TCATCCTTCTCT/AGTATTCTCACT...TACAG|GAA | 0 | 1 | 95.879 | 
| 182514841 | GT-AG | 0 | 1.000000099473604e-05 | 162 | rna-XM_030232675.1 32672040 | 1 | 43155789 | 43155950 | Serinus canaria 9135 | CCG|GTGAGTGCTG...GCTGTGTTTGTC/GCGGGGCGGACC...CGCAG|GCA | 0 | 1.518 | 
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL
  ,PRIMARY KEY ([id]),
   FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
   FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);