introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
36 rows where transcript_id = 25387404
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 140015830 | GT-AG | 0 | 3.781005761445905e-05 | 405 | rna-XM_040237309.1 25387404 | 2 | 96737986 | 96738390 | Oryx dammah 59534 | GAG|GTAAATTACT...TTGTTCTTGAAA/ATTGACTTAATT...GAAAG|GTG | 0 | 1 | 9.2 |
| 140015831 | GT-AG | 0 | 6.128590921481408e-05 | 1650 | rna-XM_040237309.1 25387404 | 3 | 96736183 | 96737832 | Oryx dammah 59534 | CAG|GTAAACAAAA...TTTCTCTTAACG/TATATTTTCATA...CTCAG|CTG | 0 | 1 | 12.81 |
| 140015832 | GT-AG | 0 | 1.000000099473604e-05 | 662 | rna-XM_040237309.1 25387404 | 4 | 96735439 | 96736100 | Oryx dammah 59534 | CAG|GTAAGTATGT...TCATTCTTGTTT/AACAAAATCATT...TCAAG|GTC | 1 | 1 | 14.744 |
| 140015833 | GT-AG | 0 | 3.0919063465931546e-05 | 2991 | rna-XM_040237309.1 25387404 | 5 | 96732367 | 96735357 | Oryx dammah 59534 | AAA|GTAAGTATAA...TATTTTTTAGCT/TTTTAGCTTATT...TCTAG|GCA | 1 | 1 | 16.655 |
| 140015834 | GT-AG | 0 | 1.000000099473604e-05 | 2283 | rna-XM_040237309.1 25387404 | 6 | 96729998 | 96732280 | Oryx dammah 59534 | ATG|GTAAGGTTGG...CCCCCCGTAACA/TTCTAGATCATT...TCTAG|CAC | 0 | 1 | 18.684 |
| 140015835 | GT-AG | 0 | 1.000000099473604e-05 | 350 | rna-XM_040237309.1 25387404 | 7 | 96729551 | 96729900 | Oryx dammah 59534 | CAG|GTATGGAAAT...CATTTCTTCCCC/GTTGTTGTCATT...CACAG|GGG | 1 | 1 | 20.972 |
| 140015836 | GT-AG | 0 | 1.000000099473604e-05 | 1095 | rna-XM_040237309.1 25387404 | 8 | 96728365 | 96729459 | Oryx dammah 59534 | GAA|GTGAGTAGGA...AGCTTCTGGATG/GATGTTCTGACA...TATAG|GCC | 2 | 1 | 23.119 |
| 140015837 | GT-AG | 0 | 1.000000099473604e-05 | 950 | rna-XM_040237309.1 25387404 | 9 | 96727291 | 96728240 | Oryx dammah 59534 | AAG|GTAGGTGCCT...TCTTTTTTCCCT/AAGATTTTGAGA...CCTAG|GTT | 0 | 1 | 26.044 |
| 140015838 | GT-AG | 0 | 1.000000099473604e-05 | 3077 | rna-XM_040237309.1 25387404 | 10 | 96724120 | 96727196 | Oryx dammah 59534 | ATG|GTAAGACAGC...ACGGCCTTCATT/ACGGCCTTCATT...TGCAG|TTC | 1 | 1 | 28.261 |
| 140015839 | GT-AG | 0 | 1.000000099473604e-05 | 1342 | rna-XM_040237309.1 25387404 | 11 | 96722547 | 96723888 | Oryx dammah 59534 | GAA|GTAAGGAGCT...ACCCTCTTCTCT/ATGCATCTAACC...CCCAG|ACC | 1 | 1 | 33.711 |
| 140015840 | GT-AG | 0 | 0.0011741058285942 | 795 | rna-XM_040237309.1 25387404 | 12 | 96721581 | 96722375 | Oryx dammah 59534 | GTG|GTATGTTCAG...TTTACTATGACA/TTGTAGTTTACT...CCTAG|GTG | 1 | 1 | 37.745 |
| 140015841 | GT-AG | 0 | 0.0003099710001162 | 1707 | rna-XM_040237309.1 25387404 | 13 | 96719774 | 96721480 | Oryx dammah 59534 | CAA|GTAGGTTGTT...CATGCTTTGAAA/ATTTTATTCATG...AACAG|AAT | 2 | 1 | 40.104 |
| 140015842 | GT-AG | 0 | 1.000000099473604e-05 | 3459 | rna-XM_040237309.1 25387404 | 14 | 96716132 | 96719590 | Oryx dammah 59534 | CAG|GTATTAAAAT...ATCTCCTTTTTT/TTTGTTTTCATC...CTCAG|TTC | 2 | 1 | 44.421 |
| 140015843 | GT-AG | 0 | 1.000000099473604e-05 | 330 | rna-XM_040237309.1 25387404 | 15 | 96715695 | 96716024 | Oryx dammah 59534 | GGG|GTAAGTATCA...CTTTTGTTAAAC/TTTTTTTTCAAA...TGCAG|AGT | 1 | 1 | 46.945 |
| 140015844 | GT-AG | 0 | 1.000000099473604e-05 | 1418 | rna-XM_040237309.1 25387404 | 16 | 96714173 | 96715590 | Oryx dammah 59534 | GAG|GTAAGTGAAC...CATATTTTATTT/TCATATTTTATT...TGCAG|GAA | 0 | 1 | 49.398 |
| 140015845 | GT-AG | 0 | 6.6561936291797e-05 | 92 | rna-XM_040237309.1 25387404 | 17 | 96714027 | 96714118 | Oryx dammah 59534 | GAG|GTAACATGGC...TCCCCCTTGCCT/ATATAACTAACT...TGTAG|GTT | 0 | 1 | 50.672 |
| 140015846 | GT-AG | 0 | 0.0006931528355809 | 819 | rna-XM_040237309.1 25387404 | 18 | 96713102 | 96713920 | Oryx dammah 59534 | AAA|GTAAGCTTAC...CTTGTCGTGACA/GTGATTCTCAAG...TTTAG|CAT | 1 | 1 | 53.173 |
| 140015847 | GT-AG | 0 | 0.0001216416797021 | 345 | rna-XM_040237309.1 25387404 | 19 | 96712641 | 96712985 | Oryx dammah 59534 | CAG|GTACATTGAT...TTCTGTTTAATT/ATGTTATTGATT...TTAAG|ATG | 0 | 1 | 55.909 |
| 140015848 | GT-AG | 0 | 1.000000099473604e-05 | 747 | rna-XM_040237309.1 25387404 | 20 | 96711820 | 96712566 | Oryx dammah 59534 | CAA|GTAAGTACCG...GTTTCTGTGAAA/AAGGAGCTGAGT...CCCAG|ACT | 2 | 1 | 57.655 |
| 140015849 | GT-AG | 0 | 0.0036724806113465 | 120 | rna-XM_040237309.1 25387404 | 21 | 96711621 | 96711740 | Oryx dammah 59534 | AAG|GTAACATTCT...TCTTTTTTAATT/TCTTTTTTAATT...GGTAG|GTG | 0 | 1 | 59.519 |
| 140015850 | GT-AG | 0 | 4.663034620111736e-05 | 1476 | rna-XM_040237309.1 25387404 | 22 | 96710065 | 96711540 | Oryx dammah 59534 | GAA|GTAAGTATTT...GCCCTTTTAATA/TACACACTTATT...CTTAG|GGA | 2 | 1 | 61.406 |
| 140015851 | GT-AG | 0 | 0.0001472583541768 | 109 | rna-XM_040237309.1 25387404 | 23 | 96709818 | 96709926 | Oryx dammah 59534 | CAA|GTACGTGTGC...TCGCTCTTATTG/ATCTTGTTCATC...TGCAG|GTA | 2 | 1 | 64.661 |
| 140015852 | GT-AG | 0 | 1.000000099473604e-05 | 320 | rna-XM_040237309.1 25387404 | 24 | 96709412 | 96709731 | Oryx dammah 59534 | AAG|GTAGAGACAT...GTTTCCTTACAC/TGTTTCCTTACA...GACAG|GAA | 1 | 1 | 66.69 |
| 140015853 | GT-AG | 0 | 4.0755473996733504e-05 | 1510 | rna-XM_040237309.1 25387404 | 25 | 96707753 | 96709262 | Oryx dammah 59534 | AAG|GTATGTGGAG...TATCCTTTAAAA/AATATAATCACA...TTTAG|GAT | 0 | 1 | 70.205 |
| 140015854 | GT-AG | 0 | 1.000000099473604e-05 | 936 | rna-XM_040237309.1 25387404 | 26 | 96706693 | 96707628 | Oryx dammah 59534 | GTG|GTAAGTATGA...TGAATCTTATTC/ATGAATCTTATT...TGTAG|GAC | 1 | 1 | 73.13 |
| 140015855 | GT-AG | 0 | 1.000000099473604e-05 | 1449 | rna-XM_040237309.1 25387404 | 27 | 96705146 | 96706594 | Oryx dammah 59534 | AAG|GTGGGTAACG...CTCTCCTCACTG/CCTCTCCTCACT...CCCAG|GAC | 0 | 1 | 75.442 |
| 140015856 | GT-AG | 0 | 1.000000099473604e-05 | 2472 | rna-XM_040237309.1 25387404 | 28 | 96702472 | 96704943 | Oryx dammah 59534 | CAG|GTAAACACGG...TGATCTTTGCCA/CAAATGCTTATT...TGCAG|GAA | 1 | 1 | 80.208 |
| 140015857 | GT-AG | 0 | 0.000637050660872 | 5680 | rna-XM_040237309.1 25387404 | 29 | 96696730 | 96702409 | Oryx dammah 59534 | CAG|GTAAACTCAG...TCTTCCTTGATG/TCTTCCTTGATG...GTCAG|GAC | 0 | 1 | 81.67 |
| 140015858 | GT-AG | 0 | 1.000000099473604e-05 | 372 | rna-XM_040237309.1 25387404 | 30 | 96696295 | 96696666 | Oryx dammah 59534 | CTG|GTAAGAGGGC...ATTATCTTATTT/AATTATCTTATT...TTTAG|GTA | 0 | 1 | 83.156 |
| 140015859 | GT-AG | 0 | 1.000000099473604e-05 | 1767 | rna-XM_040237309.1 25387404 | 31 | 96694467 | 96696233 | Oryx dammah 59534 | AAG|GTGAGGGTTG...TTTCTCTTTGCT/TCTTTGCTTAAA...CTCAG|CCC | 1 | 1 | 84.595 |
| 140015860 | GT-AG | 0 | 1.000000099473604e-05 | 629 | rna-XM_040237309.1 25387404 | 32 | 96693724 | 96694352 | Oryx dammah 59534 | CGG|GTGAGTGCCT...TGAACCTTATCT/GGAGTTTTCAAT...TCTAG|ATG | 1 | 1 | 87.285 |
| 140015861 | GT-AG | 0 | 0.0134983030335554 | 3195 | rna-XM_040237309.1 25387404 | 33 | 96690417 | 96693611 | Oryx dammah 59534 | AGC|GTATGTATCA...CCCCTTTTAAAA/ACATCTCTGATT...AACAG|GCG | 2 | 1 | 89.927 |
| 140015862 | GT-AG | 0 | 0.0357114733955055 | 795 | rna-XM_040237309.1 25387404 | 34 | 96689494 | 96690288 | Oryx dammah 59534 | AAG|GTATATTTTC...TACAACTTAACC/CACTCTCTCAAA...TCTAG|ATG | 1 | 1 | 92.946 |
| 140015863 | GT-AG | 0 | 0.0005545369541146 | 2077 | rna-XM_040237309.1 25387404 | 35 | 96687262 | 96689338 | Oryx dammah 59534 | CCT|GTAAGTTTTC...TAATCTTTGCTG/TCTTTGCTGATT...CTTAG|GTT | 0 | 1 | 96.603 |
| 140015864 | GT-AG | 0 | 1.000000099473604e-05 | 2430 | rna-XM_040237309.1 25387404 | 36 | 96684756 | 96687185 | Oryx dammah 59534 | TTG|GTTAGTATTT...TTTCTCTTTCCT/GTGTCTGTTACA...TTTAG|ATG | 1 | 1 | 98.396 |
| 140019952 | GT-AG | 0 | 1.000000099473604e-05 | 3262 | rna-XM_040237309.1 25387404 | 1 | 96738554 | 96741815 | Oryx dammah 59534 | CAG|GTGAGCGCTC...GTATTTGTACTT/CTATTTCTCAAG...TTTAG|GCA | 0 | 5.662 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);