introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 6831582
This data as json, CSV (advanced)
Suggested facets: score, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
35540535 | GT-AG | 0 | 1.000000099473604e-05 | 883 | rna-XM_009701919.1 6831582 | 1 | 36654 | 37536 | Cariama cristata 54380 | CAG|GTAGGTCAAA...ATAGCCTTTTTC/GGTTTTCTGATA...CAAAG|AGC | 1 | 1 | 3.715 |
35540536 | GT-AG | 0 | 1.000000099473604e-05 | 1778 | rna-XM_009701919.1 6831582 | 2 | 34683 | 36460 | Cariama cristata 54380 | CCA|GTAAGTACAG...TCTATCTCAGCT/CTCTATCTCAGC...AACAG|ATA | 2 | 1 | 10.48 |
35540537 | GT-AG | 0 | 1.000000099473604e-05 | 754 | rna-XM_009701919.1 6831582 | 3 | 33863 | 34616 | Cariama cristata 54380 | AAG|GTCAGTGGGC...TTTGTCTTAATT/TTTGTCTTAATT...TGCAG|TGA | 2 | 1 | 12.794 |
35540538 | GT-AG | 0 | 1.000000099473604e-05 | 457 | rna-XM_009701919.1 6831582 | 4 | 33240 | 33696 | Cariama cristata 54380 | GAG|GTAGGAAGCT...CCTGCCTTCTTT/GTGTTGTTCAAG...ATCAG|GGA | 0 | 1 | 18.612 |
35540539 | GT-AG | 0 | 1.000000099473604e-05 | 337 | rna-XM_009701919.1 6831582 | 5 | 32774 | 33110 | Cariama cristata 54380 | ACG|GTGAGTCATC...TCCATCTTGACT/ATATTTTTCATC...TTCAG|GAC | 0 | 1 | 23.134 |
35540540 | GT-AG | 0 | 1.000000099473604e-05 | 1791 | rna-XM_009701919.1 6831582 | 6 | 30863 | 32653 | Cariama cristata 54380 | CAG|GTGAGACTGC...AACATCTTCTCC/GTTTGCATCAAT...TCCAG|GGA | 0 | 1 | 27.34 |
35540541 | GT-AG | 0 | 1.000000099473604e-05 | 388 | rna-XM_009701919.1 6831582 | 7 | 30398 | 30785 | Cariama cristata 54380 | ACT|GTAAGTGAAA...TTTTTCATAGTC/GTGTTTTTCATA...CACAG|GTA | 2 | 1 | 30.039 |
35540542 | GT-AG | 0 | 1.000000099473604e-05 | 714 | rna-XM_009701919.1 6831582 | 8 | 29617 | 30330 | Cariama cristata 54380 | AAG|GTTAGCAGTG...CTTCCCTTGTTA/CTATTATGCATT...TCCAG|GCT | 0 | 1 | 32.387 |
35540543 | GT-AG | 0 | 1.000000099473604e-05 | 2681 | rna-XM_009701919.1 6831582 | 9 | 26739 | 29419 | Cariama cristata 54380 | CAA|GTAAGGAGCC...CTTGCTTTGAAT/CTTGCTTTGAAT...TGCAG|GGG | 2 | 1 | 39.292 |
35540544 | GT-AG | 0 | 2.6223086462044077e-05 | 838 | rna-XM_009701919.1 6831582 | 10 | 25835 | 26672 | Cariama cristata 54380 | TGG|GTAAGCCTTG...TCACCTTTTGCC/GCCATGCTGACT...CGCAG|GTT | 2 | 1 | 41.605 |
35540545 | GT-AG | 0 | 1.2527713626055198e-05 | 1936 | rna-XM_009701919.1 6831582 | 11 | 23800 | 25735 | Cariama cristata 54380 | TGA|GTAAGTAAAA...ACAGTCTTAAAT/ATTCTGTTTATC...TGCAG|GTA | 2 | 1 | 45.075 |
35540546 | GT-AG | 0 | 1.000000099473604e-05 | 305 | rna-XM_009701919.1 6831582 | 12 | 23336 | 23640 | Cariama cristata 54380 | TAG|GTAAAGGTGG...TGTTTTCTATTG/TTGTTTTCTATT...TCCAG|AGG | 2 | 1 | 50.648 |
35540547 | GT-AG | 0 | 1.000000099473604e-05 | 1507 | rna-XM_009701919.1 6831582 | 13 | 21717 | 23223 | Cariama cristata 54380 | GTG|GTGAGTATGC...TCTGCTTTCAAA/AATGAATTCACT...TAAAG|GGT | 0 | 1 | 54.574 |
35540548 | GT-AG | 0 | 1.000000099473604e-05 | 1534 | rna-XM_009701919.1 6831582 | 14 | 20105 | 21638 | Cariama cristata 54380 | AAG|GTGAGCACGT...AACCCTTTATTT/CTCTGTCTAATG...TGCAG|GGA | 0 | 1 | 57.308 |
35540549 | GT-AG | 0 | 1.000000099473604e-05 | 1251 | rna-XM_009701919.1 6831582 | 15 | 18726 | 19976 | Cariama cristata 54380 | AGC|GTGAGCAACA...AATGCCGTGTTA/AAAAGCCTAATG...TGCAG|TCA | 2 | 1 | 61.795 |
35540550 | GT-AG | 0 | 1.000000099473604e-05 | 578 | rna-XM_009701919.1 6831582 | 16 | 17977 | 18554 | Cariama cristata 54380 | AAG|GTATGAGTCG...CCTCCCTCACCT/TCACCTCTCATC...TGCAG|GGG | 2 | 1 | 67.788 |
35540551 | GT-AG | 0 | 1.000000099473604e-05 | 570 | rna-XM_009701919.1 6831582 | 17 | 17313 | 17882 | Cariama cristata 54380 | AAG|GTAAAATTTA...TTTCCTCTATCA/TAAAATTTAATT...TGCAG|GCA | 0 | 1 | 71.083 |
35540552 | GT-AG | 0 | 1.4296846241399771e-05 | 1477 | rna-XM_009701919.1 6831582 | 18 | 15678 | 17154 | Cariama cristata 54380 | CCA|GTAAGTCAAA...TCAACCTTAAAA/AGGGAACTAAAC...TCTAG|GGG | 2 | 1 | 76.621 |
35540553 | GT-AG | 0 | 0.0001439490664218 | 1002 | rna-XM_009701919.1 6831582 | 19 | 14607 | 15608 | Cariama cristata 54380 | AAG|GTATGTGTCA...TCTGCTTTGCTT/ATAGGGCTCACT...CATAG|TGA | 2 | 1 | 79.04 |
35540554 | GT-AG | 0 | 1.000000099473604e-05 | 861 | rna-XM_009701919.1 6831582 | 20 | 13572 | 14432 | Cariama cristata 54380 | GAA|GTAAGTGACA...CTCTCTGTAATC/TGCCTTCTAAAT...TTCAG|ACA | 2 | 1 | 85.138 |
35540555 | GT-AG | 0 | 1.000000099473604e-05 | 423 | rna-XM_009701919.1 6831582 | 21 | 13106 | 13528 | Cariama cristata 54380 | CAG|GTAAGAAATA...ATGCTCTTACAG/GATGCTCTTACA...TTCAG|AGG | 0 | 1 | 86.646 |
35540556 | GT-AG | 0 | 1.000000099473604e-05 | 1039 | rna-XM_009701919.1 6831582 | 22 | 12005 | 13043 | Cariama cristata 54380 | GAT|GTAAGTCACT...GCATCTTTCACA/GCTGTTTTCATA...GACAG|CAC | 2 | 1 | 88.819 |
35540557 | GT-AG | 0 | 1.000000099473604e-05 | 1177 | rna-XM_009701919.1 6831582 | 23 | 10678 | 11854 | Cariama cristata 54380 | AAA|GTAAGTGGAA...TTTTTTTTTTCC/CTAGTTGTGACT...CCTAG|AAT | 2 | 1 | 94.076 |
35540558 | GT-AG | 0 | 1.000000099473604e-05 | 296 | rna-XM_009701919.1 6831582 | 24 | 10285 | 10580 | Cariama cristata 54380 | GAG|GTAAGGAAAT...TTCTCCTTGATG/TTCTCCTTGATG...TCCAG|TCA | 0 | 1 | 97.476 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);