introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
57 rows where transcript_id = 720740
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821452 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 1 | 11325817 | 11325869 | Adineta vaga 104782 | AAG|GTTAGATCTA...TTATTTTTTGTA/TTTTGTATGATT...AATAG|GTG | 1 | 1 | 1.037 |
3821453 | GT-AG | 0 | 0.0002135351899285 | 58 | rna-gnl|I4U23|003644-T1 720740 | 2 | 11325914 | 11325971 | Adineta vaga 104782 | CTC|GTAAATCTTT...GCATCGTTAATT/GTTGTTATTACT...TTTAG|GCG | 0 | 1 | 1.494 |
3821454 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003644-T1 720740 | 3 | 11326015 | 11326065 | Adineta vaga 104782 | TTC|GTAGGTCAAA...ATTAGTTTGAAA/TTTTAATTGAAT...TTTAG|GTA | 1 | 1 | 1.94 |
3821455 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|003644-T1 720740 | 4 | 11326264 | 11326315 | Adineta vaga 104782 | TGG|GTAGAATTAT...GATTTCTTCGTT/CTTGATTTCATT...TATAG|AAC | 1 | 1 | 3.994 |
3821456 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003644-T1 720740 | 5 | 11326492 | 11326545 | Adineta vaga 104782 | CAG|GTAAATATTT...TTAATCGTATTT/TGTTGTATGAAT...AATAG|ACA | 0 | 1 | 5.82 |
3821457 | GT-AG | 0 | 0.0006986412983541 | 54 | rna-gnl|I4U23|003644-T1 720740 | 6 | 11326609 | 11326662 | Adineta vaga 104782 | AAA|GTAAATTTTA...AAATCTTCGATT/AAATCTTCGATT...CATAG|TTT | 0 | 1 | 6.474 |
3821458 | GT-AG | 0 | 0.0032758570340181 | 55 | rna-gnl|I4U23|003644-T1 720740 | 7 | 11326738 | 11326792 | Adineta vaga 104782 | ATT|GTATAGATTT...TCTTGTTTGATT/TCTTGTTTGATT...TATAG|GAT | 0 | 1 | 7.252 |
3821459 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|003644-T1 720740 | 8 | 11326854 | 11326905 | Adineta vaga 104782 | ATG|GTCAGTTTCT...CATATTGTGATT/TGTGATTTCATC...TGTAG|TTT | 1 | 1 | 7.885 |
3821460 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|003644-T1 720740 | 9 | 11327037 | 11327092 | Adineta vaga 104782 | CAG|GTTCGAGAGA...AATCCATTGATT/CATTGATTAATT...TATAG|TAT | 0 | 1 | 9.244 |
3821461 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 10 | 11327200 | 11327252 | Adineta vaga 104782 | ACG|GTAAGAATAT...TTACTTTTGATA/TTACTTTTGATA...TATAG|TTG | 2 | 1 | 10.354 |
3821462 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|003644-T1 720740 | 11 | 11327422 | 11327478 | Adineta vaga 104782 | CGG|GTTAATATGA...TTTCTTTTATCT/GTTTCTTTTATC...TTCAG|GAA | 0 | 1 | 12.107 |
3821463 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|003644-T1 720740 | 12 | 11327542 | 11327602 | Adineta vaga 104782 | GAG|GTACGAAATG...ATGTTTTTATCT/CTTTTTTTCAGT...GTTAG|GTT | 0 | 1 | 12.761 |
3821464 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|003644-T1 720740 | 13 | 11327740 | 11327794 | Adineta vaga 104782 | TTT|GTTAGATTTA...AATGTCATGACT/AATGTCATGACT...TATAG|TGC | 2 | 1 | 14.182 |
3821465 | GT-AG | 0 | 0.0003449451597458 | 59 | rna-gnl|I4U23|003644-T1 720740 | 14 | 11327844 | 11327902 | Adineta vaga 104782 | GAA|GTAGATATCT...TTATTTTTGAAT/TTATTTTTGAAT...TTTAG|TAT | 0 | 1 | 14.69 |
3821466 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003644-T1 720740 | 15 | 11327983 | 11328033 | Adineta vaga 104782 | AAA|GTAAATCATC...GGCTTCTTTTCT/TTCAATTTTATA...AATAG|GTT | 2 | 1 | 15.52 |
3821467 | GT-AG | 0 | 0.0021772279359099 | 58 | rna-gnl|I4U23|003644-T1 720740 | 16 | 11328112 | 11328169 | Adineta vaga 104782 | TCT|GTTCGTTATT...ACTTTCTTAATC/TTTTTTTTTACT...TCTAG|AGC | 2 | 1 | 16.329 |
3821468 | GT-AG | 0 | 0.0046777872028801 | 57 | rna-gnl|I4U23|003644-T1 720740 | 17 | 11328372 | 11328428 | Adineta vaga 104782 | GCG|GTATTATTTC...AGATCTTTTTTT/TTTTGTGTGAAA...TTTAG|ATG | 0 | 1 | 18.425 |
3821469 | GT-AG | 0 | 0.0009298740263361 | 57 | rna-gnl|I4U23|003644-T1 720740 | 18 | 11328488 | 11328544 | Adineta vaga 104782 | AGG|GTATATTGAG...TGTATCTTTCTT/AAGTGTCTGATG...TTTAG|AGT | 2 | 1 | 19.037 |
3821470 | GT-AG | 0 | 0.0007357357924855 | 59 | rna-gnl|I4U23|003644-T1 720740 | 19 | 11328801 | 11328859 | Adineta vaga 104782 | CAA|GTATTAAATA...AATTTTTTATTT/TAATTTTTTATT...TCTAG|AGT | 0 | 1 | 21.693 |
3821471 | GT-AG | 0 | 0.000209049850515 | 55 | rna-gnl|I4U23|003644-T1 720740 | 20 | 11328942 | 11328996 | Adineta vaga 104782 | CAA|GTACGTAGAA...CATTTCTTAAAC/TCATTTTTCAAT...TTCAG|ATG | 1 | 1 | 22.544 |
3821472 | GT-AG | 0 | 0.0298507956247591 | 53 | rna-gnl|I4U23|003644-T1 720740 | 21 | 11329115 | 11329167 | Adineta vaga 104782 | ACT|GTATGTTGTC...TTTGTTTTGTTA/ATTGATTTCAAT...TGTAG|TTG | 2 | 1 | 23.768 |
3821473 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 22 | 11329307 | 11329359 | Adineta vaga 104782 | GAG|GTGAGTACAA...GTTTTCTTGATT/GTTTTCTTGATT...TTTAG|TTA | 0 | 1 | 25.21 |
3821474 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-gnl|I4U23|003644-T1 720740 | 23 | 11329510 | 11329557 | Adineta vaga 104782 | TAT|GTAAGCAAAC...ATCATTTCAGAT/TATCATTTCAGA...TTTAG|ATT | 0 | 1 | 26.766 |
3821475 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|I4U23|003644-T1 720740 | 24 | 11331281 | 11331347 | Adineta vaga 104782 | ATG|GTAAGCAGGA...CCTATTTTAATG/TGTATGCTGATT...TATAG|TGA | 1 | 1 | 44.642 |
3821476 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|003644-T1 720740 | 25 | 11331683 | 11331739 | Adineta vaga 104782 | AAT|GTAAGATTCA...GACGATTTAACA/TCAATATTTATT...TTTAG|TTC | 0 | 1 | 48.117 |
3821477 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|003644-T1 720740 | 26 | 11331863 | 11331924 | Adineta vaga 104782 | CAT|GTGAGAAAAA...GCATTCTTATTC/TGCATTCTTATT...TCCAG|ATT | 0 | 1 | 49.393 |
3821478 | GT-AG | 0 | 0.0026272260799033 | 46 | rna-gnl|I4U23|003644-T1 720740 | 27 | 11332178 | 11332223 | Adineta vaga 104782 | CCG|GTTTGTTTCA...TTATCTTTAAAT/TTATCTTTAAAT...TTCAG|GTG | 1 | 1 | 52.018 |
3821479 | GT-AG | 0 | 0.0086134553591537 | 57 | rna-gnl|I4U23|003644-T1 720740 | 28 | 11332370 | 11332426 | Adineta vaga 104782 | AAA|GTATTTATCT...TAGTTTTTCATT/TAGTTTTTCATT...TTTAG|TTG | 0 | 1 | 53.533 |
3821480 | GT-AG | 0 | 1.000000099473604e-05 | 63 | rna-gnl|I4U23|003644-T1 720740 | 29 | 11332538 | 11332600 | Adineta vaga 104782 | GAT|GTAAGAGTTT...GAGATCTTGATT/GAGATCTTGATT...TTTAG|ATA | 0 | 1 | 54.684 |
3821481 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003644-T1 720740 | 30 | 11332754 | 11332804 | Adineta vaga 104782 | GAG|GTACAATGAG...TGATTTTTATAA/GTGATTTTTATA...ACTAG|TAT | 0 | 1 | 56.271 |
3821482 | GT-AG | 0 | 0.0002470892886401 | 59 | rna-gnl|I4U23|003644-T1 720740 | 31 | 11332844 | 11332902 | Adineta vaga 104782 | AAG|GTATATCTAG...GATGTTTTCATT/GATGTTTTCATT...TTCAG|GAA | 0 | 1 | 56.676 |
3821483 | GT-AG | 0 | 0.0024276698217413 | 55 | rna-gnl|I4U23|003644-T1 720740 | 32 | 11333491 | 11333545 | Adineta vaga 104782 | AAA|GTAAACTTTT...ATTTCATCAATA/TTATATTTCATC...TTTAG|ATA | 0 | 1 | 62.776 |
3821484 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 33 | 11333710 | 11333762 | Adineta vaga 104782 | TGA|GTAAGACATG...TGTTTGTTAATG/TGTTTGTTAATG...GATAG|GAA | 2 | 1 | 64.478 |
3821485 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|I4U23|003644-T1 720740 | 34 | 11333851 | 11333899 | Adineta vaga 104782 | GTC|GTAAGTGAGT...TTGTATTTGATA/TTGTATTTGATA...TTTAG|ATA | 0 | 1 | 65.391 |
3821486 | GT-AG | 0 | 0.0001987716567472 | 52 | rna-gnl|I4U23|003644-T1 720740 | 35 | 11334032 | 11334083 | Adineta vaga 104782 | GAT|GTAAATATTA...TTTTCTTCGACG/AATTATTTGATG...TGTAG|CGT | 0 | 1 | 66.76 |
3821487 | GT-AG | 0 | 0.010605180621655 | 58 | rna-gnl|I4U23|003644-T1 720740 | 36 | 11334165 | 11334222 | Adineta vaga 104782 | ATT|GTATGTATAA...TATTTCTCATTT/GTATTTCTCATT...AATAG|ATT | 0 | 1 | 67.6 |
3821488 | GT-AG | 0 | 0.0003491443757672 | 55 | rna-gnl|I4U23|003644-T1 720740 | 37 | 11334343 | 11334397 | Adineta vaga 104782 | TTG|GTATAGATTT...ATGTTTATAATT/TTTATTTTTATG...TCTAG|ACA | 0 | 1 | 68.845 |
3821489 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003644-T1 720740 | 38 | 11334500 | 11334550 | Adineta vaga 104782 | AAG|GTTAGTTGAT...TATTTCTAAGTT/TTTTGTTTCACT...TATAG|ATT | 0 | 1 | 69.904 |
3821490 | GT-AG | 0 | 0.0016698790686261 | 62 | rna-gnl|I4U23|003644-T1 720740 | 39 | 11334641 | 11334702 | Adineta vaga 104782 | AAG|GTTTGCTTCA...TAGATCTTGAAA/AGTAATTTCATA...TATAG|GCT | 0 | 1 | 70.837 |
3821491 | GT-AG | 0 | 4.214259971739167e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 40 | 11334832 | 11334884 | Adineta vaga 104782 | ATG|GTAAAATTGA...TTTCTTTTAATT/TTTCTTTTAATT...TTCAG|ATC | 0 | 1 | 72.176 |
3821492 | GT-AG | 0 | 1.151980027110554 | 53 | rna-gnl|I4U23|003644-T1 720740 | 41 | 11334972 | 11335024 | Adineta vaga 104782 | ATA|GTATCGATTA...CTTTCTATAATC/TATAATCTCACA...TTTAG|ATG | 0 | 1 | 73.078 |
3821493 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|003644-T1 720740 | 42 | 11335097 | 11335154 | Adineta vaga 104782 | CAA|GTAAAAAATT...TTATCTATGAAA/TGTATTTTCAAA...CTTAG|ATA | 0 | 1 | 73.825 |
3821494 | GT-AG | 0 | 1.6498223611360433e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 43 | 11335439 | 11335491 | Adineta vaga 104782 | TAG|GTAAGTTTCT...ATTGTCTTTTCA/TGTCTTTTCACA...ATTAG|ACA | 2 | 1 | 76.771 |
3821495 | GT-AG | 0 | 0.0014221375723527 | 62 | rna-gnl|I4U23|003644-T1 720740 | 44 | 11335607 | 11335668 | Adineta vaga 104782 | CGT|GTAATTTTCT...TCATTCTAAATA/AAAAGATTCATT...AATAG|TCT | 0 | 1 | 77.965 |
3821496 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|I4U23|003644-T1 720740 | 45 | 11335792 | 11335841 | Adineta vaga 104782 | AAT|GTGAGTTATC...TTTGTCTTGAAA/GAAAATCTAATC...TTTAG|CTT | 0 | 1 | 79.241 |
3821497 | GT-AG | 0 | 0.000607164599112 | 49 | rna-gnl|I4U23|003644-T1 720740 | 46 | 11336139 | 11336187 | Adineta vaga 104782 | CGT|GTAAGTTTTG...ATATCATTATCA/ATCATTATCATT...TATAG|CGT | 0 | 1 | 82.322 |
3821498 | GT-AG | 0 | 0.0328662127413842 | 56 | rna-gnl|I4U23|003644-T1 720740 | 47 | 11336472 | 11336527 | Adineta vaga 104782 | CAA|GTATGCAATA...TATTTCTCAATC/ATATTTCTCAAT...TTTAG|ATA | 2 | 1 | 85.268 |
3821499 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|003644-T1 720740 | 48 | 11336616 | 11336670 | Adineta vaga 104782 | AAG|GTTAGTTTTC...TATATCTAATAT/TTATATCTAATA...TTTAG|AAA | 0 | 1 | 86.181 |
3821500 | GT-AG | 0 | 3.208752470826608e-05 | 51 | rna-gnl|I4U23|003644-T1 720740 | 49 | 11336803 | 11336853 | Adineta vaga 104782 | AAA|GTAAATTATT...ACTTTTCTATTC/TTTCTATTCAGT...TCCAG|ATT | 0 | 1 | 87.551 |
3821501 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|003644-T1 720740 | 50 | 11336976 | 11337034 | Adineta vaga 104782 | AAA|GTAAAATCAG...TATTTTTTGAGT/TATTTTTTGAGT...TATAG|ATT | 2 | 1 | 88.816 |
3821502 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|003644-T1 720740 | 51 | 11337231 | 11337295 | Adineta vaga 104782 | AAA|GTAAGACATA...TTAGTTTTGATT/TTAGTTTTGATT...TTTAG|GGT | 0 | 1 | 90.85 |
3821503 | GT-AG | 0 | 0.0169864366284147 | 52 | rna-gnl|I4U23|003644-T1 720740 | 52 | 11337533 | 11337584 | Adineta vaga 104782 | AAA|GTATTTTGCA...GTTTCTATATTC/TCTATATTCAGT...AATAG|GAT | 0 | 1 | 93.308 |
3821504 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-gnl|I4U23|003644-T1 720740 | 53 | 11337755 | 11337822 | Adineta vaga 104782 | AAG|GTAAAATGTT...TTTTTTCTATCC/AAACTGTTGATT...TGTAG|AAA | 2 | 1 | 95.072 |
3821505 | GT-AG | 0 | 7.025871367131374e-05 | 64 | rna-gnl|I4U23|003644-T1 720740 | 54 | 11337938 | 11338001 | Adineta vaga 104782 | AAT|GTAAGTTTTA...TACTACGTGATT/AAAAAATTAACT...GTTAG|AGA | 0 | 1 | 96.265 |
3821506 | GT-AG | 0 | 0.0007267475347547 | 51 | rna-gnl|I4U23|003644-T1 720740 | 55 | 11338100 | 11338150 | Adineta vaga 104782 | AAA|GTAATTTTTC...TCTCTATTGATT/TTATGATTTATT...CGTAG|ACA | 2 | 1 | 97.282 |
3821507 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|003644-T1 720740 | 56 | 11338284 | 11338336 | Adineta vaga 104782 | AAA|GTAAATGGAA...CATTTCTAAAAT/TTTGATTTCATT...TTTAG|GAA | 0 | 1 | 98.662 |
3821508 | GT-AG | 0 | 0.0604196012161396 | 48 | rna-gnl|I4U23|003644-T1 720740 | 57 | 11338418 | 11338465 | Adineta vaga 104782 | CGA|GTATGTTATT...TTACTTTTATCT/TTTACTTTTATC...TTTAG|TAT | 0 | 1 | 99.502 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);