introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
37 rows where transcript_id = 720746
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821604 | GT-AG | 0 | 1.000000099473604e-05 | 189 | rna-gnl|I4U23|004289-T1 720746 | 1 | 13187935 | 13188123 | Adineta vaga 104782 | GAG|GTGAGAATCT...TGCACCTAAACT/TTTCTTTTCATA...AATAG|GTG | 1 | 1 | 0.195 |
3821605 | GT-AG | 0 | 0.0005322260809284 | 2056 | rna-gnl|I4U23|004289-T1 720746 | 2 | 13185675 | 13187730 | Adineta vaga 104782 | AAC|GTGTACCGCT...AATATGTTAACA/AATATGTTAACA...TATAG|GTA | 1 | 1 | 2.677 |
3821606 | GT-AG | 0 | 9.726837981489124e-05 | 826 | rna-gnl|I4U23|004289-T1 720746 | 3 | 13184345 | 13185170 | Adineta vaga 104782 | ATG|GTGTTTTTGT...TTTTTTTTATGA/CTTTTTTTTATG...TATAG|GTA | 1 | 1 | 8.811 |
3821607 | GT-AG | 0 | 0.1944225981629561 | 715 | rna-gnl|I4U23|004289-T1 720746 | 4 | 13183126 | 13183840 | Adineta vaga 104782 | ATG|GTATTTTTGT...TGTTTTTTAAAA/AAAATATTTATT...TTTAG|ATA | 1 | 1 | 14.945 |
3821608 | GT-AG | 0 | 1.000000099473604e-05 | 502 | rna-gnl|I4U23|004289-T1 720746 | 5 | 13182068 | 13182569 | Adineta vaga 104782 | CTG|GTCAAATAAC...ATAATTTTATTT/ATTTTATTTATA...TATAG|TAT | 2 | 1 | 21.711 |
3821609 | GT-AG | 0 | 1.919917274065151e-05 | 47 | rna-gnl|I4U23|004289-T1 720746 | 6 | 13181893 | 13181939 | Adineta vaga 104782 | AAG|GTTTGTTTGA...TAAACCTCACTA/TTAAACCTCACT...TCTAG|GAC | 1 | 1 | 23.269 |
3821610 | GT-AG | 0 | 0.000803921912047 | 47 | rna-gnl|I4U23|004289-T1 720746 | 7 | 13181717 | 13181763 | Adineta vaga 104782 | AAG|GTTTGTTTTA...AAAATTTTAACT/AAAATTTTAACT...TCTAG|AAC | 1 | 1 | 24.839 |
3821611 | GT-AG | 0 | 1.919917274065151e-05 | 47 | rna-gnl|I4U23|004289-T1 720746 | 8 | 13181529 | 13181575 | Adineta vaga 104782 | AAG|GTTTGTTTGA...TAAACCTCACTA/TTAAACCTCACT...TCTAG|GAC | 1 | 1 | 26.555 |
3821612 | GT-AG | 0 | 9.440537893100389e-05 | 47 | rna-gnl|I4U23|004289-T1 720746 | 9 | 13181335 | 13181381 | Adineta vaga 104782 | AAG|GTTTGTTTCA...AGTTTATTAAAA/TTAAAGTTTATT...TCTAG|AAC | 1 | 1 | 28.344 |
3821613 | GT-AG | 0 | 1.4717849562521769e-05 | 47 | rna-gnl|I4U23|004289-T1 720746 | 10 | 13181147 | 13181193 | Adineta vaga 104782 | AAG|GTTTGTTTCA...CACATATTAAAG/TTAAAGTTCACT...TCTAG|GAC | 1 | 1 | 30.06 |
3821614 | GT-AG | 0 | 6.460338042554517e-05 | 36 | rna-gnl|I4U23|004289-T1 720746 | 11 | 13180985 | 13181020 | Adineta vaga 104782 | CTC|GTAAGTGTTG...TCATTTTTAGAT/TTCATTTTTAGA...TACAG|ATG | 1 | 1 | 31.593 |
3821615 | GT-AG | 0 | 0.0286311257592399 | 50 | rna-gnl|I4U23|004289-T1 720746 | 12 | 13180462 | 13180511 | Adineta vaga 104782 | ACT|GTAACTAATT...TTTTTCTTTTCT/ATTTGAATTATT...TATAG|AAT | 0 | 1 | 37.349 |
3821616 | GT-AG | 0 | 0.0024462584221878 | 62 | rna-gnl|I4U23|004289-T1 720746 | 13 | 13180288 | 13180349 | Adineta vaga 104782 | CTA|GTATGTATTA...AAAATTTCAAAT/AAATAACTGACA...TATAG|ATA | 1 | 1 | 38.712 |
3821617 | GT-AG | 0 | 0.001564421028387 | 72 | rna-gnl|I4U23|004289-T1 720746 | 14 | 13180020 | 13180091 | Adineta vaga 104782 | GGC|GTAAGTTCTA...TCTTCTTTATTT/TTCTTCTTTATT...TATAG|TTG | 2 | 1 | 41.098 |
3821618 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|004289-T1 720746 | 15 | 13179750 | 13179810 | Adineta vaga 104782 | TCG|GTAAATACAT...TTTTTTTTCACT/TTTTTTTTCACT...TTAAG|GAA | 1 | 1 | 43.641 |
3821619 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|004289-T1 720746 | 16 | 13179441 | 13179494 | Adineta vaga 104782 | ATG|GTAAATGTTT...TGAATCTCATTT/TTGAATCTCATT...TATAG|CTT | 1 | 1 | 46.745 |
3821620 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|I4U23|004289-T1 720746 | 17 | 13179053 | 13179101 | Adineta vaga 104782 | GAA|GTAATGAAAT...TTTTTCTTATTT/ATTTTTCTTATT...TACAG|ATG | 1 | 1 | 50.87 |
3821621 | GT-AG | 0 | 0.0025615592950668 | 55 | rna-gnl|I4U23|004289-T1 720746 | 18 | 13178939 | 13178993 | Adineta vaga 104782 | ATC|GTAAGTTTTA...TTTTCTTCAATT/CTTCAATTCATT...TTAAG|TCT | 0 | 1 | 51.588 |
3821622 | GT-AG | 0 | 0.2043614665817523 | 61 | rna-gnl|I4U23|004289-T1 720746 | 19 | 13178683 | 13178743 | Adineta vaga 104782 | TCA|GTATGATTTT...TTTTCTTTAGAC/ATTTTCTTTAGA...AATAG|ATA | 0 | 1 | 53.961 |
3821623 | GT-AG | 0 | 2.5615874532949477e-05 | 56 | rna-gnl|I4U23|004289-T1 720746 | 20 | 13178501 | 13178556 | Adineta vaga 104782 | ACT|GTAAGATTAT...TTTTATTTGATA/TTTTATTTGATA...TCAAG|GCA | 0 | 1 | 55.495 |
3821624 | GT-AG | 0 | 0.0001482410609101 | 57 | rna-gnl|I4U23|004289-T1 720746 | 21 | 13178329 | 13178385 | Adineta vaga 104782 | CAA|GTAAGTTTTA...TGCTTTTCAATT/ATGCTTTTCAAT...TAAAG|AAT | 1 | 1 | 56.894 |
3821625 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|004289-T1 720746 | 22 | 13178196 | 13178251 | Adineta vaga 104782 | ACA|GTAAGAAATT...TTGTTGTTGAAT/TTGTTGTTGAAT...TAAAG|GCA | 0 | 1 | 57.831 |
3821626 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|004289-T1 720746 | 23 | 13177902 | 13177955 | Adineta vaga 104782 | CGA|GTAAAAGAAC...CTCTCCTTTTAA/TCAACATTCAAA...TTTAG|TCA | 0 | 1 | 60.752 |
3821627 | GT-AG | 0 | 0.0005067932050494 | 42 | rna-gnl|I4U23|004289-T1 720746 | 24 | 13177832 | 13177873 | Adineta vaga 104782 | TAG|GTAACTCTCC...TAATCTTTCGAG/CATATACTAATC...ATTAG|TTT | 1 | 1 | 61.093 |
3821628 | GT-AG | 0 | 2.536940917018056e-05 | 53 | rna-gnl|I4U23|004289-T1 720746 | 25 | 13177458 | 13177510 | Adineta vaga 104782 | TGA|GTAAGTATAT...TTTGTTTAAATG/GACTATTTCAAA...TTTAG|ATT | 1 | 1 | 64.999 |
3821629 | GT-AG | 0 | 1.138155158651926e-05 | 61 | rna-gnl|I4U23|004289-T1 720746 | 26 | 13177143 | 13177203 | Adineta vaga 104782 | GTT|GTAAGTAACA...CAAATTTTAGAA/AAAAAACTTACG...TTAAG|CTT | 0 | 1 | 68.091 |
3821630 | GT-AG | 0 | 7.869656491206925e-05 | 52 | rna-gnl|I4U23|004289-T1 720746 | 27 | 13176863 | 13176914 | Adineta vaga 104782 | CAA|GTAAATTCAT...ATTTCATTATTT/AATCATTTCATT...TTTAG|TTA | 0 | 1 | 70.865 |
3821631 | GT-AG | 0 | 5.076956419316885e-05 | 58 | rna-gnl|I4U23|004289-T1 720746 | 28 | 13176703 | 13176760 | Adineta vaga 104782 | AAA|GTAGAATTTG...TTTGATTTGATT/TTTGATTTGATT...TAAAG|GTT | 0 | 1 | 72.107 |
3821632 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|004289-T1 720746 | 29 | 13176536 | 13176586 | Adineta vaga 104782 | AAA|GTAAATCATT...TAAAATTTGATT/TAAAATTTGATT...TTTAG|ACC | 2 | 1 | 73.518 |
3821633 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|004289-T1 720746 | 30 | 13176318 | 13176369 | Adineta vaga 104782 | GCA|GTTAGTAGAA...ATATTTTTGATT/ATATTTTTGATT...AATAG|ATC | 0 | 1 | 75.539 |
3821634 | GT-AG | 0 | 0.0004905581408024 | 444 | rna-gnl|I4U23|004289-T1 720746 | 31 | 13175773 | 13176216 | Adineta vaga 104782 | ATG|GTTTTCAATA...TCAATTTTATTT/ATTTTATTTATC...CGAAG|TAT | 2 | 1 | 76.768 |
3821635 | GT-AG | 0 | 4.368114689520784e-05 | 58 | rna-gnl|I4U23|004289-T1 720746 | 32 | 13175624 | 13175681 | Adineta vaga 104782 | CAA|GTATGAAAGA...TTGTTTTTTGCT/TTTTTGCTCAAA...TATAG|ATA | 0 | 1 | 77.875 |
3821636 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|004289-T1 720746 | 33 | 13175451 | 13175509 | Adineta vaga 104782 | GAA|GTTTGTAGAA...TTCTATTTAGTT/ATTTAGTTCATT...GTTAG|GAT | 0 | 1 | 79.263 |
3821637 | GT-AG | 0 | 0.0002100040020742 | 55 | rna-gnl|I4U23|004289-T1 720746 | 34 | 13175150 | 13175204 | Adineta vaga 104782 | AAA|GTATGAAAAG...TTTCTTTTGATT/AGATTTTTCACT...AATAG|ATT | 0 | 1 | 82.256 |
3821638 | GT-AG | 0 | 1.21845793868354e-05 | 52 | rna-gnl|I4U23|004289-T1 720746 | 35 | 13174425 | 13174476 | Adineta vaga 104782 | AAG|GTTTGTAAGA...TTTATCTTAAAC/TCTATTTTTATC...TTTAG|ATT | 1 | 1 | 90.447 |
3821639 | GT-AG | 0 | 0.0041518880788407 | 93 | rna-gnl|I4U23|004289-T1 720746 | 36 | 13173747 | 13173839 | Adineta vaga 104782 | TTT|GTTTTATTTT...TAGTTATTGATA/TAGTTATTGATA...AAGAG|CGA | 1 | 1 | 97.566 |
3821640 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|004289-T1 720746 | 37 | 13173499 | 13173555 | Adineta vaga 104782 | CGA|GTAAATATAT...TGCTTTATAAAT/AATTTATTCATT...TTTAG|ATT | 0 | 1 | 99.89 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);