introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
47 rows where transcript_id = 720734
This data as json, CSV (advanced)
Suggested facets: length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821231 | GT-AG | 0 | 0.0028801406887907 | 70 | rna-gnl|I4U23|001612-T1 720734 | 1 | 4980960 | 4981029 | Adineta vaga 104782 | AAA|GTATGTTGAT...TATATCTTTTTT/ATAGATTTCAAT...TTTAG|TTT | 0 | 1 | 1.07 |
3821232 | GT-AG | 0 | 9.1668394228617e-05 | 57 | rna-gnl|I4U23|001612-T1 720734 | 2 | 4981156 | 4981212 | Adineta vaga 104782 | ACA|GTAAGTATTT...ATAATCTCAATT/CTCAATTTCATA...AATAG|ACA | 0 | 1 | 2.139 |
3821233 | GT-AG | 0 | 1.000000099473604e-05 | 69 | rna-gnl|I4U23|001612-T1 720734 | 3 | 4981333 | 4981401 | Adineta vaga 104782 | AAG|GTAAGATTAG...TTTTTTTTAATC/TTTTTTTTAATC...AATAG|GTA | 0 | 1 | 3.158 |
3821234 | GT-AG | 0 | 0.0096857107256678 | 47 | rna-gnl|I4U23|001612-T1 720734 | 4 | 4981537 | 4981583 | Adineta vaga 104782 | CGA|GTATGTATTT...GTTTTCATATTA/TTTCATATTATA...TCTAG|GAT | 0 | 1 | 4.304 |
3821235 | GT-AG | 0 | 0.0025075483117299 | 53 | rna-gnl|I4U23|001612-T1 720734 | 5 | 4981787 | 4981839 | Adineta vaga 104782 | TAA|GTAAGTTTCT...ATTTCTTTGACT/ATTTCTTTGACT...TCTAG|ACC | 2 | 1 | 6.027 |
3821236 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|001612-T1 720734 | 6 | 4981965 | 4982015 | Adineta vaga 104782 | CTG|GTTTGTATAT...TCGATCTGATAT/GACGAATTGATT...TTTAG|GAG | 1 | 1 | 7.088 |
3821237 | GT-AG | 0 | 9.272851530924876e-05 | 65 | rna-gnl|I4U23|001612-T1 720734 | 7 | 4982271 | 4982335 | Adineta vaga 104782 | ATG|GTAATCATTG...AAACCTTCGATA/TATATATATATA...TTTAG|TCA | 1 | 1 | 9.252 |
3821238 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|001612-T1 720734 | 8 | 4983297 | 4983351 | Adineta vaga 104782 | AAA|GTAAGAACAA...TTCGTATTAATT/TTCGTATTAATT...TAAAG|GAG | 2 | 1 | 17.409 |
3821239 | GT-AG | 0 | 0.000369867749158 | 55 | rna-gnl|I4U23|001612-T1 720734 | 9 | 4983901 | 4983955 | Adineta vaga 104782 | CAA|GTAAACATAG...TTCTTCATAACT/GTTTTCTTCATA...TTTAG|ATT | 2 | 1 | 22.069 |
3821240 | GT-AG | 0 | 4.958674028858663e-05 | 52 | rna-gnl|I4U23|001612-T1 720734 | 10 | 4984824 | 4984875 | Adineta vaga 104782 | CAT|GTAAGTTTAT...TTTCACTTATCT/ATTCTTTTCACT...TGTAG|GAT | 0 | 1 | 29.437 |
3821241 | GT-AG | 0 | 0.0003724568360592 | 74 | rna-gnl|I4U23|001612-T1 720734 | 11 | 4985247 | 4985320 | Adineta vaga 104782 | AGG|GTAAATTTCC...TTTTTTTTATTT/CTTTTTTTTATT...TCTAG|TGA | 2 | 1 | 32.586 |
3821242 | GT-AG | 0 | 0.0010133951985674 | 68 | rna-gnl|I4U23|001612-T1 720734 | 12 | 4986321 | 4986388 | Adineta vaga 104782 | GAA|GTAAATTTAA...TAATTCTTATTT/CTAATTCTTATT...ATTAG|GTG | 0 | 1 | 41.075 |
3821243 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|001612-T1 720734 | 13 | 4986569 | 4986632 | Adineta vaga 104782 | GGA|GTAAGAAATC...TCTTTCTTTTTT/TTGTTGTTCAAT...TTTAG|GAA | 0 | 1 | 42.602 |
3821244 | GT-AG | 0 | 2.510136564741146e-05 | 62 | rna-gnl|I4U23|001612-T1 720734 | 14 | 4986795 | 4986856 | Adineta vaga 104782 | GAC|GTAATTACAT...TTTTTCTAAATG/CTTTTTCTAAAT...TTCAG|ATG | 0 | 1 | 43.978 |
3821245 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|001612-T1 720734 | 15 | 4987124 | 4987180 | Adineta vaga 104782 | AAA|GTGATTTGAT...TTTCTCTTTTCT/TCTTTTCTAATA...TCTAG|GTA | 0 | 1 | 46.244 |
3821246 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|001612-T1 720734 | 16 | 4987284 | 4987341 | Adineta vaga 104782 | ATG|GTAGAGATTA...TTGATATTAAAG/ATTTAATTGATA...TTTAG|ATG | 1 | 1 | 47.118 |
3821247 | GT-AG | 0 | 1.000000099473604e-05 | 186 | rna-gnl|I4U23|001612-T1 720734 | 17 | 4987399 | 4987584 | Adineta vaga 104782 | ATG|GTAAGTAAAA...CATTCTTTATCT/TTAGTTCTAATC...TTCAG|ACG | 1 | 1 | 47.602 |
3821248 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|001612-T1 720734 | 18 | 4987906 | 4987966 | Adineta vaga 104782 | CAG|GTAAAGAATT...AATGTCTTAGTA/GAATGTCTTAGT...TCAAG|GAA | 1 | 1 | 50.327 |
3821249 | GT-AG | 0 | 2.1051782918872317e-05 | 331 | rna-gnl|I4U23|001612-T1 720734 | 19 | 4988180 | 4988510 | Adineta vaga 104782 | TCG|GTAAATTAAA...CGTTTCATAATT/AATTTTCTAACT...TTTAG|CTA | 1 | 1 | 52.135 |
3821250 | GT-AG | 0 | 1.000000099473604e-05 | 69 | rna-gnl|I4U23|001612-T1 720734 | 20 | 4988792 | 4988860 | Adineta vaga 104782 | CAA|GTAAAAATCT...TTCACTTTCATT/TTCACTTTCATT...TTTAG|AGA | 0 | 1 | 54.52 |
3821251 | GT-AG | 0 | 8.122499773300904e-05 | 61 | rna-gnl|I4U23|001612-T1 720734 | 21 | 4989045 | 4989105 | Adineta vaga 104782 | GCC|GTATGAAAAA...GGTTGTTTATCA/TCTATTTTCATG...TTTAG|GAT | 1 | 1 | 56.082 |
3821252 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|001612-T1 720734 | 22 | 4989248 | 4989301 | Adineta vaga 104782 | AAA|GTAGGATGAT...TATGTTTCAAAG/TTATGTTTCAAA...TATAG|AGA | 2 | 1 | 57.287 |
3821253 | GT-AG | 0 | 0.0334644293601936 | 56 | rna-gnl|I4U23|001612-T1 720734 | 23 | 4989371 | 4989426 | Adineta vaga 104782 | ACC|GTATGTATCT...TTTATTTTAGAT/TTTAGATTTATG...GTTAG|GAA | 2 | 1 | 57.873 |
3821254 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|001612-T1 720734 | 24 | 4989637 | 4989694 | Adineta vaga 104782 | TAA|GTAATATGCC...CTTCTTTTATAT/TTATTATTGATG...TTTAG|GAA | 2 | 1 | 59.655 |
3821255 | GT-AG | 0 | 0.0008014785005077 | 58 | rna-gnl|I4U23|001612-T1 720734 | 25 | 4989845 | 4989902 | Adineta vaga 104782 | GAA|GTAAGTTTGC...TTTATCTTAAAA/TTAGCTTTTATC...TCTAG|ATT | 2 | 1 | 60.929 |
3821256 | GT-AG | 0 | 0.0001107138264127 | 214 | rna-gnl|I4U23|001612-T1 720734 | 26 | 4990076 | 4990289 | Adineta vaga 104782 | AAG|GTATAAATTT...TTGTCTTCAATC/TTCAATCTAATT...TCAAG|GAA | 1 | 1 | 62.397 |
3821257 | GT-AG | 0 | 0.0003080053858662 | 72 | rna-gnl|I4U23|001612-T1 720734 | 27 | 4990397 | 4990468 | Adineta vaga 104782 | TGT|GTAAGCAATA...GATTCATTAATT/GATTCATTAATT...TACAG|GAT | 0 | 1 | 63.305 |
3821258 | GT-AG | 0 | 8.369504059119328e-05 | 74 | rna-gnl|I4U23|001612-T1 720734 | 28 | 4990646 | 4990719 | Adineta vaga 104782 | GAG|GTAACAATCA...TAATCTATAATT/CTATAATTGAAT...TTCAG|GTA | 0 | 1 | 64.808 |
3821259 | GT-AG | 0 | 7.983709553601637e-05 | 58 | rna-gnl|I4U23|001612-T1 720734 | 29 | 4990869 | 4990926 | Adineta vaga 104782 | TCC|GTAAGAATTT...TTTGTTTTATTT/GTTTTATTTATT...ATTAG|AAA | 2 | 1 | 66.072 |
3821260 | GT-AG | 0 | 9.104134452542337e-05 | 61 | rna-gnl|I4U23|001612-T1 720734 | 30 | 4991040 | 4991100 | Adineta vaga 104782 | GAG|GTATGTATTT...ATGTGTGTATTT/GATGTGTGTATT...TCTAG|CAG | 1 | 1 | 67.032 |
3821261 | GT-AG | 0 | 0.0014954315137219 | 58 | rna-gnl|I4U23|001612-T1 720734 | 31 | 4991285 | 4991342 | Adineta vaga 104782 | AAA|GTTTGTTTTA...ATTATTTTGCCA/GCCACATTTATT...CAAAG|AAT | 2 | 1 | 68.593 |
3821262 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|001612-T1 720734 | 32 | 4991526 | 4991581 | Adineta vaga 104782 | TCG|GTTAGTTCTG...CAATTCTTAATT/TTAAAATTCATT...TTTAG|TAC | 2 | 1 | 70.147 |
3821263 | GT-AG | 0 | 0.0001803705117628 | 50 | rna-gnl|I4U23|001612-T1 720734 | 33 | 4991702 | 4991751 | Adineta vaga 104782 | TAA|GTAGGTTGTA...GAATTTTTACTC/TTTTTACTCAAT...TTTAG|AAT | 2 | 1 | 71.165 |
3821264 | GT-AG | 0 | 0.0003742569664783 | 58 | rna-gnl|I4U23|001612-T1 720734 | 34 | 4991918 | 4991975 | Adineta vaga 104782 | AAA|GTATTGAAAG...ATTATCTTAAAA/CATGTTTTGATT...TTTAG|CCA | 0 | 1 | 72.574 |
3821265 | GT-AG | 0 | 2.0519513047024065e-05 | 64 | rna-gnl|I4U23|001612-T1 720734 | 35 | 4992149 | 4992212 | Adineta vaga 104782 | TCG|GTAAATATCT...TTGTTCTTTTTT/ATTTCTATCATT...TGTAG|AAA | 2 | 1 | 74.043 |
3821266 | GT-AG | 0 | 0.0002594974656219 | 64 | rna-gnl|I4U23|001612-T1 720734 | 36 | 4992433 | 4992496 | Adineta vaga 104782 | GAA|GTAAGTTTTC...TGTCTATTGATT/TGTCTATTGATT...CGTAG|ATT | 0 | 1 | 75.91 |
3821267 | GT-AG | 0 | 0.0131635132816971 | 62 | rna-gnl|I4U23|001612-T1 720734 | 37 | 4992697 | 4992758 | Adineta vaga 104782 | TAA|GTATGATTTC...TTTCCGTTAAAC/TATGAATTTATT...ATTAG|ATT | 2 | 1 | 77.608 |
3821268 | GT-AG | 0 | 1.3153549536667632e-05 | 128 | rna-gnl|I4U23|001612-T1 720734 | 38 | 4992880 | 4993007 | Adineta vaga 104782 | GTT|GTAAGAACAT...TTTTCTTTGATT/TTTTCTTTGATT...CCAAG|ATT | 0 | 1 | 78.635 |
3821269 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|001612-T1 720734 | 39 | 4993274 | 4993335 | Adineta vaga 104782 | CAA|GTGAGTCTTT...CTTTTTTTGAAT/CTTTTTTTGAAT...ATTAG|ATT | 2 | 1 | 80.893 |
3821270 | GT-AG | 0 | 1.4376994730023475e-05 | 56 | rna-gnl|I4U23|001612-T1 720734 | 40 | 4993498 | 4993553 | Adineta vaga 104782 | GAA|GTAAGTTAAC...CATTTTTCAATC/TATCAATTTACG...TCTAG|AGA | 2 | 1 | 82.268 |
3821271 | GT-AG | 0 | 4.437910745522615e-05 | 49 | rna-gnl|I4U23|001612-T1 720734 | 41 | 4993693 | 4993741 | Adineta vaga 104782 | GAA|GTAATTTCAT...TCTATTTTGAAA/TACTTATTGATC...TTTAG|GAA | 0 | 1 | 83.448 |
3821272 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|001612-T1 720734 | 42 | 4993913 | 4993970 | Adineta vaga 104782 | GAT|GTAAGTGATT...TTTTCATTATCA/CTTGTTTTCATT...AATAG|TCC | 0 | 1 | 84.899 |
3821273 | GT-AG | 0 | 0.0005970607749378 | 63 | rna-gnl|I4U23|001612-T1 720734 | 43 | 4994079 | 4994141 | Adineta vaga 104782 | AAA|GTAAATTTTT...TAAATTTTATTT/TTAAATTTTATT...TCTAG|GTA | 0 | 1 | 85.816 |
3821274 | GT-AG | 0 | 0.0011868429181465 | 52 | rna-gnl|I4U23|001612-T1 720734 | 44 | 4994532 | 4994583 | Adineta vaga 104782 | AAA|GTAAACTTTT...TACAATTTAAAT/TTGTCAATTATT...TCCAG|GTT | 0 | 1 | 89.127 |
3821275 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|001612-T1 720734 | 45 | 4994963 | 4995026 | Adineta vaga 104782 | CAG|GTAAAATAAT...TTTTCCATGATG/CAAATGATCAAT...AACAG|TTT | 1 | 1 | 92.344 |
3821276 | GT-AG | 0 | 9.59719016716376e-05 | 63 | rna-gnl|I4U23|001612-T1 720734 | 46 | 4995189 | 4995251 | Adineta vaga 104782 | CCG|GTAATTATTC...CTATTCTTAAAT/TCTATTCTTAAA...TCTAG|GTT | 1 | 1 | 93.719 |
3821277 | GT-AG | 0 | 9.619373368435212e-05 | 49 | rna-gnl|I4U23|001612-T1 720734 | 47 | 4995640 | 4995688 | Adineta vaga 104782 | AGA|GTAAATATTT...TATGATTTAATT/TATGATTTAATT...TTTAG|AGT | 2 | 1 | 97.012 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);