introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
38 rows where transcript_id = 720737
This data as json, CSV (advanced)
Suggested facets: score, length, phase, in_cds
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821338 | GT-AG | 0 | 0.0001014440577335 | 356 | rna-gnl|I4U23|000529-T1 720737 | 3 | 1978954 | 1979309 | Adineta vaga 104782 | AAA|GTACGTATAT...ATGATGTTAACA/ATGATGTTAACA...TATAG|ACG | 0 | 1 | 9.958 |
3821339 | GT-AG | 0 | 0.0002986800209888 | 1023 | rna-gnl|I4U23|000529-T1 720737 | 4 | 1977894 | 1978916 | Adineta vaga 104782 | ATG|GTATAGATCA...ATTTCTCTAATT/ATTTCTCTAATT...TCTAG|CCA | 1 | 1 | 10.283 |
3821340 | GT-AG | 0 | 0.0001971096625647 | 560 | rna-gnl|I4U23|000529-T1 720737 | 5 | 1977137 | 1977696 | Adineta vaga 104782 | AAT|GTAAGTTCCG...TTTTTCTTCTCT/ATTTTTCTAAAA...TTTAG|ACA | 0 | 1 | 12.014 |
3821341 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|000529-T1 720737 | 6 | 1976145 | 1976203 | Adineta vaga 104782 | CCG|GTAAAGTCGA...AATTTCATAATT/TTTTTCGTCATT...TTCAG|CAT | 0 | 1 | 20.214 |
3821342 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-gnl|I4U23|000529-T1 720737 | 7 | 1975725 | 1975897 | Adineta vaga 104782 | TTG|GTAAGTACAT...TAAGTTTGAATG/TTGCAATTAACT...TTTAG|ATG | 1 | 1 | 22.385 |
3821343 | GT-AG | 0 | 3.342101271363912e-05 | 88 | rna-gnl|I4U23|000529-T1 720737 | 8 | 1975485 | 1975572 | Adineta vaga 104782 | AAA|GTAAATATCA...TTTGTCTTTTTT/AAATAATTCATT...TTTAG|AAT | 0 | 1 | 23.721 |
3821344 | GT-AG | 0 | 0.0074820192715245 | 178 | rna-gnl|I4U23|000529-T1 720737 | 9 | 1974349 | 1974526 | Adineta vaga 104782 | ATC|GTATGTAGAT...AGTCTCTTGAAC/AGTCTCTTGAAC...TTTAG|TTA | 1 | 1 | 32.141 |
3821345 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|000529-T1 720737 | 10 | 1973381 | 1973436 | Adineta vaga 104782 | AAG|GTAAGTGATC...CATTCTATATCA/TATCATTTCATT...TCTAG|ATT | 1 | 1 | 40.156 |
3821346 | GT-AG | 0 | 0.0001030678153742 | 64 | rna-gnl|I4U23|000529-T1 720737 | 11 | 1973210 | 1973273 | Adineta vaga 104782 | CAA|GTAAATTCTT...TTTCTATTAATT/TTTCTATTAATT...CCTAG|GAT | 0 | 1 | 41.097 |
3821347 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|000529-T1 720737 | 12 | 1972966 | 1973019 | Adineta vaga 104782 | AAG|GTAAAAATTC...TAATTCTTTTTT/CATTGATTAATT...TTCAG|AGG | 1 | 1 | 42.767 |
3821348 | GT-AG | 0 | 0.0001206723426297 | 64 | rna-gnl|I4U23|000529-T1 720737 | 13 | 1972513 | 1972576 | Adineta vaga 104782 | AAA|GTTTGTTCAT...CTTCTATTAACT/CTTCTATTAACT...CGTAG|GGC | 0 | 1 | 46.186 |
3821349 | GT-AG | 0 | 0.0053504559425914 | 65 | rna-gnl|I4U23|000529-T1 720737 | 14 | 1971668 | 1971732 | Adineta vaga 104782 | GCG|GTATGTTATC...TTTTTTTTGTTC/ATTGAGTTAATT...TTCAG|GTT | 0 | 1 | 53.041 |
3821350 | GT-AG | 0 | 0.0219403608220482 | 56 | rna-gnl|I4U23|000529-T1 720737 | 15 | 1971466 | 1971521 | Adineta vaga 104782 | GAA|GTATGTCTTA...TTTTCTTTGTTC/AAATCAATCAAT...TTTAG|AAT | 2 | 1 | 54.324 |
3821351 | GT-AG | 0 | 0.000122005704933 | 56 | rna-gnl|I4U23|000529-T1 720737 | 16 | 1971160 | 1971215 | Adineta vaga 104782 | GAA|GTAAAATCTT...ATTTTTTTAATT/ATTTTTTTAATT...TTTAG|ACG | 0 | 1 | 56.521 |
3821352 | GT-AG | 0 | 0.012999652466682 | 58 | rna-gnl|I4U23|000529-T1 720737 | 17 | 1971062 | 1971119 | Adineta vaga 104782 | CAC|GTATAATTTA...TTTTCTTTTTCA/TTCTTTTTCAAT...TGTAG|GTA | 1 | 1 | 56.873 |
3821353 | GT-AG | 0 | 0.0065067893896614 | 54 | rna-gnl|I4U23|000529-T1 720737 | 18 | 1970776 | 1970829 | Adineta vaga 104782 | TCT|GTTTTTTCAT...CTATTCTTCATT/CTATTCTTCATT...TTCAG|AAT | 2 | 1 | 58.912 |
3821354 | GT-AG | 0 | 0.0014211092448276 | 62 | rna-gnl|I4U23|000529-T1 720737 | 19 | 1970384 | 1970445 | Adineta vaga 104782 | TCG|GTTTGTTGAA...TTTTTCTTATTT/GTTTTTCTTATT...TATAG|AAA | 2 | 1 | 61.812 |
3821355 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|000529-T1 720737 | 20 | 1970234 | 1970290 | Adineta vaga 104782 | TTG|GTAATTGATT...TCTGTTTTAATC/TCTGTTTTAATC...TTTAG|GAC | 2 | 1 | 62.63 |
3821356 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|000529-T1 720737 | 21 | 1969878 | 1969942 | Adineta vaga 104782 | TAA|GTAAGTGATA...TTTTTTTTCTTT/TAAAAACTGATG...TGTAG|AAG | 2 | 1 | 65.187 |
3821357 | GT-AG | 0 | 3.755159858756654e-05 | 86 | rna-gnl|I4U23|000529-T1 720737 | 22 | 1969467 | 1969552 | Adineta vaga 104782 | CAG|GTTTGTTTGA...TAATTTATAATA/ATTTGAATAATT...TTTAG|GAT | 0 | 1 | 68.044 |
3821358 | GT-AG | 0 | 2.2861386847531656e-05 | 57 | rna-gnl|I4U23|000529-T1 720737 | 23 | 1968693 | 1968749 | Adineta vaga 104782 | TAT|GTAAGTTATC...AGTTTTTTGTTC/GTAAATTTCATC...TATAG|CGT | 0 | 1 | 74.345 |
3821359 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|000529-T1 720737 | 24 | 1968517 | 1968569 | Adineta vaga 104782 | CAA|GTAAGATATT...ATTTCATTATTG/GCAAATTTCATT...TCTAG|GAA | 0 | 1 | 75.426 |
3821360 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|000529-T1 720737 | 25 | 1968354 | 1968411 | Adineta vaga 104782 | GAA|GTAAGAAATC...TTTTTTTTGATG/TTTTTTTTGATG...TCTAG|ACT | 0 | 1 | 76.349 |
3821361 | GT-AG | 0 | 0.0021487126065872 | 53 | rna-gnl|I4U23|000529-T1 720737 | 26 | 1967696 | 1967748 | Adineta vaga 104782 | TTT|GTAAGTTTAT...TCCTTCTTATCG/AATTATTTGATT...TTTAG|ACG | 2 | 1 | 81.666 |
3821362 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|000529-T1 720737 | 27 | 1967447 | 1967498 | Adineta vaga 104782 | GAG|GTAGGAAAAT...TTATCTTTTTTT/AAATCATTTATC...TAAAG|GAT | 1 | 1 | 83.398 |
3821363 | GT-AG | 0 | 1.2359919860935172e-05 | 60 | rna-gnl|I4U23|000529-T1 720737 | 28 | 1967282 | 1967341 | Adineta vaga 104782 | CGA|GTTAGTTTTT...TTCTTCTTATGT/TTTCTTCTTATG...TTTAG|GTA | 1 | 1 | 84.321 |
3821364 | GT-AG | 0 | 4.942518363821918e-05 | 51 | rna-gnl|I4U23|000529-T1 720737 | 29 | 1967119 | 1967169 | Adineta vaga 104782 | TAT|GTAAGTTATT...GGATTTTCGATA/GATAGTTTCATA...TTTAG|ACA | 2 | 1 | 85.305 |
3821365 | GT-AG | 0 | 0.0117559664065535 | 60 | rna-gnl|I4U23|000529-T1 720737 | 30 | 1966963 | 1967022 | Adineta vaga 104782 | TCA|GTATGTGTTT...TCATTTTTATAG/GATATTTTCATG...TTTAG|ATT | 2 | 1 | 86.149 |
3821366 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|000529-T1 720737 | 31 | 1966875 | 1966933 | Adineta vaga 104782 | TTG|GTAAGTACAT...TTCTTTTTGAAA/TTCTTTTTGAAA...TTTAG|ATT | 1 | 1 | 86.404 |
3821367 | GT-AG | 0 | 1.6781246144367076e-05 | 50 | rna-gnl|I4U23|000529-T1 720737 | 32 | 1966664 | 1966713 | Adineta vaga 104782 | ACT|GTAAGTTAAT...TGTATTTTTCTA/GAATAGTTGATA...CTTAG|ATT | 0 | 1 | 87.819 |
3821368 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|I4U23|000529-T1 720737 | 33 | 1966435 | 1966501 | Adineta vaga 104782 | CAA|GTAAGAAATT...AGTCCATTGACT/CATTGACTTACT...TTCAG|TTA | 0 | 1 | 89.242 |
3821369 | GT-AG | 0 | 0.0906311213870943 | 67 | rna-gnl|I4U23|000529-T1 720737 | 34 | 1966074 | 1966140 | Adineta vaga 104782 | AGT|GTATTTTATT...TTTCTCTTTTTT/ATTACTTTTATG...TATAG|GGT | 0 | 1 | 91.826 |
3821370 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-gnl|I4U23|000529-T1 720737 | 35 | 1965894 | 1965941 | Adineta vaga 104782 | ATG|GTAAAATAAT...TTTTCTTTGTCA/TTCTTTGTCACA...TTTAG|ATC | 0 | 1 | 92.986 |
3821371 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|000529-T1 720737 | 36 | 1965627 | 1965680 | Adineta vaga 104782 | CCT|GTAAGTAAAA...TTTTCTTTCATA/TTTTCTTTCATA...TTTAG|CAA | 0 | 1 | 94.858 |
3821372 | GT-AG | 0 | 0.000101376791671 | 49 | rna-gnl|I4U23|000529-T1 720737 | 37 | 1965419 | 1965467 | Adineta vaga 104782 | CAA|GTAAATCTTT...TAAGTTTTATCG/TTAAGTTTTATC...TGTAG|TTA | 0 | 1 | 96.256 |
3821373 | GT-AG | 0 | 1.4657917319067456e-05 | 54 | rna-gnl|I4U23|000529-T1 720737 | 38 | 1965079 | 1965132 | Adineta vaga 104782 | ATG|GTAAAATTGA...TATTCATTAATT/CCAATATTCATT...TCTAG|ATA | 1 | 1 | 98.77 |
3842419 | GT-AG | 0 | 1.000000099473604e-05 | 69 | rna-gnl|I4U23|000529-T1 720737 | 1 | 1980903 | 1980971 | Adineta vaga 104782 | AGT|GTAATGAAAA...ATATCATTGATT/ATATCATTGATT...TTTAG|TGT | 0 | 3.595 | |
3842420 | GT-AG | 0 | 1.000000099473604e-05 | 157 | rna-gnl|I4U23|000529-T1 720737 | 2 | 1980172 | 1980328 | Adineta vaga 104782 | ATG|GTAAGATTCA...CGTATTTTAATC/TTTAATCTAATT...ATTAG|ATT | 0 | 8.639 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);