introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
50 rows where transcript_id = 3485097
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
17202984 | GT-AG | 0 | 1.000000099473604e-05 | 1609 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 48 | 132061 | 133669 | Armadillidium vulgare 13347 | GCG|GTAAGTGAAA...TAATTCGTTTCT/ATAATAATAATT...TCTAG|AAT | 0 | 1 | 95.119 |
17202985 | GT-AG | 0 | 1.000000099473604e-05 | 505 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 49 | 133853 | 134357 | Armadillidium vulgare 13347 | GAA|GTAAGAATAA...TTTTTTGTATTA/AAGTTTCTAATA...TACAG|ATC | 0 | 1 | 97.003 |
17202986 | GT-AG | 0 | 1.000000099473604e-05 | 959 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 50 | 134484 | 135442 | Armadillidium vulgare 13347 | AAG|GTAAATCATT...TTTTTTTTAAAA/AATGTTCTTACA...TTCAG|TCT | 0 | 1 | 98.301 |
17202987 | GT-AG | 0 | 0.004140556711115 | 187 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 1 | 94753 | 94939 | Armadillidium vulgare 13347 | TCT|GTAAGTTTTT...TCTTTCCTAATA/TCTTTCCTAATA...AGCAG|AAG | 0 | 1.606 | |
17202988 | GT-AG | 0 | 0.0001716510893382 | 264 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 2 | 95064 | 95327 | Armadillidium vulgare 13347 | TCG|GTAGGTATTG...TGTTTTTTATTG/GTGTTTTTTATT...AACAG|AGA | 0 | 2.883 | |
17202989 | GT-TG | 0 | 0.0058459010802162 | 508 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 3 | 95508 | 96015 | Armadillidium vulgare 13347 | GAG|GTATCAGTCA...GTAAATTTAATG/TGTAAATTTAAT...ATTTG|CCT | 0 | 4.737 | |
17202990 | GT-AG | 0 | 0.0109861604735161 | 175 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 4 | 96121 | 96295 | Armadillidium vulgare 13347 | CAG|GTACTCTTTC...AATATTTTATAT/TAATATTTTATA...TATAG|AAT | 0 | 5.818 | |
17202991 | GT-AG | 0 | 1.000000099473604e-05 | 211 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 5 | 96435 | 96645 | Armadillidium vulgare 13347 | GAG|GTTTGAACTA...ACATTATTAACT/ACATTATTAACT...ATCAG|ATC | 0 | 7.25 | |
17202992 | GT-AG | 0 | 1.000000099473604e-05 | 289 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 6 | 96823 | 97111 | Armadillidium vulgare 13347 | AAG|GTAGGCGAAT...CATGCTTTATTA/CAAAAACTTATT...TTCAG|GAG | 0 | 9.072 | |
17202993 | GT-AG | 0 | 1.000000099473604e-05 | 158 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 7 | 97316 | 97473 | Armadillidium vulgare 13347 | CAG|GTAAATATAT...TATATATTATAT/ATATATATTATA...GTTAG|GTG | 0 | 11.173 | |
17202994 | GT-AG | 0 | 1.000000099473604e-05 | 822 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 8 | 97595 | 98416 | Armadillidium vulgare 13347 | CAA|GTAAGATATA...AACTCCTTTGTT/CGGATGTTAACT...TACAG|GGG | 0 | 12.419 | |
17202995 | GT-AG | 0 | 1.000000099473604e-05 | 228 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 9 | 98706 | 98933 | Armadillidium vulgare 13347 | AAG|GTAATCCAAT...AAGATTTTATGG/TTGTAAATTATT...GACAG|ATC | 0 | 15.395 | |
17202996 | GT-AG | 0 | 1.000000099473604e-05 | 507 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 10 | 99117 | 99623 | Armadillidium vulgare 13347 | AAG|GTAATTATTC...TAATTTATACTT/TAATAATTTATA...TTTAG|GTT | 0 | 17.279 | |
17202997 | GT-AG | 0 | 1.000000099473604e-05 | 806 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 11 | 99808 | 100613 | Armadillidium vulgare 13347 | AAG|GTGAAGAATC...CCCTCCTTAAAA/TTTTTCTTTATG...CTCAG|CCG | 0 | 19.174 | |
17202998 | GT-AG | 0 | 0.0002787420444148 | 92 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 12 | 100737 | 100828 | Armadillidium vulgare 13347 | AAG|GTAAACTTAC...ATAACTTTATTT/TCTTGTCTAATC...TTTAG|GAC | 0 | 20.441 | |
17202999 | GT-AG | 0 | 1.000000099473604e-05 | 108 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 13 | 100997 | 101104 | Armadillidium vulgare 13347 | TAG|GTAAAGATTT...ATTCTTTTAATG/CTTTTTTTCACT...TTTAG|TTT | 0 | 22.171 | |
17203000 | GT-AG | 0 | 0.0001666913263659 | 629 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 14 | 101243 | 101871 | Armadillidium vulgare 13347 | CAG|GTAACGTTGC...TATTATTTAAAT/TATTATTTAAAT...TGTAG|GGT | 0 | 23.592 | |
17203001 | GT-AG | 0 | 0.0027118438900494 | 173 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 15 | 101992 | 102164 | Armadillidium vulgare 13347 | CAG|GTATTGTCAA...AATTCCTTAATG/TAATTCCTTAAT...AACAG|AAG | 0 | 24.828 | |
17203002 | GT-AG | 0 | 0.0004757821851049 | 138 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 16 | 102345 | 102482 | Armadillidium vulgare 13347 | ACC|GTAAGTTCAT...TTTTCTTTAAAT/TAATTTCTTATT...TATAG|GTT | 0 | 26.681 | |
17203003 | GT-AG | 0 | 1.9001180164845065e-05 | 215 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 17 | 102616 | 102830 | Armadillidium vulgare 13347 | AGT|GTAAGTACTA...TATTCATTATTT/AATTTATTCATT...TTTAG|GTT | 0 | 28.051 | |
17203004 | GT-AG | 0 | 0.001445719228711 | 87 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 18 | 102973 | 103059 | Armadillidium vulgare 13347 | AGG|GTATTTATCT...CGATTTATAACA/AAGTTTATCACC...TTTAG|GAA | 0 | 29.513 | |
17203005 | GT-AG | 0 | 1.000000099473604e-05 | 1099 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 19 | 103269 | 104367 | Armadillidium vulgare 13347 | TCA|GTAAGTTACA...ATATATATATTG/TATATATATATT...TGTAG|ATT | 0 | 31.665 | |
17203006 | GT-AG | 0 | 0.0109923526474131 | 489 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 20 | 104586 | 105074 | Armadillidium vulgare 13347 | GAA|GTATGTGTAA...ATATCTTTAATC/ATATCTTTAATC...TCTAG|TAA | 0 | 33.91 | |
17203007 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 21 | 105417 | 105518 | Armadillidium vulgare 13347 | ATA|GTAAGAATTA...TATGTTATAAAG/CACAGACTAAAT...TGCAG|GGC | 0 | 37.432 | |
17203008 | GT-AG | 0 | 1.000000099473604e-05 | 153 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 22 | 105880 | 106032 | Armadillidium vulgare 13347 | CAA|GTAAAATAAA...ATTATCGTAATA/ATAAGTCTCATA...AATAG|GTG | 0 | 41.149 | |
17203009 | GT-AG | 0 | 0.0077958628460203 | 518 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 23 | 106513 | 107030 | Armadillidium vulgare 13347 | ACG|GTATGTATTT...CGTACTTTGATA/TTGATACTTATT...ATTAG|GTG | 0 | 46.092 | |
17203010 | GA-AG | 0 | 0.0011515847280079 | 3830 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 24 | 107138 | 110967 | Armadillidium vulgare 13347 | CAG|GAATATATAT...TAATATTTAATA/CTGTATTTAATA...TTCAG|AGA | 0 | 47.194 | |
17203011 | GT-AG | 0 | 3.303157680445547e-05 | 1734 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 25 | 111310 | 113043 | Armadillidium vulgare 13347 | TTT|GTAAGTTTAA...AATATAATAATA/AATATAATAATA...TGCAG|CTG | 0 | 50.716 | |
17203012 | TT-AA | 0 | 1.000000099473604e-05 | 238 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 26 | 113240 | 113477 | Armadillidium vulgare 13347 | AAT|TTCAGTTCCA...TTATTATTATTA/CTTATTATTATT...ATTAA|AAG | 0 | 52.734 | |
17203013 | GT-AG | 0 | 1.000000099473604e-05 | 154 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 27 | 113659 | 113812 | Armadillidium vulgare 13347 | CAG|GTAAACAAAT...TATGTATTAATT/TATGTATTAATT...TTCAG|GTG | 0 | 54.598 | |
17203014 | GT-AG | 0 | 0.0001568904361277 | 484 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 28 | 113993 | 114476 | Armadillidium vulgare 13347 | ATA|GTAAGTTGAA...TTTTTTTTAATG/TTTTTTTTAATG...ATTAG|GAT | 0 | 56.451 | |
17203015 | GT-AG | 0 | 1.000000099473604e-05 | 210 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 29 | 114689 | 114898 | Armadillidium vulgare 13347 | CAG|GTAATTCACA...TATATTTTATTT/ATTTTATTTATT...TTAAG|GTT | 0 | 58.635 | |
17203016 | GT-AG | 0 | 1.000000099473604e-05 | 406 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 30 | 115035 | 115440 | Armadillidium vulgare 13347 | AAG|GTGACAAATT...ATATATATATAT/TATATATATATA...TAAAG|GTT | 0 | 60.035 | |
17203017 | GT-AG | 0 | 0.4541027664089637 | 395 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 31 | 115586 | 115980 | Armadillidium vulgare 13347 | ATG|GTATTCTACA...TTTTCTTTAAAA/ATAAATTTCAAT...AACAG|GAT | 0 | 61.528 | |
17203018 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 32 | 116167 | 116424 | Armadillidium vulgare 13347 | GAA|GTAAGATTCA...ATATATATATAT/TATATATATATA...TTTAG|ATT | 0 | 63.444 | |
17203019 | GT-AG | 0 | 1.000000099473604e-05 | 2995 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 33 | 116551 | 119545 | Armadillidium vulgare 13347 | AAG|GTTGGAAAGA...TATATATTGATT/TATATATTGATT...TTTAG|TCT | 0 | 64.741 | |
17203020 | GT-AG | 0 | 0.0065973394611582 | 193 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 34 | 119841 | 120033 | Armadillidium vulgare 13347 | CAG|GTATTTTTTT...AATATTTTCATT/AATATTTTCATT...TTCAG|ATG | 0 | 67.779 | |
17203021 | GT-AG | 0 | 0.0149067074394071 | 308 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 35 | 120214 | 120521 | Armadillidium vulgare 13347 | ATA|GTATGTATTA...ATTCTATTATTA/AATTCTATTATT...TACAG|ATT | 0 | 69.632 | |
17203022 | GT-AG | 0 | 1.000000099473604e-05 | 145 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 36 | 120734 | 120878 | Armadillidium vulgare 13347 | CAG|GTAAGAAGCC...GATTTTTTAAAA/GATTTTTTAAAA...ATTAG|ACT | 0 | 71.815 | |
17203023 | GT-AG | 0 | 0.0006316460329964 | 1094 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 37 | 121015 | 122108 | Armadillidium vulgare 13347 | ATG|GTATTATATA...ATGTATTTAATT/ATTTAATTAATT...TTCAG|CTT | 0 | 73.216 | |
17203024 | GT-AG | 0 | 1.000000099473604e-05 | 749 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 38 | 122252 | 123000 | Armadillidium vulgare 13347 | TTG|GTAATATATA...CAATTCTAAATA/CCAATTCTAAAT...TTCAG|AAC | 0 | 74.688 | |
17203025 | GT-AG | 0 | 0.0012907383775483 | 284 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 39 | 123184 | 123467 | Armadillidium vulgare 13347 | GAG|GTTGCTTTAT...TTATTATTAATA/TTATTATTAATA...GTCAG|ATC | 0 | 76.573 | |
17203026 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 40 | 123594 | 123710 | Armadillidium vulgare 13347 | AAG|GTAAGTGTTT...GTAATCTTACGT/AATTAACTAAAT...TTGAG|GCT | 0 | 77.87 | |
17203027 | GT-AG | 0 | 0.0003280260945599 | 94 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 41 | 123889 | 123982 | Armadillidium vulgare 13347 | AAG|GTATTTCATA...ATTTATTTAATA/TATATATTTATT...AACAG|GAC | 0 | 79.703 | |
17203028 | GT-AG | 0 | 6.787852884266155e-05 | 2971 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 42 | 124279 | 127249 | Armadillidium vulgare 13347 | ATG|GTAAGCTTTA...ATATATTTATAT/TATATATTTATA...AATAG|GGT | 0 | 82.752 | |
17203029 | GT-AG | 0 | 1.0608333526773928e-05 | 283 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 43 | 127535 | 127817 | Armadillidium vulgare 13347 | CAA|GTTCGTCATA...ACGTTTTTAACA/ACGTTTTTAACA...TTTAG|GGA | 0 | 85.686 | |
17203030 | GT-TG | 0 | 1.000000099473604e-05 | 791 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 44 | 128127 | 128917 | Armadillidium vulgare 13347 | AAG|GTAAATTAAA...ATATATATAATA/ATATATATAATA...AAGTG|GTT | 0 | 88.868 | |
17203031 | GT-AG | 0 | 1.000000099473604e-05 | 494 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 45 | 129087 | 129580 | Armadillidium vulgare 13347 | ACA|GTGAGCATAT...TATATATTGTTT/TATATATATATT...TTAAG|AGT | 0 | 90.609 | |
17203032 | GT-AG | 0 | 2.1951236983868737e-05 | 1685 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 46 | 129793 | 131477 | Armadillidium vulgare 13347 | CAG|GTAATTTCTT...TTTTTTTTATAT/TTTTTTTTTATA...TTTAG|GTT | 0 | 92.792 | |
17203033 | GT-AG | 0 | 1.000000099473604e-05 | 302 | rna-gnl|WGS:SAUD|Avbf_03612-RA_mrna 3485097 | 47 | 131614 | 131915 | Armadillidium vulgare 13347 | CAG|GTTTGTAAAT...ATATATATAATA/TATAATATAATT...GACAG|GTT | 0 | 94.192 |
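The rows above are consistent with start and end being inclusive coordinates, so that length equals end - start + 1. The following is a minimal sketch that checks this on a handful of rows copied from the table, lists the sampled introns whose dinucleotide_pair is not GT-AG, and derives the size of the gap between introns 48 and 49. The tuple layout and the names `ROWS` and `inclusive_length` are illustrative assumptions, not part of the database.

```python
# Values copied from the table above:
# (id, dinucleotide_pair, length, ordinal_index, start, end)
ROWS = [
    (17202984, "GT-AG", 1609, 48, 132061, 133669),
    (17202985, "GT-AG", 505, 49, 133853, 134357),
    (17202986, "GT-AG", 959, 50, 134484, 135442),
    (17202987, "GT-AG", 187, 1, 94753, 94939),
    (17202989, "GT-TG", 508, 3, 95508, 96015),
    (17203010, "GA-AG", 3830, 24, 107138, 110967),
    (17203012, "TT-AA", 238, 26, 113240, 113477),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


# Ids of sampled rows where the stored length disagrees with the coordinates.
mismatches = [r[0] for r in ROWS if inclusive_length(r[4], r[5]) != r[2]]

# Sampled introns whose terminal dinucleotides are anything other than GT-AG.
non_gt_ag = [(r[0], r[1]) for r in ROWS if r[1] != "GT-AG"]

# Bases strictly between the end of intron 48 and the start of intron 49.
# Assumption: this gap corresponds to the exon separating the two introns.
by_ordinal = {r[3]: r for r in ROWS}
gap_48_49 = by_ordinal[49][4] - by_ordinal[48][5] - 1

print(mismatches)
print(non_gt_ag)
print(gap_48_49)
```

Only seven of the 50 rows are sampled here, so an empty `mismatches` list supports the inclusive-coordinate reading for those rows rather than proving it for the whole table.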
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
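As a rough illustration of how the "rows where transcript_id = 3485097" view could be reproduced against a local copy, the sketch below builds a trimmed version of the table in an in-memory SQLite database and queries it with Python's standard `sqlite3` module. The reduced column set, the omitted foreign keys, and the three sample rows (copied from the listing) are simplifications made for this example, not the database's actual setup.

```python
import sqlite3

# Trimmed schema: a subset of the columns, without the foreign keys,
# so the example runs without the transcripts and genomes tables.
DDL = """
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    dinucleotide_pair TEXT,
    is_minor INTEGER,
    length INTEGER,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    taxonomy_id INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
"""

# Three rows copied from the listing (ordinal positions 48, 49 and 1).
SAMPLE = [
    (17202984, "GT-AG", 0, 1609, 3485097, 48, 132061, 133669, 13347),
    (17202985, "GT-AG", 0, 505, 3485097, 49, 133853, 134357, 13347),
    (17202987, "GT-AG", 0, 187, 3485097, 1, 94753, 94939, 13347),
]

conn = sqlite3.connect(":memory:")
conn.executescript(DDL)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", SAMPLE)

# Same filter as the page, but ordered by position within the transcript
# instead of by id. "start" and "end" are quoted because END is an SQL keyword.
rows = conn.execute(
    'SELECT ordinal_index, "start", "end", length '
    "FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (3485097,),
).fetchall()
conn.close()

for row in rows:
    print(row)
```

Ordering by ordinal_index rather than id puts the introns in transcript order, which the id-sorted listing does not (ids 17202984 to 17202986 hold ordinal positions 48 to 50).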