introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
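The column descriptions above imply a few simple invariants that can be checked per row. The following is a minimal sketch, not part of the database itself: the `Intron` class and `check_row` helper are hypothetical names, only a subset of the columns is modelled, and the check on `length` assumes that `start` and `end` are 1-based inclusive coordinates (an assumption that happens to hold for the rows listed on this page).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Subset of one introns row; field names follow the column names."""
    id: int
    dinucleotide_pair: str
    is_minor: int          # 1 = minor intron, 0 = not
    score: float           # probability of being minor, in percent (0-100)
    length: int            # base pairs
    start: int
    end: int
    phase: Optional[int]   # 0, 1, 2, or None outside the coding sequence
    in_cds: int            # 1 = within the coding sequence, 0 = not


def check_row(row: Intron) -> list[str]:
    """Return a list of problems; an empty list means the row looks consistent."""
    problems = []
    # Assumption: start and end are inclusive coordinates.
    if row.end - row.start + 1 != row.length:
        problems.append("length does not equal end - start + 1")
    if not 0.0 <= row.score <= 100.0:
        problems.append("score outside 0-100")
    if row.is_minor not in (0, 1) or row.in_cds not in (0, 1):
        problems.append("flag column is not 0 or 1")
    if row.phase not in (None, 0, 1, 2):
        problems.append("phase is not 0, 1, 2, or null")
    return problems


# Values taken from the first listed row (id 31221885).
first = Intron(31221885, "GT-AG", 0, 1.000000099473604e-05, 19362,
               133614177, 133633538, 1, 1)
print(check_row(first))  # prints []
```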
40 rows where transcript_id = 6061972
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
31221885 | GT-AG | 0 | 1.000000099473604e-05 | 19362 | rna-XM_030448987.1 6061972 | 1 | 133614177 | 133633538 | Calypte anna 9244 | AAG|GTGAGACCCA...TTTCTCTTAAGT/TTTTCTCTTAAG...CTCAG|CTT | 1 | 1 | 2.974 |
31221886 | GT-AG | 0 | 0.0003569643976575 | 10548 | rna-XM_030448987.1 6061972 | 2 | 133633601 | 133644148 | Calypte anna 9244 | CAG|GTATGTAGGC...AGTGTCTTAAAT/AGTGTCTTAAAT...TACAG|ACT | 0 | 1 | 4.065 |
31221887 | GT-AG | 0 | 1.1422582718819776e-05 | 6186 | rna-XM_030448987.1 6061972 | 3 | 133644190 | 133650375 | Calypte anna 9244 | AAG|GTAAGCATCA...GTATCTTTTACA/AGCATTTTCATT...TGCAG|TGT | 2 | 1 | 4.787 |
31221888 | GT-AG | 0 | 0.0007037919112662 | 5764 | rna-XM_030448987.1 6061972 | 4 | 133650445 | 133656208 | Calypte anna 9244 | ACA|GTAAGTTGTT...GACATCTTAACA/GACATCTTAACA...GACAG|GCC | 2 | 1 | 6.001 |
31221889 | GT-AG | 0 | 6.046399947307861e-05 | 836 | rna-XM_030448987.1 6061972 | 5 | 133656333 | 133657168 | Calypte anna 9244 | AGG|GTAAGTTATT...GCTTCCTTACTT/GTTTTGCTTATT...TCCAG|AAA | 0 | 1 | 8.184 |
31221890 | GT-AG | 0 | 1.000000099473604e-05 | 985 | rna-XM_030448987.1 6061972 | 6 | 133657286 | 133658270 | Calypte anna 9244 | CAG|GTAAGAAAAC...TTTTCTTTTTCT/AACTATCTAAGC...ATCAG|GTG | 0 | 1 | 10.243 |
31221891 | GT-AG | 0 | 1.000000099473604e-05 | 679 | rna-XM_030448987.1 6061972 | 7 | 133658394 | 133659072 | Calypte anna 9244 | CAG|GTGAGCCTAA...ATCCCTTTAAAG/GAGTTGTTAATG...CCCAG|GTG | 0 | 1 | 12.408 |
31221892 | GT-AG | 0 | 1.000000099473604e-05 | 2540 | rna-XM_030448987.1 6061972 | 8 | 133659154 | 133661693 | Calypte anna 9244 | TGG|GTGAGTAGTC...TGTGCTTTGATC/TTTTTTGTAATT...ATTAG|GAA | 0 | 1 | 13.833 |
31221893 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-XM_030448987.1 6061972 | 9 | 133661845 | 133661956 | Calypte anna 9244 | CAG|GTTTGTAGAA...TTTTTCTTTTTT/AACTTTTTAAGT...TAAAG|CAC | 1 | 1 | 16.491 |
31221894 | GT-AG | 0 | 0.0001251524807788 | 931 | rna-XM_030448987.1 6061972 | 10 | 133662014 | 133662944 | Calypte anna 9244 | AAG|GTAAACTGTT...TGTTTTTTGTTC/AATAGCCTAACA...AACAG|ATC | 1 | 1 | 17.494 |
31221895 | GT-AG | 0 | 0.001872372279336 | 4352 | rna-XM_030448987.1 6061972 | 11 | 133663007 | 133667358 | Calypte anna 9244 | ACA|GTAAGTTTTA...CATGTTTTGATT/CATGTTTTGATT...TGCAG|AGT | 0 | 1 | 18.585 |
31221896 | GT-AG | 0 | 1.000000099473604e-05 | 1892 | rna-XM_030448987.1 6061972 | 12 | 133667468 | 133669359 | Calypte anna 9244 | ATG|GTAAGTAACT...TTTTTTTTGTCT/GTCTGCTTCAAC...CACAG|CAT | 1 | 1 | 20.503 |
31221897 | GT-AG | 0 | 0.0002873960230132 | 787 | rna-XM_030448987.1 6061972 | 13 | 133669469 | 133670255 | Calypte anna 9244 | TGG|GTAAGCTTAT...AAGTCCTAATTT/AAAGTCCTAATT...AATAG|ATA | 2 | 1 | 22.422 |
31221898 | GT-AG | 0 | 0.0001512619069713 | 100 | rna-XM_030448987.1 6061972 | 14 | 133670410 | 133670509 | Calypte anna 9244 | AAA|GTAAGCCCCA...ACTTTCTTAAAT/TCATTATTCACT...TCTAG|GTC | 0 | 1 | 25.132 |
31221899 | GT-AG | 0 | 0.0001167709302544 | 310 | rna-XM_030448987.1 6061972 | 15 | 133670681 | 133670990 | Calypte anna 9244 | GAT|GTAAGTATTT...ATACCTTTATTA/TTGTATTTAACT...TCTAG|TAC | 0 | 1 | 28.141 |
31221900 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_030448987.1 6061972 | 16 | 133671102 | 133671187 | Calypte anna 9244 | GAG|GTAAGGATGG...CCTCTCTTGTTA/TTGTTACTGAAA...CCAAG|CTG | 0 | 1 | 30.095 |
31221901 | GT-AG | 0 | 1.000000099473604e-05 | 467 | rna-XM_030448987.1 6061972 | 17 | 133671323 | 133671789 | Calypte anna 9244 | TCA|GTAAGTAATC...GACTCTTTTTCC/TTTCTATTCAGT...TACAG|GTA | 0 | 1 | 32.471 |
31221902 | GT-AG | 0 | 0.2106515494465013 | 1262 | rna-XM_030448987.1 6061972 | 18 | 133671895 | 133673156 | Calypte anna 9244 | GAG|GTATATTTGT...ATTTTCTTGATG/TTTTTTTTCAAA...TCCAG|ATT | 0 | 1 | 34.319 |
31221903 | GT-AG | 0 | 1.000000099473604e-05 | 4265 | rna-XM_030448987.1 6061972 | 19 | 133673244 | 133677508 | Calypte anna 9244 | CCG|GTAAGAGGGT...AATTCCTCACCT/TGCTTTTTCATC...TTTAG|GGG | 0 | 1 | 35.85 |
31221904 | GT-AG | 0 | 0.0001268390987553 | 148 | rna-XM_030448987.1 6061972 | 20 | 133677662 | 133677809 | Calypte anna 9244 | AAG|GTTTGTTTAT...GCTCTTTTACCT/ATTATATTTACT...TGCAG|ATT | 0 | 1 | 38.543 |
31221905 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-XM_030448987.1 6061972 | 21 | 133677886 | 133677984 | Calypte anna 9244 | AAT|GTAAGTAACT...TTTTCCATGAAA/TTGTTTTCCATG...TAAAG|CGG | 1 | 1 | 39.88 |
31221906 | GT-AG | 0 | 1.000000099473604e-05 | 1051 | rna-XM_030448987.1 6061972 | 22 | 133678103 | 133679153 | Calypte anna 9244 | TAG|GTAAGGATTT...TAAGTTTTAAAA/TTCAATCTTACT...TCCAG|ACT | 2 | 1 | 41.957 |
31221907 | GT-AG | 0 | 0.0003360772513881 | 100 | rna-XM_030448987.1 6061972 | 23 | 133679293 | 133679392 | Calypte anna 9244 | TCA|GTAAGTTATT...ATTACATTAACC/ATTACATTAACC...AATAG|GAA | 0 | 1 | 44.403 |
31221908 | GT-AG | 0 | 1.000000099473604e-05 | 409 | rna-XM_030448987.1 6061972 | 24 | 133679503 | 133679911 | Calypte anna 9244 | ACG|GTAAGTATTG...ATGTCATTATCT/TCTTTCCTGACC...TTCAG|GTT | 2 | 1 | 46.339 |
31221909 | GT-AG | 0 | 0.0001374997448792 | 983 | rna-XM_030448987.1 6061972 | 25 | 133679979 | 133680961 | Calypte anna 9244 | TGT|GTAAGTATTT...TTTTTCCTACTT/ATTTTTCCTACT...TAAAG|GTC | 0 | 1 | 47.518 |
31221910 | GT-AG | 0 | 0.0001403153545529 | 663 | rna-XM_030448987.1 6061972 | 26 | 133681114 | 133681776 | Calypte anna 9244 | CAG|GTATGATAGA...TTTTTCTTTATG/TTTTTCTTTATG...CACAG|AAA | 2 | 1 | 50.194 |
31221911 | GT-AG | 0 | 1.000000099473604e-05 | 1263 | rna-XM_030448987.1 6061972 | 27 | 133681920 | 133683182 | Calypte anna 9244 | AAA|GTGAGTACTA...TATCTGTTAGCA/ACAGCACTAATT...TCTAG|AAA | 1 | 1 | 52.71 |
31221912 | GT-AG | 0 | 1.000000099473604e-05 | 1155 | rna-XM_030448987.1 6061972 | 28 | 133683370 | 133684524 | Calypte anna 9244 | ATT|GTGAGTGTTT...ATGCTGTTGATG/GAAGTTCTAACG...TTCAG|GTT | 2 | 1 | 56.001 |
31221913 | GT-AG | 0 | 1.000000099473604e-05 | 3310 | rna-XM_030448987.1 6061972 | 29 | 133684711 | 133688020 | Calypte anna 9244 | GAG|GTAGTGTGTA...TGCTTCTTTTCT/AAAATGCTCAAA...TGCAG|TGT | 2 | 1 | 59.275 |
31221914 | GT-AG | 0 | 1.000000099473604e-05 | 562 | rna-XM_030448987.1 6061972 | 30 | 133688180 | 133688741 | Calypte anna 9244 | AAG|GTAAGTATAA...GTGTATTTAAAC/GTGTATTTAAAC...TCCAG|CTT | 2 | 1 | 62.073 |
31221915 | GT-AG | 0 | 1.000000099473604e-05 | 3925 | rna-XM_030448987.1 6061972 | 31 | 133689338 | 133693262 | Calypte anna 9244 | AAG|GTAATACAAT...GCATCCTTATCT/AGCATCCTTATC...CCCAG|GGA | 1 | 1 | 72.562 |
31221916 | GT-AG | 0 | 1.000000099473604e-05 | 1207 | rna-XM_030448987.1 6061972 | 32 | 133693439 | 133694645 | Calypte anna 9244 | GAG|GTAGGGCACT...CTTTTCATAAAT/TTACTTTTCATA...TATAG|GAG | 0 | 1 | 75.66 |
31221917 | GT-AG | 0 | 1.000000099473604e-05 | 1409 | rna-XM_030448987.1 6061972 | 33 | 133694774 | 133696182 | Calypte anna 9244 | CAG|GTAAGCACAA...ATAGCTTTGGCA/AGAGAGCTTAAA...TGTAG|TTC | 2 | 1 | 77.913 |
31221918 | GT-AG | 0 | 1.000000099473604e-05 | 139 | rna-XM_030448987.1 6061972 | 34 | 133696308 | 133696446 | Calypte anna 9244 | GAG|GTAGTGTGAG...TTTTTCTTGTGT/GTTCGGTTTACT...CATAG|GTC | 1 | 1 | 80.113 |
31221919 | GT-AG | 0 | 7.103070745404702e-05 | 885 | rna-XM_030448987.1 6061972 | 35 | 133696606 | 133697490 | Calypte anna 9244 | GCT|GTAAGTAATC...TTGTTCTTTTTT/CAGAAATGTATT...TACAG|CTA | 1 | 1 | 82.911 |
31221920 | GT-AG | 0 | 1.000000099473604e-05 | 347 | rna-XM_030448987.1 6061972 | 36 | 133697675 | 133698021 | Calypte anna 9244 | CAA|GTGAGTCCAA...ATTTTCTTTTTT/AACCTAATTATT...TTAAG|TGA | 2 | 1 | 86.149 |
31221921 | GT-AG | 0 | 1.000000099473604e-05 | 110 | rna-XM_030448987.1 6061972 | 37 | 133698144 | 133698253 | Calypte anna 9244 | CAG|GTACTAGAAC...TGTTTTTTATAT/ATGTTTTTTATA...TGCAG|GCG | 1 | 1 | 88.296 |
31221922 | GT-AG | 0 | 0.0002252829664272 | 1080 | rna-XM_030448987.1 6061972 | 38 | 133698351 | 133699430 | Calypte anna 9244 | CAG|GTAATTTTTA...TATTCCATAATA/TCTGTATTGATT...CTCAG|CTC | 2 | 1 | 90.004 |
31221923 | GT-AG | 0 | 1.000000099473604e-05 | 1269 | rna-XM_030448987.1 6061972 | 39 | 133699529 | 133700797 | Calypte anna 9244 | GAG|GTAAGTTTTT...TTTGTTTCATTT/TTTTGTTTCATT...TTCAG|AGG | 1 | 1 | 91.728 |
31221924 | GT-AG | 0 | 9.93246514579786e-05 | 3390 | rna-XM_030448987.1 6061972 | 40 | 133700962 | 133704351 | Calypte anna 9244 | GCA|GTAAGTATTC...TGCTGCTTACCT/CTGCTGCTTACC...TGCAG|TCG | 0 | 1 | 94.615 |
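The layout of `scored_motifs` is not documented on this page. As a hedged illustration, the sketch below assumes that the `|` characters mark the exon/intron boundaries, so that the two bases after the first `|` and the two bases before the last `|` are the intron's terminal dinucleotides. The function name `terminal_dinucleotides` is hypothetical. Under that assumption, the value derived for the first listed row matches its `dinucleotide_pair` column.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a string such as 'GT-AG' from a scored_motifs value.

    Assumption (not stated by the source): the text between the first and
    last '|' is intron sequence, flanked by a few exonic bases on each side.
    """
    first_bar = scored_motifs.index("|")    # raises ValueError if absent
    last_bar = scored_motifs.rindex("|")
    if last_bar - first_bar < 5:
        raise ValueError("expected two '|' separators around intron sequence")
    donor = scored_motifs[first_bar + 1:first_bar + 3]
    acceptor = scored_motifs[last_bar - 2:last_bar]
    return f"{donor}-{acceptor}"


# scored_motifs of the first listed row (id 31221885).
motifs = "AAG|GTGAGACCCA...TTTCTCTTAAGT/TTTTCTCTTAAG...CTCAG|CTT"
print(terminal_dinucleotides(motifs))  # prints GT-AG
```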
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
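To show how this schema might be queried locally, here is a minimal sketch using Python's built-in `sqlite3` module with an in-memory database. It is an illustration under stated assumptions, not the site's own code: the table is reduced to a subset of columns, the foreign keys are omitted because the `transcripts` and `genomes` tables are not shown here, only three of the listed rows are loaded, and the helper names `introns_for_transcript` and `highest_scoring` are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced copy of the schema; "start" and "end" stay quoted because END is an SQL keyword.
conn.executescript("""
CREATE TABLE "introns" (
  "id" INTEGER PRIMARY KEY,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# Three rows copied from the listing for transcript_id = 6061972.
rows = [
    (31221885, "GT-AG", 0, 1.000000099473604e-05, 19362, 6061972, 1, 133614177, 133633538),
    (31221886, "GT-AG", 0, 0.0003569643976575, 10548, 6061972, 2, 133633601, 133644148),
    (31221902, "GT-AG", 0, 0.2106515494465013, 1262, 6061972, 18, 133671895, 133673156),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?)", rows)


def introns_for_transcript(conn, transcript_id):
    """All introns of one transcript, in transcript order."""
    cur = conn.execute(
        "SELECT id, ordinal_index, score, length FROM introns "
        "WHERE transcript_id = ? ORDER BY ordinal_index",
        (transcript_id,),
    )
    return cur.fetchall()


def highest_scoring(conn, transcript_id):
    """The intron with the largest minor-intron score, or None if there are none."""
    cur = conn.execute(
        "SELECT id, ordinal_index, score FROM introns "
        "WHERE transcript_id = ? ORDER BY score DESC LIMIT 1",
        (transcript_id,),
    )
    return cur.fetchone()


print([r[1] for r in introns_for_transcript(conn, 6061972)])  # prints [1, 2, 18]
print(highest_scoring(conn, 6061972)[:2])                      # prints (31221902, 18)
```

Filtering on `transcript_id` is the access pattern that `idx_introns_transcript_id` in the schema above is built for; parameter binding with `?` keeps the transcript identifier out of the SQL string.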