introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
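The `start`, `end`, and `length` columns appear to be tied together by inclusive coordinates: in the rows listed on this page, `length` equals `end - start + 1`. The following is a minimal Python sketch of that check, using three rows copied from this page. The helper name `intron_length` is hypothetical, and the inclusive-coordinate convention is inferred from the displayed rows rather than stated by the database.

```python
def intron_length(start: int, end: int) -> int:
    """Length in base pairs, ASSUMING both coordinates are inclusive."""
    return end - start + 1


# (id, start, end, length) copied from three rows shown on this page
rows = [
    (126296154, 1486330, 1486461, 132),
    (126296155, 1486513, 1486707, 195),
    (126296175, 1493899, 1494068, 170),
]

# Collect the ids of any rows whose stored length disagrees with the coordinates
mismatches = [
    intron_id
    for intron_id, start, end, length in rows
    if intron_length(start, end) != length
]
print(mismatches)  # [] for these three rows
```

If the list is non-empty for other rows, the coordinate convention assumed here does not hold for them.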
22 rows where transcript_id = 23220167
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
126296154 | GT-AG | 0 | 5.961232005594321e-05 | 132 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 1 | 1486330 | 1486461 | Neocallimastix sp. jgi-2020a (2767002) | CAA\|GTTTGTAGAA...TTAATTTTAATT/TTAATTTTAATT...TTTAG\|TTG | 0 | 1 | 1.935 |
126296155 | GT-AG | 0 | 0.0006133038251449 | 195 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 2 | 1486513 | 1486707 | Neocallimastix sp. jgi-2020a (2767002) | GAC\|GTATGTAAAA...TTATTATTACTT/ATTATTATTACT...TTTAG\|AAA | 0 | 1 | 2.933 |
126296156 | GT-AG | 0 | 0.039258083299511 | 131 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 3 | 1486776 | 1486906 | Neocallimastix sp. jgi-2020a (2767002) | AAA\|GTATATTAAA...TTTTTTTTAAAT/TTTGTTTTTATT...ATTAG\|GAG | 2 | 1 | 4.262 |
126296157 | GT-AG | 0 | 1.000000099473604e-05 | 100 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 4 | 1486976 | 1487075 | Neocallimastix sp. jgi-2020a (2767002) | TAG\|GTAATTAAAA...AAAATGTTAATA/AAAATGTTAATA...TATAG\|AGT | 2 | 1 | 5.611 |
126296158 | GT-AG | 0 | 0.0015780716626181 | 125 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 5 | 1487226 | 1487350 | Neocallimastix sp. jgi-2020a (2767002) | AAA\|GTAATTTTAA...TATTTTTTATTT/TTTTTTTTCATA...TGAAG\|CTA | 2 | 1 | 8.543 |
126296159 | GT-AG | 0 | 2.090351635758858e-05 | 84 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 6 | 1487411 | 1487494 | Neocallimastix sp. jgi-2020a (2767002) | TAG\|GTAAATATAT...TTTTCCTTTATA/TTTTCCTTTATA...AATAG\|ACA | 2 | 1 | 9.717 |
126296160 | GT-AG | 0 | 0.0006218637995162 | 150 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 7 | 1490332 | 1490481 | Neocallimastix sp. jgi-2020a (2767002) | ATA\|GTAAGTTTTT...TATATTTTATAT/ATATATTTTATA...TAAAG\|AAC | 1 | 1 | 65.181 |
126296161 | GT-AG | 0 | 1.000000099473604e-05 | 185 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 8 | 1490734 | 1490918 | Neocallimastix sp. jgi-2020a (2767002) | AAG\|GTAAGTGGTA...TTAAAATTAAAA/TTAAAATTAAAA...AAAAG\|ATA | 1 | 1 | 70.108 |
126296162 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 9 | 1490931 | 1491018 | Neocallimastix sp. jgi-2020a (2767002) | ATG\|GTAAGATAAA...ATTATATTAATA/ATTATATTAATA...CATAG\|AGG | 1 | 1 | 70.342 |
126296163 | GT-AG | 0 | 0.0010596493353677 | 93 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 10 | 1491081 | 1491173 | Neocallimastix sp. jgi-2020a (2767002) | GAT\|GTAAGTTTTT...TTTCTATTAACT/TTTCTATTAACT...AATAG\|AAT | 0 | 1 | 71.554 |
126296164 | GT-AG | 0 | 0.0005704877228161 | 113 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 11 | 1491292 | 1491404 | Neocallimastix sp. jgi-2020a (2767002) | ATG\|GTATATAAAA...GTATCATTAAAA/ATTTGTATCATT...AAAAG\|CCC | 1 | 1 | 73.861 |
126296165 | GT-AG | 0 | 0.0001762432870044 | 121 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 12 | 1491476 | 1491596 | Neocallimastix sp. jgi-2020a (2767002) | TAT\|GTAAATATTT...AAATATTTAATT/AAATATTTAATT...TTAAG\|GAA | 0 | 1 | 75.249 |
126296166 | GT-AG | 0 | 1.000000099473604e-05 | 99 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 13 | 1491735 | 1491833 | Neocallimastix sp. jgi-2020a (2767002) | CAA\|GTAATAAATC...ATGTTCATAATT/TCTATGTTCATA...AACAG\|CAA | 0 | 1 | 77.947 |
126296167 | GT-AG | 0 | 1.000000099473604e-05 | 182 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 14 | 1491960 | 1492141 | Neocallimastix sp. jgi-2020a (2767002) | ATG\|GTAATTATAT...ATAATTTAAATT/ATAATAATAATT...TATAG\|GAT | 0 | 1 | 80.411 |
126296168 | GT-AG | 0 | 1.000000099473604e-05 | 81 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 15 | 1492246 | 1492326 | Neocallimastix sp. jgi-2020a (2767002) | AAG\|GTATTAAAAT...TAACAGTTAATA/ATAATAATAACA...AATAG\|TCA | 2 | 1 | 82.444 |
126296169 | GT-AG | 0 | 0.0158556509361248 | 83 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 16 | 1492396 | 1492478 | Neocallimastix sp. jgi-2020a (2767002) | AAA\|GTATATATTT...TATATATTAATA/AATTTATTTATT...TTTAG\|ACA | 2 | 1 | 83.793 |
126296170 | GT-AG | 0 | 0.0001243640470139 | 121 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 17 | 1492588 | 1492708 | Neocallimastix sp. jgi-2020a (2767002) | AAT\|GTAGAGTTTA...ATAATATTAATT/ATAATATTAATT...ATTAG\|ATT | 0 | 1 | 85.924 |
126296171 | GT-AG | 0 | 0.0076762138418056 | 91 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 18 | 1492788 | 1492878 | Neocallimastix sp. jgi-2020a (2767002) | AAA\|GTATGTTAAA...TATTTATTAATT/TATTTATTAATT...ATTAG\|ATG | 1 | 1 | 87.468 |
126296172 | GT-AG | 0 | 1.368577433275099e-05 | 237 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 19 | 1493040 | 1493276 | Neocallimastix sp. jgi-2020a (2767002) | AAG\|GTATAGCCAT...TTATTATTATTA/ATTATTATTATT...TATAG\|AAG | 0 | 1 | 90.616 |
126296173 | GT-AG | 0 | 0.0004960978531571 | 91 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 20 | 1493307 | 1493397 | Neocallimastix sp. jgi-2020a (2767002) | AAA\|GTAAGTTTTT...ATATTTTTAAAT/AATATATTAATT...AATAG\|TTT | 0 | 1 | 91.202 |
126296174 | GT-AG | 0 | 1.859547187855764e-05 | 145 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 21 | 1493467 | 1493611 | Neocallimastix sp. jgi-2020a (2767002) | AAG\|GTATAAAAAT...TTATTATTAAAT/TTATTATTAAAT...AAAAG\|ACA | 0 | 1 | 92.551 |
126296175 | GT-AG | 0 | 1.6429171335818723e-05 | 170 | rna-gnl\|WGS:JACVTC\|H8356DRAFT_mRNA1415852 (23220167) | 22 | 1493899 | 1494068 | Neocallimastix sp. jgi-2020a (2767002) | TAC\|GTGAGTTTCA...TTATTTTTATTA/TTTATTTTTATT...TTTAG\|ATA | 2 | 1 | 98.162 |
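The `scored_motifs` strings in the rows above share a visible layout: two `|` separators, two `...` gaps, and one `/` between a pair of internal motifs. The sketch below splits a value along those delimiters and checks that the first and last two bases of the middle segment reproduce `dinucleotide_pair`. The function names and the dictionary keys are hypothetical, and what each piece means biologically is not documented on this page, so the neutral labels are only an assumption about the layout.

```python
def split_scored_motifs(motifs: str) -> dict:
    """Split a scored_motifs string on its visible delimiters.

    ASSUMED layout (inferred from the rows on this page, not documented):
    left_flank|intron_start...internal_a/internal_b...intron_end|right_flank
    """
    left_flank, intron_part, right_flank = motifs.split("|")
    intron_start, internal, intron_end = intron_part.split("...")
    internal_a, internal_b = internal.split("/")
    return {
        "left_flank": left_flank,
        "intron_start": intron_start,
        "internal_a": internal_a,
        "internal_b": internal_b,
        "intron_end": intron_end,
        "right_flank": right_flank,
    }


def terminal_dinucleotides(motifs: str) -> str:
    """First two and last two bases of the middle segment, joined by '-'."""
    parts = split_scored_motifs(motifs)
    return f"{parts['intron_start'][:2]}-{parts['intron_end'][-2:]}"


# scored_motifs value of intron 126296154, copied from the first row
example = "CAA|GTTTGTAGAA...TTAATTTTAATT/TTAATTTTAATT...TTTAG|TTG"
print(terminal_dinucleotides(example))  # GT-AG
```

For this row the result matches the `dinucleotide_pair` column; strings that do not follow the assumed layout would raise a `ValueError` when unpacked.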
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
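A filter like the one behind this page (`transcript_id = 23220167`) can be reproduced against a local copy of the schema. The sketch below uses Python's standard `sqlite3` module with an in-memory database. It is a simplified stand-in, not the real database: the foreign-key clauses are omitted because the `transcripts` and `genomes` tables are not defined here, only one of the indexes is created, and only three rows (with a subset of columns) are copied from this page.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Same columns as the schema above; foreign keys left out on purpose
# because the referenced tables do not exist in this sketch.
conn.execute(
    """
    CREATE TABLE "introns" (
        "id" INTEGER PRIMARY KEY,
        "dinucleotide_pair" TEXT,
        "is_minor" INTEGER,
        "score" REAL,
        "length" INTEGER,
        "transcript_id" INTEGER,
        "ordinal_index" INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        "taxonomy_id" INTEGER,
        "scored_motifs" TEXT,
        "phase" INTEGER,
        "in_cds" INTEGER,
        "relative_position" REAL
    )
    """
)
conn.execute(
    'CREATE INDEX "idx_introns_transcript_id" ON "introns" ("transcript_id")'
)

# (id, dinucleotide_pair, is_minor, length, transcript_id, ordinal_index, phase)
rows = [
    (126296154, "GT-AG", 0, 132, 23220167, 1, 0),
    (126296160, "GT-AG", 0, 150, 23220167, 7, 1),
    (126296175, "GT-AG", 0, 170, 23220167, 22, 2),
]
conn.executemany(
    'INSERT INTO "introns" ("id", "dinucleotide_pair", "is_minor", "length", '
    '"transcript_id", "ordinal_index", "phase") VALUES (?, ?, ?, ?, ?, ?, ?)',
    rows,
)

# Parameterized version of the page's filter, ordered by id
result = conn.execute(
    'SELECT "ordinal_index", "length", "phase" FROM "introns" '
    'WHERE "transcript_id" = ? ORDER BY "id"',
    (23220167,),
).fetchall()
print(result)  # [(1, 132, 0), (7, 150, 1), (22, 170, 2)]
```

Quoting the column names keeps `"end"`, which is an SQL keyword, usable as an identifier.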