introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (as a percentage, 0-100) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript, as a percentage of coding length
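The relationship between scored_motifs and dinucleotide_pair can be sketched in a few lines of Python. This is a minimal illustration, not the database's own parser: the helper name `terminal_dinucleotides` is hypothetical, and it assumes that the two `|` characters in a scored_motifs value mark the exon/intron boundaries, so the text between them begins and ends with the intron's terminal dinucleotides. The example string is copied from the first row of this table.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's terminal dinucleotides as 'XX-YY'.

    Assumption (not confirmed by the source): scored_motifs has the shape
    'upstream_exon|intron_motifs|downstream_exon', with '|' at the boundaries.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two characters of the intronic part.
    return f"{intron[:2]}-{intron[-2:]}"


example = "AAG|GTATATCAAT...TTTCCTTTAATT/GCTATTTTTATT...CATAG|TTG"
print(terminal_dinucleotides(example))  # GT-AG
```

For the rows shown on this page, the result agrees with the dinucleotide_pair column (GT-AG).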
19 rows where transcript_id = 23199990
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
126230681 | GT-AG | 0 | 0.0016584042918138 | 198 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 1 | 1595599 | 1595796 | Neocallimastix californiae 1754190 | AAG\|GTATATCAAT...TTTCCTTTAATT/GCTATTTTTATT...CATAG\|TTG | 1 | 1 | 1.33 |
126230682 | GT-AG | 0 | 0.0004750857128531 | 181 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 2 | 1595985 | 1596165 | Neocallimastix californiae 1754190 | AAA\|GTATAAAAAT...TTATTTTTAATA/TTATTTTTAATA...AAAAG\|GGT | 0 | 1 | 10.256 |
126230683 | GT-AG | 0 | 9.67166592412286e-05 | 414 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 3 | 1596306 | 1596719 | Neocallimastix californiae 1754190 | AAC\|GTAAAATATT...TATTTTTTGATA/TATTTTTTGATA...ACTAG\|TTA | 2 | 1 | 16.904 |
126230684 | GT-AG | 0 | 1.000000099473604e-05 | 263 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 4 | 1596871 | 1597133 | Neocallimastix californiae 1754190 | GAA\|GTAAGAAGTT...TTTTTTTTATTC/TTTTTTTTTATT...AACAG\|GAA | 0 | 1 | 24.074 |
126230685 | GT-AG | 0 | 2.379851575961469e-05 | 134 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 5 | 1597219 | 1597352 | Neocallimastix californiae 1754190 | ATG\|GTAATTTTAT...TTTGTATTATAT/TTTATATTTATA...ATAAG\|GGG | 1 | 1 | 28.11 |
126230686 | GT-AG | 0 | 0.0002289561328295 | 134 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 6 | 1597442 | 1597575 | Neocallimastix californiae 1754190 | GTG\|GTAAATTTTT...TATTACTTATCA/TTATTACTTATC...AATAG\|GGT | 0 | 1 | 32.336 |
126230687 | GT-AG | 0 | 1.000000099473604e-05 | 132 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 7 | 1597690 | 1597821 | Neocallimastix californiae 1754190 | TTG\|GTAATAAATT...TAATATTTAATT/ATTTAATTAATT...AACAG\|CCT | 0 | 1 | 37.749 |
126230688 | GT-AG | 0 | 1.000000099473604e-05 | 112 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 8 | 1597907 | 1598018 | Neocallimastix californiae 1754190 | AAG\|GTAAGTACAA...TGATTATTAATA/TGATTATTAATA...TAAAG\|GTT | 1 | 1 | 41.785 |
126230689 | GT-AG | 0 | 0.0327742578155525 | 221 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 9 | 1598130 | 1598350 | Neocallimastix californiae 1754190 | CCG\|GTATATATAT...TTTTCTTTAAAT/AAATTTTTTACT...TAAAG\|ATG | 1 | 1 | 47.056 |
126230690 | GT-AG | 0 | 1.000000099473604e-05 | 148 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 10 | 1598496 | 1598643 | Neocallimastix californiae 1754190 | AGG\|GTAAATATAC...TTATCATTATTT/ATTTTGTTCACC...AATAG\|TGT | 2 | 1 | 53.941 |
126230691 | GT-AG | 0 | 0.0488516576823477 | 113 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 11 | 1598774 | 1598886 | Neocallimastix californiae 1754190 | ATA\|GTATTCATAT...GATATATTAAAA/TAGTATTTTATG...TATAG\|TAT | 0 | 1 | 60.114 |
126230692 | GT-AG | 0 | 1.6498039491913273e-05 | 133 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 12 | 1598965 | 1599097 | Neocallimastix californiae 1754190 | GAT\|GTAAATGGAA...TTTTTTTTATTA/ATTTTTTTTATT...AATAG\|CAT | 0 | 1 | 63.818 |
126230693 | GT-AG | 0 | 0.0011752037993767 | 140 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 13 | 1599196 | 1599335 | Neocallimastix californiae 1754190 | AAA\|GTAATTTTTA...AAAATTTTAATC/AAAATTTTAATC...ATTAG\|GCA | 2 | 1 | 68.471 |
126230694 | GT-AG | 0 | 0.0001107804355342 | 102 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 14 | 1599384 | 1599485 | Neocallimastix californiae 1754190 | AAC\|GTAAATATTT...TAATTATTGATT/TAATTATTGATT...TAAAG\|TGA | 2 | 1 | 70.75 |
126230695 | GT-AG | 0 | 0.001098257248942 | 357 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 15 | 1599549 | 1599905 | Neocallimastix californiae 1754190 | GAA\|GTACGTTATA...ATTATATTAATT/ATTATATTAATT...AATAG\|TAA | 2 | 1 | 73.742 |
126230696 | GT-AG | 0 | 1.000000099473604e-05 | 201 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 16 | 1600117 | 1600317 | Neocallimastix californiae 1754190 | ATG\|GTAAAAAAAA...TTTACATTAACT/ATTATATTTACA...AATAG\|AAG | 0 | 1 | 83.761 |
126230697 | GT-AG | 0 | 0.0002145831792917 | 169 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 17 | 1600390 | 1600558 | Neocallimastix californiae 1754190 | TCT\|GTTCGTATTT...GATTACTTAAAT/AATAAATTTATT...TATAG\|AGT | 0 | 1 | 87.179 |
126230698 | GT-AG | 0 | 1.013625625149432e-05 | 174 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 18 | 1600630 | 1600803 | Neocallimastix californiae 1754190 | TAA\|GTAAAATCAA...TTTTTTTTGAAT/TTTTTTTTGAAT...AATAG\|GAA | 2 | 1 | 90.551 |
126230699 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-gnl\|WGS:MCOG\|LY90DRAFT_mRNA696725 23199990 | 19 | 1600935 | 1601093 | Neocallimastix californiae 1754190 | AAG\|GTAAAAAAAA...TTTTTTTTAATT/TTTTTTTTAATT...AATAG\|AAG | 1 | 1 | 96.771 |
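The start, end, and length columns in the rows above are consistent with inclusive coordinates, that is, length = end - start + 1. The sketch below checks this on the first three rows and derives the size of the exons lying between consecutive introns. The function names are illustrative, and the inclusive-coordinate reading is inferred from the displayed values rather than stated by the source.

```python
# (ordinal_index, start, end, length) copied from the first three rows.
rows = [
    (1, 1595599, 1595796, 198),
    (2, 1595985, 1596165, 181),
    (3, 1596306, 1596719, 414),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


def exon_lengths(sorted_rows):
    """Sizes of the gaps between consecutive introns (the internal exons).

    Assumes rows are sorted by genomic position and belong to one transcript.
    """
    return [
        next_start - prev_end - 1
        for (_, _, prev_end, _), (_, next_start, _, _) in zip(sorted_rows, sorted_rows[1:])
    ]


for ordinal, start, end, length in rows:
    assert inclusive_length(start, end) == length, f"intron {ordinal} mismatch"

print(exon_lengths(rows))  # [188, 140]
```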
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
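To show how this schema supports the "rows where transcript_id = 23199990" view, here is a minimal sketch using Python's standard sqlite3 module and an in-memory database. It is an illustration under assumptions: the table is trimmed to a subset of columns for brevity, the foreign keys are omitted, and only three of the rows from this page are loaded. Note that "end" is quoted because END is an SQL keyword.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        "start" INTEGER,
        "end" INTEGER,
        phase INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Three rows copied from the table on this page (subset of columns).
sample = [
    (126230681, "GT-AG", 0, 0.0016584042918138, 198, 23199990, 1, 1595599, 1595796, 1),
    (126230682, "GT-AG", 0, 0.0004750857128531, 181, 23199990, 2, 1595985, 1596165, 0),
    (126230683, "GT-AG", 0, 9.67166592412286e-05, 414, 23199990, 3, 1596306, 1596719, 2),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", sample)

# Parameterized equivalent of the page's filter, ordered along the transcript.
result = conn.execute(
    "SELECT ordinal_index, length, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (23199990,),
).fetchall()
print(result)  # [(1, 198, 1), (2, 181, 0), (3, 414, 2)]
```

Filtering on transcript_id is the kind of lookup the idx_introns_transcript_id index is defined for; whether the query planner actually uses it depends on the SQLite version and the table's statistics.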