introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
45 rows where transcript_id = 23220162
This data as json, CSV (advanced)
Suggested facets: phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
126296014 | GT-AG | 0 | 0.0001153212106871 | 260 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 1 | 384339 | 384598 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAGTATTA...TTATTTTTACTC/TTGGTTATCATT...TTTAG|GAA | 2 | 1 | 3.79 |
126296015 | GT-AG | 0 | 0.0008086909263588 | 103 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 2 | 384158 | 384260 | Neocallimastix sp. jgi-2020a 2767002 | TCT|GTAAGTCTAA...TTTTTTTTATCA/TTTTTTTTTATC...AATAG|TTA | 2 | 1 | 5.11 |
126296016 | GT-AG | 0 | 1.000000099473604e-05 | 1979 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 3 | 382153 | 384131 | Neocallimastix sp. jgi-2020a 2767002 | ATG|GTGATGAATG...AAAATGTTAATA/AAGAATTTAATT...ATTAG|CGC | 1 | 1 | 5.55 |
126296017 | GT-AG | 0 | 3.969539295359619e-05 | 260 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 5 | 381716 | 381975 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAGTATTA...TTATCATTATTC/TCATTATTCACT...TTTAG|GAA | 2 | 1 | 8.156 |
126296018 | GT-AG | 0 | 1.993906648784544e-05 | 116 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 6 | 381527 | 381642 | Neocallimastix sp. jgi-2020a 2767002 | CGA|GTTCTGTAAG...TTTTTTTTATCA/TTTTTTTTTATC...TTTAG|AAG | 0 | 1 | 9.391 |
126296019 | GT-AG | 0 | 1.000000099473604e-05 | 152 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 7 | 381289 | 381440 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAAAATAA...GTTTTCATATTT/TATATTCTAATT...GGTAG|ATC | 2 | 1 | 10.846 |
126296020 | GT-AG | 0 | 0.0005999896275398 | 143 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 8 | 381074 | 381216 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAATTTATT...TATTTTATAATT/ATTTTTCTAATA...TTTAG|AAA | 2 | 1 | 12.064 |
126296021 | GT-AG | 0 | 1.000000099473604e-05 | 124 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 9 | 380875 | 380998 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAATAAATA...TATTTTTTATAC/TTTTTTTTCATT...ATTAG|AAC | 2 | 1 | 13.333 |
126296022 | GT-AG | 0 | 0.0002941119585937 | 142 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 10 | 380643 | 380784 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAATATAG...ATATTTTTATTT/TATTTATTTATT...TGCAG|GTA | 2 | 1 | 14.856 |
126296023 | GT-AG | 0 | 1.000000099473604e-05 | 292 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 11 | 380279 | 380570 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAATAATAT...TTATTATTATTT/TTTATATTTATT...GTTAG|ATC | 2 | 1 | 16.074 |
126296024 | GT-AG | 0 | 0.0001480177708172 | 125 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 12 | 380091 | 380215 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAATAAAA...TTGTTTTTATTT/ATTGTTTTTATT...ATTAG|ATC | 2 | 1 | 17.14 |
126296025 | GT-AG | 0 | 9.923335501726257e-05 | 111 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 13 | 379905 | 380015 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAGTCCTA...TTTTTTTTATTT/ATTTTTTTTATT...CATAG|AGC | 2 | 1 | 18.409 |
126296026 | GT-AG | 0 | 0.0165207711433963 | 110 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 14 | 379726 | 379835 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAATTTTT...TTTGTTTTATCA/TTAATATTGATT...TTTAG|AAC | 2 | 1 | 19.577 |
126296027 | GT-AG | 0 | 2.2062646292499725e-05 | 185 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 15 | 379478 | 379662 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAGTAAAA...TCTATTTTAAAT/TCTATTTTAAAT...TAAAG|AAA | 2 | 1 | 20.643 |
126296028 | GT-AG | 0 | 1.2702144006904854e-05 | 140 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 16 | 379266 | 379405 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAATAAAA...TCATTCATAAAA/ATTTATTTCATA...TAAAG|ATA | 2 | 1 | 21.861 |
126296029 | GT-AG | 0 | 0.0006356496541397 | 163 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 17 | 379031 | 379193 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAATCATAA...AACACATTAATA/CATATTATTATT...TATAG|ACG | 2 | 1 | 23.08 |
126296030 | GT-AG | 0 | 0.0002799110434258 | 148 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 18 | 378814 | 378961 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAGTTTCT...GGATATTTAAAT/TTTAAATTAATA...AATAG|ATC | 2 | 1 | 24.247 |
126296031 | GT-AG | 0 | 0.1670483563569761 | 106 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 19 | 378639 | 378744 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAACTATTT...TAATCATTATTT/TATATATTAATT...ATAAG|TGA | 2 | 1 | 25.415 |
126296032 | GT-AG | 0 | 0.000873871816814 | 386 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 20 | 378184 | 378569 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAATTGAT...TTTTTTTTATTA/TTTTTTTTTATT...TAAAG|TGA | 2 | 1 | 26.582 |
126296033 | GT-AG | 0 | 3.077695398733438e-05 | 110 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 21 | 378005 | 378114 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTAATTGATT...TATGACTTAATT/CATATATTAATT...AAAAG|TCG | 2 | 1 | 27.75 |
126296034 | GT-AG | 0 | 0.0022289662304306 | 201 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 22 | 377735 | 377935 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAATATAA...ATTTTCTTAATT/TAATTTTTTATT...AATAG|AGA | 2 | 1 | 28.917 |
126296035 | GT-AG | 0 | 0.0003273819515071 | 159 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 23 | 377507 | 377665 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAGTGTTT...TTTGCTTTAATT/ATTTTTTTTATT...AAAAG|GAA | 2 | 1 | 30.085 |
126296036 | GT-AG | 0 | 4.602757786247359e-05 | 255 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 24 | 377183 | 377437 | Neocallimastix sp. jgi-2020a 2767002 | CCT|GTAAAAATAG...ATATTTTTAATT/ATATTTTTAATT...ATTAG|CAA | 2 | 1 | 31.252 |
126296037 | GT-AG | 0 | 1.000000099473604e-05 | 131 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 25 | 376980 | 377110 | Neocallimastix sp. jgi-2020a 2767002 | CAT|GTAAGTAAAA...TATCCTATATTT/CCTATATTTATA...CAAAG|AAA | 2 | 1 | 32.47 |
126296038 | GT-AG | 0 | 0.0002688693402435 | 232 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 26 | 376334 | 376565 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAGTTGAA...TTTTTTTTATTT/TTTTTTTTTATT...AATAG|GAA | 2 | 1 | 39.475 |
126296039 | GT-AG | 0 | 0.0001789224947498 | 124 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 27 | 376132 | 376255 | Neocallimastix sp. jgi-2020a 2767002 | ATT|GTAAATATTT...TTTTCTTTTTCA/TTCTTTTTCATT...TATAG|AAC | 2 | 1 | 40.795 |
126296040 | GT-AG | 0 | 0.2402895417185944 | 168 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 28 | 375883 | 376050 | Neocallimastix sp. jgi-2020a 2767002 | ACT|GTATGTTCTA...TTATTATTAATT/TTATTATTAATT...AAAAG|TGA | 2 | 1 | 42.166 |
126296041 | GT-AG | 0 | 0.0002787598959346 | 665 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 29 | 375146 | 375810 | Neocallimastix sp. jgi-2020a 2767002 | GTT|GTAAGTATAA...TTATTTTTAATA/TTATTTTTAATA...CATAG|TAA | 2 | 1 | 43.384 |
126296042 | GT-AG | 0 | 0.0001386761957453 | 230 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 30 | 374841 | 375070 | Neocallimastix sp. jgi-2020a 2767002 | CCT|GTAAATATAA...AAATGTTTAATG/AATGTATTTATT...AAAAG|TCA | 2 | 1 | 44.653 |
126296043 | GT-AG | 0 | 0.000175486351892 | 184 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 31 | 374585 | 374768 | Neocallimastix sp. jgi-2020a 2767002 | TCT|GTAATATTTA...ATATTATTAAAA/TGAATATTTATT...AAAAG|GAC | 2 | 1 | 45.871 |
126296044 | GT-AG | 0 | 1.000000099473604e-05 | 128 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 32 | 374385 | 374512 | Neocallimastix sp. jgi-2020a 2767002 | CTT|GTTAGTATAT...TATTTGTTATTT/AAATTTCTAACA...AATAG|AGA | 2 | 1 | 47.09 |
126296045 | GT-AG | 0 | 3.697331858793045e-05 | 185 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 33 | 374122 | 374306 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTAAATAAAA...TTTTTCATATCA/ATTTTTTTCATA...CATAG|AAC | 2 | 1 | 48.409 |
126296046 | GT-AG | 0 | 0.9375788590996662 | 145 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 34 | 373902 | 374046 | Neocallimastix sp. jgi-2020a 2767002 | TCT|GTAACTTTAT...TATTTTTTATTA/TTATTTTTTATT...TATAG|TGC | 2 | 1 | 49.679 |
126296047 | GT-AG | 0 | 1.000000099473604e-05 | 285 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 35 | 373548 | 373832 | Neocallimastix sp. jgi-2020a 2767002 | TCT|GTTAGTGTTT...TTATGTTTAAAT/TTATGTTTAAAT...TTTAG|TAA | 2 | 1 | 50.846 |
126296048 | GT-AG | 0 | 0.0042440630793724 | 117 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 36 | 373362 | 373478 | Neocallimastix sp. jgi-2020a 2767002 | CCT|GTAAGTTTTA...AATTCATTAATT/TTTTTTCTTAAA...TATAG|TTA | 2 | 1 | 52.014 |
126296049 | GT-AG | 0 | 1.000000099473604e-05 | 246 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 37 | 373044 | 373289 | Neocallimastix sp. jgi-2020a 2767002 | TTT|GTTAGTATAT...ATAATTTTAATT/ATAATTTTAATT...TTTAG|AGG | 2 | 1 | 53.232 |
126296050 | GT-AG | 0 | 0.178929314255041 | 151 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 38 | 372824 | 372974 | Neocallimastix sp. jgi-2020a 2767002 | TCT|GTATGTATTC...AATATTTTAATA/TATATATTAACT...TATAG|TGA | 2 | 1 | 54.399 |
126296051 | GT-AG | 0 | 0.0001265922554723 | 211 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 39 | 372541 | 372751 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTAATTAAAA...TTTTTTTTAATT/TTTTTTTTAATT...ATTAG|TGA | 2 | 1 | 55.618 |
126296052 | GT-AG | 0 | 1.3324180491248617e-05 | 889 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 40 | 370498 | 371386 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTAATAATTA...AATTCCTTAGGA/TTAAATTTCATC...AAGAG|TGT | 1 | 1 | 75.144 |
126296053 | GT-AG | 0 | 0.0005991400758937 | 249 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 42 | 369810 | 370058 | Neocallimastix sp. jgi-2020a 2767002 | TTG|GTTTTTATTT...AAATTCTTATAG/AAAATTCTTATA...TCTAG|TGA | 1 | 1 | 82.2 |
126296054 | GT-AG | 0 | 0.0030932334692643 | 93 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 43 | 369656 | 369748 | Neocallimastix sp. jgi-2020a 2767002 | AAT|GTCTTTTGAT...TTTGTTTTATTT/GTTTGTTTTATT...CTTAG|TAT | 2 | 1 | 83.232 |
126296055 | GT-AG | 0 | 1.000000099473604e-05 | 127 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 44 | 369429 | 369555 | Neocallimastix sp. jgi-2020a 2767002 | AAA|GTTAGAAAAT...ATATTATTAGCA/CTATTATTCAAT...CTTAG|GTT | 0 | 1 | 84.924 |
126296056 | GT-AG | 0 | 1.6643722150614582e-05 | 46 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 46 | 369066 | 369111 | Neocallimastix sp. jgi-2020a 2767002 | CTG|GTGCTCTACA...TATCACTAAATT/ATATCACTAAAT...CAAAG|ATG | 0 | 1 | 89.949 |
126296057 | GT-AG | 0 | 1.000000099473604e-05 | 215 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 47 | 368604 | 368818 | Neocallimastix sp. jgi-2020a 2767002 | AAG|GTGAACAATC...TATTCCTTCTTG/TGGAATTTCAGC...TTAAG|ATG | 1 | 1 | 94.129 |
126296058 | GT-AG | 0 | 1.6354220576739667e-05 | 43 | rna-gnl|WGS:JACVTC|H8356DRAFT_mRNA1415747 23220162 | 49 | 368279 | 368321 | Neocallimastix sp. jgi-2020a 2767002 | ATA|GTAAGTAATT...AACTTTTTGATG/AACTTTTTGATG...CAAAG|GAG | 0 | 1 | 98.426 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);