introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
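When exported rows are processed in code, the column list above can be mirrored as a typed record. The following is a minimal sketch, not part of the database itself: the class name `Intron` and the helper `span_length` are hypothetical. It assumes that `start` and `end` are both inclusive, so that `length` equals `end - start + 1`; this holds for the rows shown on this page, but whether it holds for every row in the database is an assumption.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Hypothetical record mirroring the columns of the introns table."""
    id: int
    dinucleotide_pair: str
    is_minor: int            # 1 = minor intron, 0 = not
    score: float             # probability of being minor, 0-100 (%)
    length: int
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]     # None for introns outside the coding sequence
    in_cds: int
    relative_position: float

    def span_length(self) -> int:
        # Assumes start and end are both inclusive coordinates.
        return self.end - self.start + 1


# Values copied from the row with id 82387450 on this page.
row = Intron(82387450, "GT-AG", 0, 1.000000099473604e-05, 10242, 15214374, 2,
             194055766, 194066007, 260995,
             "AAA|GTAAGTAGTG...AAATATTTATTC/CAAATATTTATT...AACAG|CGC",
             0, 1, 2.609)
print(row.span_length() == row.length)
```

A check like this is a cheap sanity test after importing a CSV or JSON export, since an off-by-one in coordinates would show up immediately.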
30 rows where transcript_id = 15214374
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
82387450 | GT-AG | 0 | 1.000000099473604e-05 | 10242 | rna-XM_033943151.1 15214374 | 2 | 194055766 | 194066007 | Geotrypetes seraphini 260995 | AAA|GTAAGTAGTG...AAATATTTATTC/CAAATATTTATT...AACAG|CGC | 0 | 1 | 2.609 |
82387451 | GT-AG | 0 | 1.000000099473604e-05 | 6863 | rna-XM_033943151.1 15214374 | 3 | 194048786 | 194055648 | Geotrypetes seraphini 260995 | GGG|GTGAGTTGAT...TTTTTTTTCATT/TTTTTTTTCATT...CCTAG|TTT | 0 | 1 | 4.47 |
82387452 | GT-AG | 0 | 5.507124851403329e-05 | 89 | rna-XM_033943151.1 15214374 | 4 | 194048619 | 194048707 | Geotrypetes seraphini 260995 | AAG|GTATGGAATA...TTTTCTTTACCT/CTTTTCTTTACC...TCCAG|GAT | 0 | 1 | 5.71 |
82387453 | GT-AG | 0 | 1.000000099473604e-05 | 2759 | rna-XM_033943151.1 15214374 | 5 | 194045749 | 194048507 | Geotrypetes seraphini 260995 | GGG|GTGAGTGACT...CAAACTTTATTT/ACAAACTTTATT...TTTAG|TCC | 0 | 1 | 7.476 |
82387454 | GT-AG | 0 | 1.000000099473604e-05 | 1764 | rna-XM_033943151.1 15214374 | 6 | 194043414 | 194045177 | Geotrypetes seraphini 260995 | GAG|GTAATTGCAA...ATTTTCAAAACT/TGGATTTTCAAA...CGCAG|ATT | 1 | 1 | 16.558 |
82387455 | GT-AG | 0 | 1.5656232863282812e-05 | 8049 | rna-XM_033943151.1 15214374 | 7 | 194035165 | 194043213 | Geotrypetes seraphini 260995 | TTT|GTAAGTAGCG...TTTCTCTTTGTG/TTTGATCTAATA...TGTAG|TTT | 0 | 1 | 19.739 |
82387456 | GT-AG | 0 | 1.000000099473604e-05 | 3935 | rna-XM_033943151.1 15214374 | 8 | 194030781 | 194034715 | Geotrypetes seraphini 260995 | CAG|GTCAGCAACT...CTCATCTTATAC/ATGATTCTCATC...CACAG|TAC | 2 | 1 | 26.881 |
82387457 | GT-AG | 0 | 1.000000099473604e-05 | 3965 | rna-XM_033943151.1 15214374 | 9 | 194026661 | 194030625 | Geotrypetes seraphini 260995 | CAG|GTGAGATACA...TATGTTTTCTCT/AACAGTTTGATA...CTAAG|TTT | 1 | 1 | 29.346 |
82387458 | GT-AG | 0 | 1.000000099473604e-05 | 5314 | rna-XM_033943151.1 15214374 | 10 | 194021138 | 194026451 | Geotrypetes seraphini 260995 | AGG|GTAAGATCCT...GTTTATTTAATG/ATTGTGTTTATT...TTCAG|CTC | 0 | 1 | 32.671 |
82387459 | GT-AG | 0 | 1.000000099473604e-05 | 4089 | rna-XM_033943151.1 15214374 | 11 | 194016909 | 194020997 | Geotrypetes seraphini 260995 | TAG|GTCAGTCTAG...CTATGCTTAAAA/AAAAACTTGACC...CTTAG|AAT | 2 | 1 | 34.897 |
82387460 | GT-AG | 0 | 1.000000099473604e-05 | 6156 | rna-XM_033943151.1 15214374 | 12 | 194010639 | 194016794 | Geotrypetes seraphini 260995 | ACT|GTGAGTACAA...TTCTATTTATTC/TATTTATTCATT...TATAG|TGT | 2 | 1 | 36.711 |
82387461 | GT-AG | 0 | 1.000000099473604e-05 | 2296 | rna-XM_033943151.1 15214374 | 13 | 194008184 | 194010479 | Geotrypetes seraphini 260995 | AGG|GTAAGTGCTA...GTTTTCTTTTTT/AGGAAACTGAAG...TACAG|TGA | 2 | 1 | 39.24 |
82387462 | GT-AG | 0 | 1.000000099473604e-05 | 5672 | rna-XM_033943151.1 15214374 | 14 | 194002388 | 194008059 | Geotrypetes seraphini 260995 | GAG|GTACAGTAAC...AAATTCTTGTAC/AACGTAATAACA...TTAAG|GAT | 0 | 1 | 41.212 |
82387463 | GT-AG | 0 | 1.000000099473604e-05 | 8444 | rna-XM_033943151.1 15214374 | 15 | 193993734 | 194002177 | Geotrypetes seraphini 260995 | CAG|GTAAAATAGT...TACATTTTGAAA/CAAATTATTATT...TGCAG|AAA | 0 | 1 | 44.552 |
82387464 | GT-AG | 0 | 1.000000099473604e-05 | 2956 | rna-XM_033943151.1 15214374 | 16 | 193989894 | 193992849 | Geotrypetes seraphini 260995 | CAG|GTGGGTTAGT...CTTTTTTTCTCT/TAGTTATTCATA...CACAG|TCC | 2 | 1 | 58.613 |
82387465 | GT-AG | 0 | 0.0004767876415635 | 1878 | rna-XM_033943151.1 15214374 | 17 | 193987876 | 193989753 | Geotrypetes seraphini 260995 | CAG|GTACCACACC...CATTCATTAATA/ATTCCATTCATT...CCCAG|GTG | 1 | 1 | 60.84 |
82387466 | GT-AG | 0 | 3.677219932935984e-05 | 15820 | rna-XM_033943151.1 15214374 | 18 | 193971927 | 193987746 | Geotrypetes seraphini 260995 | CAG|GTATAAAAGA...TACTCCTTGAAC/CTGTTTTTGACC...TTTAG|GCC | 1 | 1 | 62.892 |
82387467 | GT-AG | 0 | 1.000000099473604e-05 | 6473 | rna-XM_033943151.1 15214374 | 19 | 193965122 | 193971594 | Geotrypetes seraphini 260995 | AAA|GTAAGATTCT...GAGTTTGTAAAG/GAGTTTGTAAAG...TCTAG|GTT | 0 | 1 | 68.172 |
82387468 | GT-AG | 0 | 1.000000099473604e-05 | 5221 | rna-XM_033943151.1 15214374 | 20 | 193959767 | 193964987 | Geotrypetes seraphini 260995 | CAG|GTTAGTATAT...TTCTTCTAAATG/TTTCTTCTAAAT...GGTAG|GAG | 2 | 1 | 70.304 |
82387469 | GT-AG | 0 | 1.000000099473604e-05 | 4959 | rna-XM_033943151.1 15214374 | 21 | 193954679 | 193959637 | Geotrypetes seraphini 260995 | CAG|GTAAGTGATT...CTGGCTTTGAAA/GATGTCTTTATG...TACAG|GTA | 2 | 1 | 72.356 |
82387470 | GT-AG | 0 | 1.000000099473604e-05 | 3655 | rna-XM_033943151.1 15214374 | 22 | 193950776 | 193954430 | Geotrypetes seraphini 260995 | GAG|GTGACGTACT...TTCTCTTTGATT/TTCTCTTTGATT...TGTAG|GTG | 1 | 1 | 76.3 |
82387471 | GC-AG | 0 | 1.000000099473604e-05 | 2496 | rna-XM_033943151.1 15214374 | 23 | 193948111 | 193950606 | Geotrypetes seraphini 260995 | AAG|GCAAGTGCAA...ACTGCCTTGATT/ATTGTGCTTATT...TACAG|GTA | 2 | 1 | 78.988 |
82387472 | GT-AG | 0 | 1.000000099473604e-05 | 1665 | rna-XM_033943151.1 15214374 | 24 | 193946244 | 193947908 | Geotrypetes seraphini 260995 | CAG|GTACGAGGAT...TCTGCTTTACAT/CTCTGCTTTACA...CACAG|ACC | 0 | 1 | 82.201 |
82387473 | GT-AG | 0 | 1.000000099473604e-05 | 7566 | rna-XM_033943151.1 15214374 | 25 | 193938547 | 193946112 | Geotrypetes seraphini 260995 | TAG|GTAAGTTGTG...AGCTTTTTAGAA/TATTTTTTGAGC...TTCAG|AGA | 2 | 1 | 84.285 |
82387474 | GT-AG | 0 | 1.000000099473604e-05 | 1789 | rna-XM_033943151.1 15214374 | 26 | 193936600 | 193938388 | Geotrypetes seraphini 260995 | AAG|GTACAGACAG...CTAGGTTTAACT/CTAGGTTTAACT...TTCAG|TAC | 1 | 1 | 86.798 |
82387475 | GT-AG | 0 | 1.000000099473604e-05 | 3982 | rna-XM_033943151.1 15214374 | 27 | 193932463 | 193936444 | Geotrypetes seraphini 260995 | AAG|GTGAGAATTA...TGTTCTTTATAC/ATGTTCTTTATA...CGCAG|ATA | 0 | 1 | 89.264 |
82387476 | GT-AG | 0 | 4.163579266280223e-05 | 2673 | rna-XM_033943151.1 15214374 | 28 | 193929583 | 193932255 | Geotrypetes seraphini 260995 | AGA|GTAAGTTCGG...TTATTATTAATG/TTATTATTAATG...TCCAG|GTC | 0 | 1 | 92.556 |
82387477 | GT-AG | 0 | 1.000000099473604e-05 | 1509 | rna-XM_033943151.1 15214374 | 29 | 193927979 | 193929487 | Geotrypetes seraphini 260995 | CAG|GTAAGAAAAC...TTTTTTTTGTCC/GGTTGGTTTACT...GCCAG|GCT | 2 | 1 | 94.067 |
82387478 | GT-AG | 0 | 1.000000099473604e-05 | 13277 | rna-XM_033943151.1 15214374 | 30 | 193914577 | 193927853 | Geotrypetes seraphini 260995 | CAG|GTGTGGAAGC...TGAGTCATGACC/TGAGTCATGACC...TGTAG|GTG | 1 | 1 | 96.055 |
82403205 | GT-AG | 0 | 0.0007261177207855 | 328 | rna-XM_033943151.1 15214374 | 1 | 194066150 | 194066477 | Geotrypetes seraphini 260995 | CAT|GTAAGCATTC...TTATCTGTAACT/TATTATCTGATC...TGCAG|ATT | | 0 | 2.227 |
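In the rows above, the `dinucleotide_pair` value matches the first and last two bases of the middle segment of `scored_motifs`. The sketch below illustrates that relationship. The layout it assumes (upstream exon bases, then the intron motifs, then downstream exon bases, separated by `|`) is inferred from the sample rows and is not documented on this page; the function name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a value such as 'GT-AG' from a scored_motifs string.

    Assumed layout (inferred from the sample rows, not documented):
        upstream_exon|intron_start...intron_end|downstream_exon
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intron segment.
    return f"{intron[:2]}-{intron[-2:]}"


# scored_motifs values copied from the rows with ids 82387450 and 82387471.
print(terminal_dinucleotides(
    "AAA|GTAAGTAGTG...AAATATTTATTC/CAAATATTTATT...AACAG|CGC"))
print(terminal_dinucleotides(
    "AAG|GCAAGTGCAA...ACTGCCTTGATT/ATTGTGCTTATT...TACAG|GTA"))
```

Note that `scored_motifs` itself contains `|` characters, so the pipe-delimited table above cannot be split naively on `|`; the CSV or JSON export is safer for parsing.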
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
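The schema above can be tried out locally with Python's standard `sqlite3` module. The sketch below is illustrative only: it builds an in-memory table with the same columns, loads two rows copied from this page, and runs the same `transcript_id` filter that this view uses. The parent tables `transcripts` and `genomes` are not created here, so foreign key enforcement is switched off explicitly; ordering by `ordinal_index` is a choice made for this example, not something the page does.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
PRAGMA foreign_keys = OFF;  -- parent tables are not created in this sketch
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER,
  "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

# Two rows copied from the table on this page (ids 82387450 and 82387471).
sample_rows = [
    (82387450, "GT-AG", 0, 1.000000099473604e-05, 10242, 15214374, 2,
     194055766, 194066007, 260995,
     "AAA|GTAAGTAGTG...AAATATTTATTC/CAAATATTTATT...AACAG|CGC", 0, 1, 2.609),
    (82387471, "GC-AG", 0, 1.000000099473604e-05, 2496, 15214374, 23,
     193948111, 193950606, 260995,
     "AAG|GCAAGTGCAA...ACTGCCTTGATT/ATTGTGCTTATT...TACAG|GTA", 2, 1, 78.988),
]
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    sample_rows,
)

# Same filter as this page, ordered by position within the transcript.
rows = conn.execute(
    "SELECT id, dinucleotide_pair, ordinal_index FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (15214374,),
).fetchall()
print(rows)
```

Because `transcript_id` is indexed, a filter like this one should not require a full table scan in the real database, which matters given the size implied by the intron ids.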