introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
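The relationship between some of these columns can be sketched in a few lines of Python. This is an illustration only, not the database's own code: the helper names `intron_length` and `terminal_dinucleotides` are hypothetical, the inclusive-coordinate convention is inferred from the rows listed on this page (where length equals end - start + 1), and the assumed layout of `scored_motifs` (upstream exon, `|`, intron sequence, `|`, downstream exon) is likewise a reading of those rows rather than a documented format.

```python
def intron_length(start: int, end: int) -> int:
    """Length in bp, assuming start and end are both inclusive coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a value such as 'GT-AG' from a scored_motifs string.

    Assumed layout (inferred, not documented):
        upstream_exon|intron_5prime...branch_region/...intron_3prime|downstream_exon
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two bases of the intron portion.
    return f"{intron[:2]}-{intron[-2:]}"


# Values taken from the first and eighteenth introns of transcript 34480527.
first_length = intron_length(500440, 500492)
gc_ag_pair = terminal_dinucleotides(
    "GAG|GCAAGTATAG...GTACCCAAAACT/CCAAAACTTATG...CCCAG|GTC"
)
```

For the two rows used here, the results agree with the stored `length` (53) and `dinucleotide_pair` (GC-AG) values.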
23 rows where transcript_id = 34480527
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
193751214 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 1 | 500440 | 500492 | Syncephalastrum racemosum 13706 | AAA|GTAATTGCAC...GTGTCGTTATTC/TGTGTCGTTATT...AATAG|ATC | 0 | 1 | 2.791 |
193751215 | GT-AG | 0 | 0.0007039046736653 | 52 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 2 | 500581 | 500632 | Syncephalastrum racemosum 13706 | ATG|GTATGCGATC...CTCACCTGGACT/CCCTTTCTCACC...TACAG|GTT | 1 | 1 | 5.065 |
193751216 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 3 | 500700 | 500746 | Syncephalastrum racemosum 13706 | CTG|GTAAATGAGA...CTCGTCTAAACC/CCTCGTCTAAAC...CTTAG|GCT | 2 | 1 | 6.796 |
193751217 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 4 | 500832 | 500882 | Syncephalastrum racemosum 13706 | GAG|GTGAGTGAAG...GCTTCTTTGACA/GCTTCTTTGACA...ATTAG|ATC | 0 | 1 | 8.992 |
193751218 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 5 | 501231 | 501275 | Syncephalastrum racemosum 13706 | AAG|GTAAAGCTTT...CTCGTCTTACTA/TCTCGTCTTACT...TTCAG|GGT | 0 | 1 | 17.984 |
193751219 | GT-AG | 0 | 0.0010861973214236 | 46 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 6 | 501359 | 501404 | Syncephalastrum racemosum 13706 | AAC|GTACGTTGCC...ATACTCTCGACT/TCTCGACTAATT...ACTAG|CGT | 2 | 1 | 20.129 |
193751220 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 7 | 501466 | 501517 | Syncephalastrum racemosum 13706 | CAG|GTAAAGTTTT...CACGTCTCAACT/TCACGTCTCAAC...ACTAG|GAT | 0 | 1 | 21.705 |
193751221 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 8 | 501632 | 501681 | Syncephalastrum racemosum 13706 | GCT|GTCAGTACAG...GTACCTATACCA/CCATTGCTAACT...AACAG|ATC | 0 | 1 | 24.651 |
193751222 | GT-AG | 0 | 0.0009314330167779 | 47 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 9 | 501761 | 501807 | Syncephalastrum racemosum 13706 | TCC|GTAAGCTGGC...ACACATTTAACC/ACACATTTAACC...TGTAG|AGT | 1 | 1 | 26.693 |
193751223 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 10 | 501967 | 502016 | Syncephalastrum racemosum 13706 | CGA|GTAAGTCAAA...TAGACCCTGGTT/CCCTGGTTAAAC...TTTAG|ATA | 1 | 1 | 30.801 |
193751224 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 11 | 502088 | 502134 | Syncephalastrum racemosum 13706 | AAG|GTGAGTAGGA...TGATTTATAATG/ATGATACTGACG...TATAG|CTC | 0 | 1 | 32.636 |
193751225 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 12 | 502279 | 502340 | Syncephalastrum racemosum 13706 | AAT|GTAGGAACGT...GTATCCTCATCG/TGTATCCTCATC...TCTAG|CTT | 0 | 1 | 36.357 |
193751226 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 13 | 502484 | 502541 | Syncephalastrum racemosum 13706 | GAG|GTGAGTGAAG...TCATGTTTGAAA/AGCGTACTCATG...GATAG|GGA | 2 | 1 | 40.052 |
193751227 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 14 | 502649 | 502697 | Syncephalastrum racemosum 13706 | TTG|GTAAGGTAGC...TTTGCTTTGATA/TTTGCTTTGATA...GTTAG|CCT | 1 | 1 | 42.817 |
193751228 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 15 | 502820 | 502867 | Syncephalastrum racemosum 13706 | CAG|GTACAGTGGC...TTTGTCTTGTTC/CTTGTTCTAACG...AGTAG|TTC | 0 | 1 | 45.969 |
193751229 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 16 | 503031 | 503097 | Syncephalastrum racemosum 13706 | CAG|GTGCGCCCGA...TTTTTTTTATTT/TTTTTTTTTATT...TAAAG|TCA | 1 | 1 | 50.181 |
193751230 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 17 | 503271 | 503322 | Syncephalastrum racemosum 13706 | GAG|GTAGAAGCTA...TGACCCTTTTCC/AGCGTATTGACC...GGTAG|GCT | 0 | 1 | 54.651 |
193751231 | GC-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 18 | 503450 | 503498 | Syncephalastrum racemosum 13706 | GAG|GCAAGTATAG...GTACCCAAAACT/CCAAAACTTATG...CCCAG|GTC | 1 | 1 | 57.933 |
193751232 | GT-AG | 0 | 3.6213501200171406e-05 | 50 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 19 | 503675 | 503724 | Syncephalastrum racemosum 13706 | ATG|GTAATGTTTG...GTCCTCTTATAC/CGTCCTCTTATA...AATAG|ATC | 0 | 1 | 62.481 |
193751233 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 20 | 503839 | 503886 | Syncephalastrum racemosum 13706 | AAG|GTAAGGGTAT...GGCTACTCATCT/CGGCTACTCATC...GACAG|GGC | 0 | 1 | 65.426 |
193751234 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 21 | 504073 | 504132 | Syncephalastrum racemosum 13706 | GAG|GTGTGTGATT...TATTCTTTCATG/TATTCTTTCATG...TATAG|GGT | 0 | 1 | 70.233 |
193751235 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 22 | 504581 | 504629 | Syncephalastrum racemosum 13706 | AAG|GTAACACGAA...TGAAGCTGACCT/GTGAAGCTGACC...GAAAG|ATA | 1 | 1 | 81.809 |
193751236 | GT-AG | 0 | 1.000000099473604e-05 | 79 | rna-gnl|WGS:MCGN|BCR43DRAFT_mRNA465295 34480527 | 23 | 504818 | 504896 | Syncephalastrum racemosum 13706 | TTC|GTAAGTAGAG...GTGTACTCACCT/AGTGTACTCACC...AACAG|AAA | 0 | 1 | 86.667 |
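The filtered view above can be approximated with Python's standard `sqlite3` module. The sketch below is a minimal, self-contained illustration: it builds an in-memory table holding only a subset of the columns and four of the rows shown here, and the function names `introns_for_transcript` and `phase_counts` are hypothetical helpers, not part of the database or its web interface.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Reduced table: only the columns needed for this illustration.
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER
    )
    """
)
# Four of the 23 rows listed above (introns 1, 2, 3 and 18).
sample_rows = [
    (193751214, "GT-AG", 34480527, 1, 0),
    (193751215, "GT-AG", 34480527, 2, 1),
    (193751216, "GT-AG", 34480527, 3, 2),
    (193751231, "GC-AG", 34480527, 18, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", sample_rows)


def introns_for_transcript(conn, transcript_id):
    """Return the introns of one transcript, ordered by id as in the table view."""
    return conn.execute(
        "SELECT id, dinucleotide_pair, ordinal_index, phase "
        "FROM introns WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),
    ).fetchall()


def phase_counts(conn, transcript_id):
    """Count introns per phase for one transcript."""
    rows = conn.execute(
        "SELECT phase, COUNT(*) FROM introns "
        "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
        (transcript_id,),
    ).fetchall()
    return dict(rows)


rows = introns_for_transcript(conn, 34480527)
counts = phase_counts(conn, 34480527)
```

Run against all 23 rows listed above instead of this four-row sample, the same aggregate would report 13 introns in phase 0, 7 in phase 1 and 3 in phase 2.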
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
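To see how the indexes in this schema might be used, the following sketch loads the schema into an empty in-memory SQLite database and asks the query planner how it would run a lookup by `transcript_id`. It assumes only Python's standard `sqlite3` module; `plan_for` is a hypothetical helper, and the exact wording of the plan text varies between SQLite versions, so the code only looks for the index name.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
    "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
    "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT,
    "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
"""

conn = sqlite3.connect(":memory:")
# The referenced transcripts and genomes tables are not created here;
# SQLite accepts the foreign key declarations without them.
conn.executescript(SCHEMA)


def plan_for(conn, sql, params=()):
    """Return the human-readable detail strings of EXPLAIN QUERY PLAN."""
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]


plan = plan_for(
    conn, "SELECT * FROM introns WHERE transcript_id = ?", (34480527,)
)
uses_transcript_index = any("idx_introns_transcript_id" in step for step in plan)

# List the explicitly created indexes on the table.
index_names = sorted(
    row[0]
    for row in conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'index' AND tbl_name = 'introns' AND sql IS NOT NULL"
    )
)
```

Here `uses_transcript_index` indicates whether the planner chose `idx_introns_transcript_id` for the per-transcript filter, which is the kind of lookup behind the 23-row view on this page.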