introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
35 rows where transcript_id = 15214381
This data as json, CSV (advanced)
Suggested facets: score, phase, in_cds
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
82387656 | GT-AG | 0 | 0.0014277178783297 | 221842 | rna-XM_033918742.1 15214381 | 2 | 418328802 | 418550643 | Geotrypetes seraphini 260995 | AGA|GTAAGCTTAT...GTGTCCTTTTCT/ATCTTACTAATT...TATAG|CTC | 1 | 1 | 5.681 |
82387657 | GT-AG | 0 | 8.87341761180923e-05 | 29934 | rna-XM_033918742.1 15214381 | 3 | 418298722 | 418328655 | Geotrypetes seraphini 260995 | GAG|GTATTAGGTC...ATATCTTTGACA/CTATTGTTTATT...TACAG|GTG | 0 | 1 | 8.106 |
82387658 | GT-AG | 0 | 1.000000099473604e-05 | 178725 | rna-XM_033918742.1 15214381 | 4 | 418119855 | 418298579 | Geotrypetes seraphini 260995 | GTG|GTAAGTCATT...CATGACTTGATA/CATGACTTGATA...TTCAG|AGG | 1 | 1 | 10.465 |
82387659 | GT-AG | 0 | 5.477241478331716e-05 | 3635 | rna-XM_033918742.1 15214381 | 5 | 418116031 | 418119665 | Geotrypetes seraphini 260995 | CAG|GTAGGTTTTT...AGAACTTTATTC/TTTATTCTGACC...AACAG|GTG | 1 | 1 | 13.605 |
82387660 | GT-AG | 0 | 1.000000099473604e-05 | 4414 | rna-XM_033918742.1 15214381 | 6 | 418111506 | 418115919 | Geotrypetes seraphini 260995 | GAG|GTAGGAATGA...CTTTCCTCATTT/CCTTTCCTCATT...CTCAG|AGC | 1 | 1 | 15.449 |
82387661 | GT-AG | 0 | 1.000000099473604e-05 | 5882 | rna-XM_033918742.1 15214381 | 7 | 418105612 | 418111493 | Geotrypetes seraphini 260995 | AAG|GTTGGTGTGT...ATGTTATTACCA/TATGTTATTACC...TCCAG|TTC | 1 | 1 | 15.648 |
82387662 | GT-AG | 0 | 0.0001733506933084 | 2117 | rna-XM_033918742.1 15214381 | 8 | 418103225 | 418105341 | Geotrypetes seraphini 260995 | AAG|GTAATTTTTA...CTCTCCTAAATT/CAGTTCATGATT...TCTAG|CCT | 1 | 1 | 20.133 |
82387663 | GT-AG | 0 | 4.9230612631249165e-05 | 21743 | rna-XM_033918742.1 15214381 | 9 | 418080900 | 418102642 | Geotrypetes seraphini 260995 | GAG|GTAAATTTTA...TGTTTTTCAATA/ATGTTTTTCAAT...TTTAG|TTC | 1 | 1 | 29.801 |
82387664 | GT-AG | 0 | 4.283175763419872e-05 | 18047 | rna-XM_033918742.1 15214381 | 10 | 418062719 | 418080765 | Geotrypetes seraphini 260995 | GAG|GTAAATTAAA...TTTTTTTTAATA/TTTTTTTTAATA...TTTAG|CAA | 0 | 1 | 32.027 |
82387665 | GT-AG | 0 | 1.5379301121950457e-05 | 16885 | rna-XM_033918742.1 15214381 | 11 | 418045689 | 418062573 | Geotrypetes seraphini 260995 | CAA|GTAAGTCTTT...TCATTCCTATCC/CCACTTCTAACC...ATCAG|AGC | 1 | 1 | 34.435 |
82387666 | GT-AG | 0 | 0.0084108411331437 | 807 | rna-XM_033918742.1 15214381 | 12 | 418044576 | 418045382 | Geotrypetes seraphini 260995 | ATG|GTATGTTGTC...TCTGTTTTATCA/TTCTGTTTTATC...TCCAG|TTC | 1 | 1 | 39.518 |
82387667 | GT-AG | 0 | 1.000000099473604e-05 | 42376 | rna-XM_033918742.1 15214381 | 13 | 418002006 | 418044381 | Geotrypetes seraphini 260995 | CAG|GTATGGAAAT...GTTTCCTTTTTT/CCAACATTTATT...ATCAG|TGG | 0 | 1 | 42.741 |
82387668 | GT-AG | 0 | 1.000000099473604e-05 | 2463 | rna-XM_033918742.1 15214381 | 14 | 417999516 | 418001978 | Geotrypetes seraphini 260995 | CAT|GTGAGTAATA...GGATGTTTGACA/GTTATATTTAAT...TTTAG|GTA | 0 | 1 | 43.189 |
82387669 | GT-AG | 0 | 1.000000099473604e-05 | 11641 | rna-XM_033918742.1 15214381 | 15 | 417987757 | 417999397 | Geotrypetes seraphini 260995 | CAG|GTAATGTGCC...TTTGCTTTAATA/TTAATATTAATA...TACAG|TTC | 1 | 1 | 45.15 |
82387670 | GT-AG | 0 | 1.000000099473604e-05 | 107 | rna-XM_033918742.1 15214381 | 16 | 417987062 | 417987168 | Geotrypetes seraphini 260995 | AAG|GTAGGATTTG...GAACCTGTAAAT/GTTCCATTCATT...AATAG|ATT | 1 | 1 | 54.917 |
82387671 | GT-AG | 0 | 1.000000099473604e-05 | 315 | rna-XM_033918742.1 15214381 | 17 | 417986729 | 417987043 | Geotrypetes seraphini 260995 | AAG|GTAAAGTACA...ATGTTCTCAATT/AATGTTCTCAAT...AGCAG|TGT | 1 | 1 | 55.216 |
82387672 | GT-AG | 0 | 1.000000099473604e-05 | 10372 | rna-XM_033918742.1 15214381 | 18 | 417976259 | 417986630 | Geotrypetes seraphini 260995 | AAA|GTAAGTACCT...AAAACCTAAATG/TAAATGCTAACC...CTTAG|ATT | 0 | 1 | 56.844 |
82387673 | GT-AG | 0 | 1.000000099473604e-05 | 28114 | rna-XM_033918742.1 15214381 | 19 | 417947885 | 417975998 | Geotrypetes seraphini 260995 | AAA|GTGAGTCCCA...ATACTTTTAATT/TTCAGTCTCATC...TACAG|AGG | 2 | 1 | 61.163 |
82387674 | GT-AG | 0 | 4.061518294815313e-05 | 11899 | rna-XM_033918742.1 15214381 | 20 | 417935895 | 417947793 | Geotrypetes seraphini 260995 | GAG|GTACAGTATG...TTTGTCTTTGCT/TGCAGCTTAACA...TTTAG|CTA | 0 | 1 | 62.674 |
82387675 | GT-AG | 0 | 0.0070228679833882 | 26297 | rna-XM_033918742.1 15214381 | 21 | 417909388 | 417935684 | Geotrypetes seraphini 260995 | TCG|GTATGTATGA...TTAGTTTTAAAT/AATGTACTTATC...CTTAG|ACA | 0 | 1 | 66.163 |
82387676 | GT-AG | 0 | 2.920718334260992e-05 | 12338 | rna-XM_033918742.1 15214381 | 22 | 417896889 | 417909226 | Geotrypetes seraphini 260995 | AAG|GTAAGTTTCA...TTCTCCTTCCCC/TTTTTTCCCATA...AGCAG|CAA | 2 | 1 | 68.837 |
82387677 | GT-AG | 0 | 1.000000099473604e-05 | 22223 | rna-XM_033918742.1 15214381 | 23 | 417874654 | 417896876 | Geotrypetes seraphini 260995 | CAG|GTGAGAAGAC...TTCTTCTGATTT/ATTCTTCTGATT...TCCAG|GAA | 2 | 1 | 69.037 |
82387678 | GT-AG | 0 | 1.000000099473604e-05 | 22491 | rna-XM_033918742.1 15214381 | 24 | 417852050 | 417874540 | Geotrypetes seraphini 260995 | CAG|GTAAGAAGCA...TTTCTCTTCTCT/GTTCTTTTCAAA...AACAG|GTA | 1 | 1 | 70.914 |
82387679 | GT-AG | 0 | 1.000000099473604e-05 | 72411 | rna-XM_033918742.1 15214381 | 25 | 417779541 | 417851951 | Geotrypetes seraphini 260995 | GAG|GTAATATTTT...TTTTTCTTTTCT/ATGCTTTTTACT...TTAAG|TCG | 0 | 1 | 72.542 |
82387680 | GT-AG | 0 | 1.000000099473604e-05 | 59628 | rna-XM_033918742.1 15214381 | 26 | 417719789 | 417779416 | Geotrypetes seraphini 260995 | AAG|GTAAGGAAAT...CTTTTTTTAAAA/CTTTTTTTAAAA...TGCAG|GTG | 1 | 1 | 74.601 |
82387681 | GT-AG | 0 | 1.000000099473604e-05 | 35247 | rna-XM_033918742.1 15214381 | 27 | 417684366 | 417719612 | Geotrypetes seraphini 260995 | AGG|GTAAATACAA...AGTTTTTGGAAT/TGAGAATTCAAT...TATAG|GTA | 0 | 1 | 77.525 |
82387682 | GT-AG | 0 | 2.2561384255051907e-05 | 6886 | rna-XM_033918742.1 15214381 | 28 | 417677360 | 417684245 | Geotrypetes seraphini 260995 | AAG|GTACTAATTT...AACATTTTGACC/AACATTTTGACC...AATAG|AAT | 0 | 1 | 79.518 |
82387683 | GT-AG | 0 | 1.000000099473604e-05 | 100872 | rna-XM_033918742.1 15214381 | 29 | 417576333 | 417677204 | Geotrypetes seraphini 260995 | CAG|GTAAGAACAA...TTTTTTTTATCT/ATTTTTTTTATC...TACAG|TGC | 2 | 1 | 82.093 |
82387684 | GT-AG | 0 | 0.2570883002811207 | 5252 | rna-XM_033918742.1 15214381 | 30 | 417570795 | 417576046 | Geotrypetes seraphini 260995 | AAG|GTATCTAAAT...TCATTTTTAACA/TCTGTATTCATT...AACAG|CGT | 0 | 1 | 86.844 |
82387685 | GT-AG | 0 | 1.000000099473604e-05 | 13264 | rna-XM_033918742.1 15214381 | 31 | 417557352 | 417570615 | Geotrypetes seraphini 260995 | CAG|GTAATAATTG...TTCTCCTTTCTC/CTTTTCTACATG...TTTAG|GCA | 2 | 1 | 89.817 |
82387686 | GT-AG | 0 | 3.0411705820842905e-05 | 2346 | rna-XM_033918742.1 15214381 | 32 | 417554879 | 417557224 | Geotrypetes seraphini 260995 | AGG|GTAAGCACTT...TTTGTTTTATTT/TTTTGTTTTATT...ACTAG|GAA | 0 | 1 | 91.927 |
82387687 | GT-AG | 0 | 1.000000099473604e-05 | 24574 | rna-XM_033918742.1 15214381 | 33 | 417530179 | 417554752 | Geotrypetes seraphini 260995 | AGG|GTAAGTATTT...TAACACTTATTT/GTAACACTTATT...TAAAG|GAT | 0 | 1 | 94.02 |
82387688 | GT-AG | 0 | 1.000000099473604e-05 | 18387 | rna-XM_033918742.1 15214381 | 34 | 417511637 | 417530023 | Geotrypetes seraphini 260995 | CAG|GTGAGTTACA...TTGCTTTTAGCA/TTTGCTTTTAGC...AACAG|TGC | 2 | 1 | 96.595 |
82387689 | GT-AG | 0 | 1.000000099473604e-05 | 4702 | rna-XM_033918742.1 15214381 | 35 | 417506799 | 417511500 | Geotrypetes seraphini 260995 | GAG|GTGAGTGACA...ATAGTTTTACTT/GTTTTACTTACA...TCCAG|GAT | 0 | 1 | 98.854 |
82403210 | GT-AG | 0 | 0.1139436590597667 | 199139 | rna-XM_033918742.1 15214381 | 1 | 418550815 | 418749953 | Geotrypetes seraphini 260995 | CAG|GTACCCCCAA...TCTGTCTTATCT/TTCTGTCTTATC...TACAG|GCT | 0 | 4.269 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);