introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
33 rows where transcript_id = 623754
This data as json, CSV (advanced)
Suggested facets: dinucleotide_pair, score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3436789 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-EDS130_LOCUS559 623754 | 1 | 1557653 | 1557699 | Adineta ricciae 249248 | ATT|GTAAGAAATT...CCGACATAAGTT/AAGTAGCCGACA...TTTAG|TGT | 1 | 1 | 2.433 |
3436790 | GT-AG | 0 | 0.581413602674143 | 58 | rna-EDS130_LOCUS559 623754 | 2 | 1557801 | 1557858 | Adineta ricciae 249248 | GGA|GTATGCTTGA...TAGCTCTAAGCT/ATAGCTCTAAGC...TGTAG|GAT | 0 | 1 | 3.726 |
3436791 | GT-AG | 0 | 0.0004347586995329 | 66 | rna-EDS130_LOCUS559 623754 | 3 | 1558078 | 1558143 | Adineta ricciae 249248 | CGA|GTAATTTCAT...TTTTTTTTAAAA/TTTTTTTTTAAA...ATCAG|GAT | 0 | 1 | 6.531 |
3436792 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-EDS130_LOCUS559 623754 | 4 | 1558214 | 1558268 | Adineta ricciae 249248 | ATA|GTGAGAAAAG...TCTTTTTTTATG/ATGTCATTCATT...TTAAG|CAC | 1 | 1 | 7.427 |
3436793 | GT-AG | 0 | 1.000000099473604e-05 | 258 | rna-EDS130_LOCUS559 623754 | 5 | 1558540 | 1558797 | Adineta ricciae 249248 | CGA|GTAAGGACAC...CATTTTTTAATC/CATTTTTTAATC...TTTAG|TAT | 2 | 1 | 10.898 |
3436794 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-EDS130_LOCUS559 623754 | 6 | 1558862 | 1558916 | Adineta ricciae 249248 | CTT|GTAAGAAGAA...TATACTTTGATC/TTTTTATTGAAT...TTAAG|GAA | 0 | 1 | 11.717 |
3436795 | GT-AG | 0 | 1.3818669208880289e-05 | 56 | rna-EDS130_LOCUS559 623754 | 7 | 1559022 | 1559077 | Adineta ricciae 249248 | AAA|GTAATTTCGA...ATGTTCGTATAA/AATGTTCGTATA...TTTAG|GAG | 0 | 1 | 13.062 |
3436796 | GT-AG | 0 | 0.069129018095287 | 54 | rna-EDS130_LOCUS559 623754 | 8 | 1559168 | 1559221 | Adineta ricciae 249248 | GAC|GTAACATTTC...AAGCCTTTATCC/TTATTACTCAAT...TTTAG|TTT | 0 | 1 | 14.214 |
3436797 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-EDS130_LOCUS559 623754 | 9 | 1559288 | 1559337 | Adineta ricciae 249248 | TTG|GTAAGAAGAA...CTATTTTTCTTT/AAAAGATCTATT...CATAG|GAT | 0 | 1 | 15.06 |
3436798 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-EDS130_LOCUS559 623754 | 10 | 1559440 | 1559490 | Adineta ricciae 249248 | AAA|GTGAGAAATC...TCTACTATGAAG/TCTACTATGAAG...TGTAG|AGA | 0 | 1 | 16.366 |
3436799 | GT-AG | 0 | 3.191245983353856e-05 | 62 | rna-EDS130_LOCUS559 623754 | 11 | 1559577 | 1559638 | Adineta ricciae 249248 | AAA|GTAAGTTTTT...ATTGCATTCATT/ATTGCATTCATT...TCAAG|AGC | 2 | 1 | 17.467 |
3436800 | GT-AG | 0 | 3.552762006694281e-05 | 447 | rna-EDS130_LOCUS559 623754 | 12 | 1559731 | 1560177 | Adineta ricciae 249248 | CAA|GTACGTCGAT...TTCTCATTATTT/TTCTTTCTCATT...TTTAG|ACG | 1 | 1 | 18.645 |
3436801 | GT-AG | 0 | 0.0003693407490491 | 68 | rna-EDS130_LOCUS559 623754 | 13 | 1560240 | 1560307 | Adineta ricciae 249248 | CAT|GTAATCGAGT...CTTTTCTTATCT/ACTTTTCTTATC...TCAAG|GAA | 0 | 1 | 19.439 |
3436802 | GT-AG | 0 | 1.000000099473604e-05 | 48 | rna-EDS130_LOCUS559 623754 | 14 | 1560458 | 1560505 | Adineta ricciae 249248 | ATG|GTACGACTTT...TGAACATTATCA/ATTATCATCATC...TTTAG|AAT | 0 | 1 | 21.36 |
3436803 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-EDS130_LOCUS559 623754 | 15 | 1561118 | 1561170 | Adineta ricciae 249248 | AAA|GTAAGATCTA...TGGCATTTGAAA/AATCGATTCAAC...TTCAG|GTT | 0 | 1 | 29.197 |
3436804 | GT-AG | 0 | 0.0011045456931931 | 51 | rna-EDS130_LOCUS559 623754 | 16 | 1562233 | 1562283 | Adineta ricciae 249248 | CGA|GTATGATTGT...ATCGCTTTATGA/ATTTGAATAATA...TCTAG|GCA | 0 | 1 | 42.797 |
3436805 | GT-AG | 0 | 4.4882542620735896e-05 | 64 | rna-EDS130_LOCUS559 623754 | 17 | 1562419 | 1562482 | Adineta ricciae 249248 | ACG|GTTTGTATAT...ACATTCTTGTCC/ATCGTGCTCATT...GTTAG|ATT | 0 | 1 | 44.526 |
3436806 | GT-AG | 0 | 2.5274561114919816e-05 | 60 | rna-EDS130_LOCUS559 623754 | 18 | 1562567 | 1562626 | Adineta ricciae 249248 | CAT|GTAAAGCTCA...CATTTCTTATTT/TTGATATTCATT...TCCAG|AAC | 0 | 1 | 45.601 |
3436807 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-EDS130_LOCUS559 623754 | 19 | 1563310 | 1563360 | Adineta ricciae 249248 | CAG|GTAAATGTCT...CTATTCTTGAAT/TTTTTATTAATT...TCTAG|AGC | 2 | 1 | 54.348 |
3436808 | GT-AG | 0 | 0.0003074907244492 | 56 | rna-EDS130_LOCUS559 623754 | 20 | 1563683 | 1563738 | Adineta ricciae 249248 | CAA|GTAAATTTTC...TTTGCTTTCGCG/TTTCGCGTCATT...TTTAG|GCA | 0 | 1 | 58.471 |
3436809 | GT-AG | 0 | 1.000000099473604e-05 | 96 | rna-EDS130_LOCUS559 623754 | 21 | 1563883 | 1563978 | Adineta ricciae 249248 | GAT|GTTAGTATAC...TTCTCATTATCT/CATTTTCTCATT...TTTAG|ATT | 0 | 1 | 60.315 |
3436810 | GT-AG | 0 | 0.001508285127327 | 64 | rna-EDS130_LOCUS559 623754 | 22 | 1564141 | 1564204 | Adineta ricciae 249248 | AAA|GTATATAAAT...ATCTCTTTGTTT/TGTTTCGTCATT...TACAG|ATC | 0 | 1 | 62.39 |
3436811 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-EDS130_LOCUS559 623754 | 23 | 1564346 | 1564403 | Adineta ricciae 249248 | GCT|GTGAGGCTTC...TCATTTTTATTT/ATCATTTTTATT...CTCAG|GGA | 0 | 1 | 64.195 |
3436812 | GC-AG | 0 | 1.000000099473604e-05 | 54 | rna-EDS130_LOCUS559 623754 | 24 | 1564586 | 1564639 | Adineta ricciae 249248 | CAA|GCACGTTGAT...AGCTTCTTAAAT/TAGCTTCTTAAA...TGTAG|GGA | 2 | 1 | 66.526 |
3436813 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-EDS130_LOCUS559 623754 | 25 | 1564764 | 1564816 | Adineta ricciae 249248 | CAG|GTTCGTTTCT...TCTTCTTCAATA/ATCTTCTTCAAT...TCTAG|TTC | 0 | 1 | 68.114 |
3436814 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-EDS130_LOCUS559 623754 | 26 | 1565567 | 1565620 | Adineta ricciae 249248 | AAG|GTAAGAGAAA...TTGCTTTTCACT/TTGCTTTTCACT...TTTAG|GTT | 0 | 1 | 77.718 |
3436815 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-EDS130_LOCUS559 623754 | 27 | 1565717 | 1565773 | Adineta ricciae 249248 | CTC|GTAAGAATCG...CGTTTTTTAAAT/CGTTTTTTAAAT...AAAAG|GAT | 0 | 1 | 78.947 |
3436816 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-EDS130_LOCUS559 623754 | 28 | 1565972 | 1566022 | Adineta ricciae 249248 | AAA|GTAAGAAATT...AGAGTCCAAGTA/TGTAAAATAATA...TTCAG|CTT | 0 | 1 | 81.483 |
3436817 | GT-AG | 0 | 3.107375276771654e-05 | 52 | rna-EDS130_LOCUS559 623754 | 29 | 1566395 | 1566446 | Adineta ricciae 249248 | CAA|GTAAATCATT...CTTGCTTTGAAT/TTTGAATTAATC...TTTAG|TTT | 0 | 1 | 86.247 |
3436818 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-EDS130_LOCUS559 623754 | 30 | 1566775 | 1566833 | Adineta ricciae 249248 | AAG|GTAAATCGAT...TAGATTTTAGTT/ATTTTAGTTATG...CATAG|GTA | 1 | 1 | 90.447 |
3436819 | GT-AG | 0 | 1.000000099473604e-05 | 279 | rna-EDS130_LOCUS559 623754 | 31 | 1566959 | 1567237 | Adineta ricciae 249248 | AAA|GTGATTATGA...AAGTTATTGATA/AAGTTATTGATA...TCTAG|GAA | 0 | 1 | 92.048 |
3436820 | GT-AG | 0 | 6.495413880268328e-05 | 514 | rna-EDS130_LOCUS559 623754 | 32 | 1567364 | 1567877 | Adineta ricciae 249248 | AAA|GTAAGTTTAA...GATGCTTTCATT/GATGCTTTCATT...TGTAG|CTT | 0 | 1 | 93.661 |
3436821 | GT-AG | 0 | 1.9137432215382205e-05 | 65 | rna-EDS130_LOCUS559 623754 | 33 | 1567938 | 1568002 | Adineta ricciae 249248 | GCT|GTAAGTAGAC...TTTTCGTTATTA/TTTCATCTCATT...TTCAG|GCA | 0 | 1 | 94.43 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);