introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
24 rows where transcript_id = 720753
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821730 | GT-AG | 0 | 2.392559041555063e-05 | 53 | rna-gnl|I4U23|001262-T1 720753 | 1 | 4137157 | 4137209 | Adineta vaga 104782 | TGA|GTAAATAAAT...CATATGTTAACT/TGTTAACTTATT...TTCAG|AGA | 2 | 1 | 1.956 |
3821731 | GT-AG | 0 | 1.000000099473604e-05 | 117 | rna-gnl|I4U23|001262-T1 720753 | 2 | 4136814 | 4136930 | Adineta vaga 104782 | CAG|GTAGATAGAA...GATCTCTTATAT/TTTATTTTTATA...CTTAG|AAA | 0 | 1 | 4.984 |
3821732 | GT-AG | 0 | 2.4672935364964523e-05 | 65 | rna-gnl|I4U23|001262-T1 720753 | 3 | 4136586 | 4136650 | Adineta vaga 104782 | TGT|GTAAGTGATC...TCTTTCTTATAT/TTCTTTCTTATA...TACAG|TTG | 1 | 1 | 7.168 |
3821733 | GT-AG | 0 | 5.044691311603269e-05 | 63 | rna-gnl|I4U23|001262-T1 720753 | 4 | 4135927 | 4135989 | Adineta vaga 104782 | AAT|GTAAGTTATC...TCAATCGTAATA/GTAATAATCATT...TACAG|AAT | 0 | 1 | 15.153 |
3821734 | GT-AG | 0 | 0.0001843592412245 | 61 | rna-gnl|I4U23|001262-T1 720753 | 5 | 4135476 | 4135536 | Adineta vaga 104782 | CGA|GTATGAAATT...GTTTTCTTTTCG/TTTCGTTTCAAA...TCTAG|GCA | 0 | 1 | 20.378 |
3821735 | GT-AG | 0 | 2.3512190793640792e-05 | 62 | rna-gnl|I4U23|001262-T1 720753 | 6 | 4135252 | 4135313 | Adineta vaga 104782 | GCT|GTAAGATTAT...TATTCTTTTTCA/TTCTTTTTCAAT...ATTAG|AAT | 0 | 1 | 22.548 |
3821736 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|001262-T1 720753 | 7 | 4134932 | 4134984 | Adineta vaga 104782 | AAA|GTAATACGAA...CTTTTTTTATTT/CCTTTTTTTATT...TGTAG|TTA | 0 | 1 | 26.125 |
3821737 | GT-AG | 0 | 1.000000099473604e-05 | 260 | rna-gnl|I4U23|001262-T1 720753 | 8 | 4134490 | 4134749 | Adineta vaga 104782 | ATG|GTAAGATGAC...TTTTTCTTAAAA/TTTTTTCTTAAA...TTTAG|GTG | 2 | 1 | 28.564 |
3821738 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-gnl|I4U23|001262-T1 720753 | 9 | 4133691 | 4133761 | Adineta vaga 104782 | ATG|GTAAGGAAAA...GTTTTCTTATTC/TGTTTTCTTATT...GTTAG|GTA | 1 | 1 | 38.317 |
3821739 | GT-AG | 0 | 0.0005984364145869 | 52 | rna-gnl|I4U23|001262-T1 720753 | 10 | 4133147 | 4133198 | Adineta vaga 104782 | TGC|GTAAGCTAAT...TTATCTTTTGCT/CTTTTGCTCAAT...AATAG|AAG | 1 | 1 | 44.909 |
3821740 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-gnl|I4U23|001262-T1 720753 | 11 | 4132989 | 4133056 | Adineta vaga 104782 | TTT|GTAAGTCAGA...ATTTATTTATTT/TGTTTATTCATT...TTTAG|TAG | 1 | 1 | 46.115 |
3821741 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|001262-T1 720753 | 12 | 4132864 | 4132916 | Adineta vaga 104782 | AAG|GTAATTTGAA...AAAATTTTTTCA/AGAGAATTAAAA...TGTAG|CTG | 1 | 1 | 47.079 |
3821742 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|001262-T1 720753 | 13 | 4132472 | 4132535 | Adineta vaga 104782 | AGA|GTAAGTAAAA...TTTTCGTTTGTT/TAAGAGTTAATT...CATAG|ATT | 2 | 1 | 51.474 |
3821743 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl|I4U23|001262-T1 720753 | 14 | 4132087 | 4132145 | Adineta vaga 104782 | CTG|GTAAGTAAAT...CTTTTCTTGATC/CTTTTCTTGATC...TTCAG|TCC | 1 | 1 | 55.841 |
3821744 | GT-AG | 0 | 1.000000099473604e-05 | 111 | rna-gnl|I4U23|001262-T1 720753 | 15 | 4131955 | 4132065 | Adineta vaga 104782 | CGT|GTGAGTAAAG...GTTTTTTTAATC/GTTTTTTTAATC...TTAAG|ATT | 1 | 1 | 56.123 |
3821745 | GT-AG | 0 | 0.0040329613721874 | 152 | rna-gnl|I4U23|001262-T1 720753 | 16 | 4131334 | 4131485 | Adineta vaga 104782 | TGG|GTAAACTATT...AATTTTTTATCA/GAATTTTTTATC...AATAG|ATA | 2 | 1 | 62.406 |
3821746 | GT-AG | 0 | 8.13155715621151e-05 | 59 | rna-gnl|I4U23|001262-T1 720753 | 17 | 4130668 | 4130726 | Adineta vaga 104782 | ATG|GTACAAATGA...TTTCTCTTATCA/TTTTCTCTTATC...TTTAG|CCT | 0 | 1 | 70.539 |
3821747 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-gnl|I4U23|001262-T1 720753 | 18 | 4130087 | 4130174 | Adineta vaga 104782 | AAT|GTGAGTAATT...AAGTTATTGATT/AAGTTATTGATT...TTTAG|CTG | 1 | 1 | 77.144 |
3821748 | GT-AG | 0 | 1.34126139754473e-05 | 57 | rna-gnl|I4U23|001262-T1 720753 | 19 | 4129505 | 4129561 | Adineta vaga 104782 | TTG|GTAGGTTGAA...TCATTTTCAATC/ATAGTTTTCATT...AACAG|ATT | 1 | 1 | 84.177 |
3821749 | GT-AG | 0 | 1.000000099473604e-05 | 214 | rna-gnl|I4U23|001262-T1 720753 | 20 | 4129243 | 4129456 | Adineta vaga 104782 | ATG|GTGAGTTCAA...TATTCTTTAACA/ATTTATTTCATT...TTTAG|CAG | 1 | 1 | 84.82 |
3821750 | GT-AG | 0 | 1.000000099473604e-05 | 72 | rna-gnl|I4U23|001262-T1 720753 | 21 | 4128722 | 4128793 | Adineta vaga 104782 | CAA|GTAAGAGACA...TATTCTTTGATT/TCCATTCTTATT...ATTAG|TCA | 0 | 1 | 90.836 |
3821751 | GT-AG | 0 | 1.000000099473604e-05 | 63 | rna-gnl|I4U23|001262-T1 720753 | 22 | 4128444 | 4128506 | Adineta vaga 104782 | TCG|GTAAAATAGT...AAATTTTTAATC/AAATTTTTAATC...TCTAG|AAT | 2 | 1 | 93.717 |
3821752 | GT-AG | 0 | 1.000000099473604e-05 | 69 | rna-gnl|I4U23|001262-T1 720753 | 23 | 4128266 | 4128334 | Adineta vaga 104782 | AAT|GTTCGTACTC...ATAATTTAAACC/AATAATTTAAAC...CGTAG|GAT | 0 | 1 | 95.177 |
3821753 | GT-AG | 0 | 0.0002463137222327 | 59 | rna-gnl|I4U23|001262-T1 720753 | 24 | 4128095 | 4128153 | Adineta vaga 104782 | TTA|GTAAATCTCA...TATTTTCTAACG/TATTTTCTAACG...TGTAG|ATA | 1 | 1 | 96.677 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);