introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
28 rows where transcript_id = 720784
This data as json, CSV (advanced)
Suggested facets: score, length
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822303 | GT-AG | 0 | 0.4490808868661202 | 348 | rna-gnl|I4U23|001854-T1 720784 | 1 | 5905622 | 5905969 | Adineta vaga 104782 | TCA|GTAACTTTCT...TTTTCCTTTCTA/TCCTTTCTAATC...TTAAG|GTG | 1 | 1 | 0.991 |
3822304 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-gnl|I4U23|001854-T1 720784 | 2 | 5905041 | 5905108 | Adineta vaga 104782 | CAG|GTAATTAGAT...GAAATCTTAATC/ATTTCTTTTACT...TTCAG|GTA | 1 | 1 | 9.756 |
3822305 | GT-AG | 0 | 1.000000099473604e-05 | 83 | rna-gnl|I4U23|001854-T1 720784 | 3 | 5904823 | 5904905 | Adineta vaga 104782 | CTG|GTAAAATACG...ATTTCTTTCATC/ATTTCTTTCATC...TCAAG|CGC | 1 | 1 | 12.062 |
3822306 | GT-AG | 0 | 1.000000099473604e-05 | 462 | rna-gnl|I4U23|001854-T1 720784 | 4 | 5904277 | 5904738 | Adineta vaga 104782 | TCG|GTGAGTGAAT...CACTTCTTACCA/CTGACTTTCACT...ATCAG|GTT | 1 | 1 | 13.497 |
3822307 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|001854-T1 720784 | 5 | 5903900 | 5903964 | Adineta vaga 104782 | CTG|GTAGGAAGAC...TTATTATTATTA/ATTATTATTATT...TTTAG|GTT | 1 | 1 | 18.828 |
3822308 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|001854-T1 720784 | 6 | 5903470 | 5903530 | Adineta vaga 104782 | GAA|GTAAGTCACG...AAATGCTTATCT/GCTTATCTGATA...ACTAG|TCG | 1 | 1 | 25.132 |
3822309 | GT-AG | 0 | 0.0012459070162771 | 719 | rna-gnl|I4U23|001854-T1 720784 | 7 | 5902577 | 5903295 | Adineta vaga 104782 | CGG|GTAACTTACA...AAAATATTAATT/AAAATATTAATT...CCTAG|GAA | 1 | 1 | 28.105 |
3822310 | GT-AG | 0 | 1.000000099473604e-05 | 137 | rna-gnl|I4U23|001854-T1 720784 | 8 | 5902293 | 5902429 | Adineta vaga 104782 | AAG|GTTTGATATA...AAGATTTTAGAG/TAAGATTTTAGA...TCTAG|GAC | 1 | 1 | 30.617 |
3822311 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|001854-T1 720784 | 9 | 5902154 | 5902208 | Adineta vaga 104782 | TTG|GTGAGTTGAC...TTATTCTTGTTT/CATCGTCTCAAT...CCTAG|GTG | 1 | 1 | 32.052 |
3822312 | GT-AG | 0 | 2.236911706642628e-05 | 55 | rna-gnl|I4U23|001854-T1 720784 | 10 | 5901958 | 5902012 | Adineta vaga 104782 | TTA|GTAAATAAAA...CGAATTTTAATT/CGAATTTTAATT...TTTAG|TTG | 1 | 1 | 34.461 |
3822313 | GT-AG | 0 | 1.000000099473604e-05 | 76 | rna-gnl|I4U23|001854-T1 720784 | 11 | 5901708 | 5901783 | Adineta vaga 104782 | ATG|GTAAGAAAAA...TTTTTTTTATTA/TTTTTTTTTATT...AATAG|GAT | 1 | 1 | 37.434 |
3822314 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|001854-T1 720784 | 12 | 5901275 | 5901338 | Adineta vaga 104782 | CAA|GTAAAATTAA...TGCTGTTTGATT/TGCTGTTTGATT...TTTAG|TTA | 1 | 1 | 43.738 |
3822315 | GT-AG | 0 | 0.0003697405945855 | 53 | rna-gnl|I4U23|001854-T1 720784 | 13 | 5901051 | 5901103 | Adineta vaga 104782 | ATA|GTAAGTTTGG...GGAATTTTGACT/GGAATTTTGACT...TTTAG|TAC | 1 | 1 | 46.66 |
3822316 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|001854-T1 720784 | 14 | 5900893 | 5900954 | Adineta vaga 104782 | CTG|GTAAGTTCAA...TTAATCTTATAA/TATAAGTTCATT...TTTAG|TCG | 1 | 1 | 48.3 |
3822317 | GT-AG | 0 | 0.0057358166020927 | 59 | rna-gnl|I4U23|001854-T1 720784 | 15 | 5900141 | 5900199 | Adineta vaga 104782 | TTG|GTATGTTCGA...GTTTTCTTCTCA/TTTCTTCTCAAA...ATTAG|ATC | 1 | 1 | 60.14 |
3822318 | GT-AG | 0 | 1.3012420424646872e-05 | 60 | rna-gnl|I4U23|001854-T1 720784 | 16 | 5899823 | 5899882 | Adineta vaga 104782 | GAT|GTAAGATCAT...TATATCTTAATT/TATATCTTAATT...TCTAG|GTT | 1 | 1 | 64.548 |
3822319 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-gnl|I4U23|001854-T1 720784 | 17 | 5899668 | 5899741 | Adineta vaga 104782 | TCG|GTAAGATTGA...ATTTTTTTAAAC/TTTAAACTTATT...TCTAG|ATT | 1 | 1 | 65.932 |
3822320 | GT-AG | 0 | 3.15648122131054e-05 | 63 | rna-gnl|I4U23|001854-T1 720784 | 18 | 5899437 | 5899499 | Adineta vaga 104782 | AAG|GTTTTATATG...TTTGTTTTGAAT/TTTGTTTTGAAT...TTTAG|ATC | 1 | 1 | 68.802 |
3822321 | GT-AG | 0 | 1.000000099473604e-05 | 97 | rna-gnl|I4U23|001854-T1 720784 | 19 | 5899256 | 5899352 | Adineta vaga 104782 | CTG|GTACGAGATC...TTTTTCTTTCTA/AGAGATTTGAAT...TTTAG|AAG | 1 | 1 | 70.237 |
3822322 | GT-AG | 0 | 1.000000099473604e-05 | 70 | rna-gnl|I4U23|001854-T1 720784 | 20 | 5899003 | 5899072 | Adineta vaga 104782 | CTG|GTAATAAATG...TTTCCCCTACAA/AATATGCAAACT...CATAG|ATC | 1 | 1 | 73.364 |
3822323 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|001854-T1 720784 | 21 | 5898861 | 5898918 | Adineta vaga 104782 | TTG|GTGAGTACAA...TTGATTTTGAAA/TTCAAATTCATC...TCAAG|GAA | 1 | 1 | 74.799 |
3822324 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|001854-T1 720784 | 22 | 5898359 | 5898413 | Adineta vaga 104782 | ATG|GTAAAAATAA...TTTTTTTTATCA/TTTTTTTTTATC...ATTAG|ATG | 1 | 1 | 82.436 |
3822325 | GT-AG | 0 | 0.0253123600560463 | 62 | rna-gnl|I4U23|001854-T1 720784 | 23 | 5898213 | 5898274 | Adineta vaga 104782 | TTG|GTATGTTTCT...ATATTTTTATTT/AATATTTTTATT...AATAG|GAT | 1 | 1 | 83.872 |
3822326 | GT-AG | 0 | 0.0097620404583088 | 64 | rna-gnl|I4U23|001854-T1 720784 | 24 | 5897960 | 5898023 | Adineta vaga 104782 | AAG|GTTTTCTTTC...CTTTCTTGAACA/AATTTGTTCATT...CATAG|ATC | 1 | 1 | 87.101 |
3822327 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|001854-T1 720784 | 25 | 5897818 | 5897875 | Adineta vaga 104782 | CAG|GTAAATGAGA...ATATTCTTATTT/TATATTCTTATT...CTTAG|GTT | 1 | 1 | 88.536 |
3822328 | GT-AG | 0 | 0.0689265220605828 | 58 | rna-gnl|I4U23|001854-T1 720784 | 26 | 5897676 | 5897733 | Adineta vaga 104782 | AAC|GTATGTATTG...TTTATCTTGATA/TATAATTTAATT...AATAG|TAT | 1 | 1 | 89.971 |
3822329 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|001854-T1 720784 | 27 | 5897305 | 5897357 | Adineta vaga 104782 | TCG|GTGAGTAATA...ATTTTCTTCGTC/TCGTCTCTAACG...AATAG|ATC | 1 | 1 | 95.404 |
3822330 | GT-AG | 0 | 0.0004062471593043 | 53 | rna-gnl|I4U23|001854-T1 720784 | 28 | 5897177 | 5897229 | Adineta vaga 104782 | TAA|GTAAGTTATT...TTATTTTTAATT/TTATTTTTAATT...AATAG|TCG | 1 | 1 | 96.685 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);