introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; NULL for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
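The link between `dinucleotide_pair` and `scored_motifs` can be illustrated with a short Python sketch. It assumes, based only on the values listed on this page, that the `|` characters in `scored_motifs` mark the exon/intron boundaries, so the intron begins right after the first `|` and ends right before the last one. The function name `terminal_dinucleotides` is hypothetical and not part of the database.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's first and last two bases as 'XX-YY'.

    Assumption: the first '|' precedes the intron's 5' end and the
    last '|' follows its 3' end (inferred from the sample rows only).
    """
    first_bar = scored_motifs.index("|")
    last_bar = scored_motifs.rindex("|")
    if first_bar == last_bar:
        raise ValueError("expected two '|' boundary markers")
    five_prime = scored_motifs[first_bar + 1:first_bar + 3]
    three_prime = scored_motifs[last_bar - 2:last_bar]
    return f"{five_prime}-{three_prime}"


# scored_motifs value of intron 3822921, copied from this page
motifs = "TGG|GTAAATTGAA...AATTTTTCAAAG/GAATTTTTCAAA...TTTAG|ATT"
pair = terminal_dinucleotides(motifs)
print(pair)  # GT-AG, matching the dinucleotide_pair column for that row
```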
26 rows where transcript_id = 720825
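This listing corresponds to a simple `WHERE` filter on `transcript_id`. The sketch below reproduces it with Python's built-in `sqlite3` module against an in-memory table. The table holds only a reduced, illustrative subset of the columns and three of the rows shown on this page, so it is a stand-in for the real database rather than a copy of it.

```python
import sqlite3

# In-memory database with a reduced subset of the introns columns (illustrative).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, length INTEGER, phase INTEGER)"
)

# Three rows copied from the listing on this page, inserted out of order.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?)",
    [
        (3822923, 720825, 3, 55, 1),
        (3822921, 720825, 1, 50, 2),
        (3822922, 720825, 2, 61, 2),
    ],
)

# Parameterized equivalent of "rows where transcript_id = 720825",
# ordered by the intron's position within the transcript.
rows = conn.execute(
    "SELECT id, ordinal_index, length, phase FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (720825,),
).fetchall()

for row in rows:
    print(row)
```

Passing the transcript id as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with other ids.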
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822921 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|I4U23|000290-T1 720825 | 1 | 1221506 | 1221555 | Adineta vaga 104782 | TGG|GTAAATTGAA...AATTTTTCAAAG/GAATTTTTCAAA...TTTAG|ATT | 2 | 1 | 3.468 |
3822922 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl|I4U23|000290-T1 720825 | 2 | 1221688 | 1221748 | Adineta vaga 104782 | TCG|GTAAGATAAC...AAATTCTTATTA/AAAATTCTTATT...TTAAG|TTC | 2 | 1 | 6.21 |
3822923 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|000290-T1 720825 | 3 | 1221949 | 1222003 | Adineta vaga 104782 | TTG|GTAAATAATT...TTTTCTTTTGTT/GAATTTTTCAAA...TGTAG|CTC | 1 | 1 | 10.363 |
3822924 | GT-AG | 0 | 1.000000099473604e-05 | 165 | rna-gnl|I4U23|000290-T1 720825 | 4 | 1222255 | 1222419 | Adineta vaga 104782 | TCG|GTAAAGATCT...TGTATCTTCATG/TGTATCTTCATG...TATAG|CAA | 0 | 1 | 15.576 |
3822925 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|000290-T1 720825 | 5 | 1222497 | 1222554 | Adineta vaga 104782 | TGC|GTAAGAAAAT...TGCTTTTTAATT/ATTTTATTGATT...ATTAG|ATA | 2 | 1 | 17.175 |
3822926 | GT-AG | 0 | 0.0035286801305279 | 57 | rna-gnl|I4U23|000290-T1 720825 | 6 | 1222730 | 1222786 | Adineta vaga 104782 | AAA|GTATGTTCAA...TCTTTCTTTTTT/CTTTTTTTCAAA...TTTAG|ATG | 0 | 1 | 20.81 |
3822927 | GT-AG | 0 | 9.963361043156916e-05 | 55 | rna-gnl|I4U23|000290-T1 720825 | 7 | 1222875 | 1222929 | Adineta vaga 104782 | TAT|GTAATTCTAA...TATTTCTTTTCA/TTTCTTTTCAAA...TTTAG|ATC | 1 | 1 | 22.638 |
3822928 | GT-AG | 0 | 1.000000099473604e-05 | 74 | rna-gnl|I4U23|000290-T1 720825 | 8 | 1223021 | 1223094 | Adineta vaga 104782 | GAG|GTCAATTGAA...TTTTCCTTTTCT/ACCAGATTTACT...GATAG|TCA | 2 | 1 | 24.528 |
3822929 | GT-AG | 0 | 0.0001761118685981 | 60 | rna-gnl|I4U23|000290-T1 720825 | 9 | 1223144 | 1223203 | Adineta vaga 104782 | GAA|GTAAATTCAT...TTCTCTTTATTG/ATTGTATTGATT...TTTAG|GCG | 0 | 1 | 25.545 |
3822930 | GT-AG | 0 | 1.000000099473604e-05 | 106 | rna-gnl|I4U23|000290-T1 720825 | 10 | 1223285 | 1223390 | Adineta vaga 104782 | CTG|GTAAGAAGAA...CTTTCGTTAAAC/CTTTCGTTAAAC...TTTAG|TGT | 0 | 1 | 27.227 |
3822931 | GT-AG | 0 | 8.047254746881015e-05 | 68 | rna-gnl|I4U23|000290-T1 720825 | 11 | 1223483 | 1223550 | Adineta vaga 104782 | AAG|GTATGATATT...TTGTTTTTGTTT/AATCTACTCATA...TACAG|TTT | 2 | 1 | 29.138 |
3822932 | GT-AG | 0 | 7.796590735426075e-05 | 52 | rna-gnl|I4U23|000290-T1 720825 | 12 | 1223741 | 1223792 | Adineta vaga 104782 | CCT|GTAAGTATCT...GTCTATTTGATC/GTCTATTTGATC...TTTAG|TTA | 0 | 1 | 33.084 |
3822933 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|000290-T1 720825 | 13 | 1223895 | 1223949 | Adineta vaga 104782 | AAA|GTAAGATTTG...TGTATTTCAATC/TTGTATTTCAAT...CTTAG|GAT | 0 | 1 | 35.202 |
3822934 | GT-AG | 0 | 6.091170296896744e-05 | 64 | rna-gnl|I4U23|000290-T1 720825 | 14 | 1223997 | 1224060 | Adineta vaga 104782 | ACG|GTAAATTGTG...GTTTTCATATTG/TTGGTTTTCATA...TGAAG|ATT | 2 | 1 | 36.179 |
3822935 | GT-AG | 0 | 1.4492586865950807e-05 | 51 | rna-gnl|I4U23|000290-T1 720825 | 15 | 1224221 | 1224271 | Adineta vaga 104782 | AAG|GTATTGATAA...AATCTTTTCATT/ATTTGACTCATC...TATAG|CAT | 0 | 1 | 39.502 |
3822936 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|000290-T1 720825 | 16 | 1224332 | 1224391 | Adineta vaga 104782 | CAA|GTGAGAGAGA...TTCTTCTTGAAA/TGCAATTTAATT...TCCAG|TCA | 0 | 1 | 40.748 |
3822937 | GT-AG | 0 | 1.7606226789858757e-05 | 59 | rna-gnl|I4U23|000290-T1 720825 | 17 | 1224547 | 1224605 | Adineta vaga 104782 | TCG|GTAAATTGAA...CTCTGTTTGATG/CTCTGTTTGATG...TAAAG|ATT | 2 | 1 | 43.967 |
3822938 | GT-AG | 0 | 1.000000099473604e-05 | 71 | rna-gnl|I4U23|000290-T1 720825 | 18 | 1224736 | 1224806 | Adineta vaga 104782 | ACG|GTAAGTGGAA...TTTCTCTTATAC/GTTTCTCTTATA...TTTAG|GCA | 0 | 1 | 46.667 |
3822939 | GT-AG | 0 | 1.2704493727920072e-05 | 59 | rna-gnl|I4U23|000290-T1 720825 | 19 | 1224972 | 1225030 | Adineta vaga 104782 | AAA|GTAATTGATT...TTCTATTTAATT/TTCTATTTAATT...AATAG|AGA | 0 | 1 | 50.093 |
3822940 | GT-AG | 0 | 1.1048211589420428e-05 | 55 | rna-gnl|I4U23|000290-T1 720825 | 20 | 1225633 | 1225687 | Adineta vaga 104782 | AGG|GTAAATACAT...TATTTCTTGATG/TATCTTTTCATT...TACAG|AAC | 2 | 1 | 62.596 |
3822941 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-gnl|I4U23|000290-T1 720825 | 21 | 1226080 | 1226144 | Adineta vaga 104782 | ATG|GTAAAACAAA...TTTTTCTTTACA/TTTTTCTTTACA...TTTAG|TTG | 1 | 1 | 70.737 |
3822942 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|000290-T1 720825 | 22 | 1226881 | 1226938 | Adineta vaga 104782 | TGG|GTAGGAATTG...CATTTATTAATT/CATTTATTAATT...TTTAG|AAA | 2 | 1 | 86.023 |
3822943 | GT-AG | 0 | 0.0545521726843638 | 60 | rna-gnl|I4U23|000290-T1 720825 | 23 | 1227042 | 1227101 | Adineta vaga 104782 | GAA|GTAGTTTTTC...TTTTTCTTATTA/TTTTTATTTATT...TCAAG|ATA | 0 | 1 | 88.162 |
3822944 | GT-AG | 0 | 0.0002761958905087 | 53 | rna-gnl|I4U23|000290-T1 720825 | 24 | 1227229 | 1227281 | Adineta vaga 104782 | GAG|GTAATTTTAT...ATTTCCTTAAGA/TATTTCCTTAAG...AACAG|AGT | 1 | 1 | 90.8 |
3822945 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl|I4U23|000290-T1 720825 | 25 | 1227535 | 1227596 | Adineta vaga 104782 | AAG|GTAAACGAAT...ATGTATTTGATT/TTTGATTTCATT...TTTAG|TTT | 2 | 1 | 96.054 |
3822946 | GT-AG | 0 | 1.000000099473604e-05 | 47 | rna-gnl|I4U23|000290-T1 720825 | 26 | 1227736 | 1227782 | Adineta vaga 104782 | AAA|GTAAAGAAAA...TTCGTTTTATTT/TTTCGTTTTATT...TCTAG|TTA | 0 | 1 | 98.941 |
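In the rows above, `length` equals `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. This is an observation from the listed values, not a documented guarantee. A minimal Python check over a few of the rows (the helper name `inclusive_length` is only illustrative):

```python
# (id, length, start, end) copied from rows of the table above
rows = [
    (3822921, 50, 1221506, 1221555),
    (3822924, 165, 1222255, 1222419),
    (3822940, 55, 1225633, 1225687),
    (3822946, 47, 1227736, 1227782),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both inclusive."""
    return end - start + 1


# Collect the ids of any rows whose stored length disagrees with the coordinates.
mismatches = [
    intron_id
    for intron_id, length, start, end in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # [] for the rows listed here
```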
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
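The schema defines an index on `transcript_id`, which is what lets a per-transcript lookup avoid a full table scan. The sketch below, a rough illustration using Python's built-in `sqlite3` module, recreates a reduced version of the table (only three columns and one index, an assumption made for brevity) and asks SQLite for its query plan. The exact wording of the plan text varies between SQLite versions, so the code only checks whether the index name appears in it.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns schema: three columns and one index (illustrative).
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "transcript_id" INTEGER,
      "taxonomy_id" INTEGER,
      PRIMARY KEY ([id])
    );
    CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
    """
)

# Ask SQLite how it would execute a lookup by transcript_id.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (720825,),
).fetchall()

# The human-readable description is the last (fourth) column of each plan row.
details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in d for d in details)

for d in details:
    print(d)
print("uses transcript_id index:", uses_index)
```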