introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
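The relationship between the length, start, and end columns can be checked on a small sample. The following is a minimal sketch, assuming Python's standard sqlite3 module and a reduced in-memory table with only the columns needed here; the two sample rows are copied from the transcript 720820 listing on this page. It tests whether length equals end - start + 1 (inclusive coordinates), which holds for these rows but is an observation from the sample, not a documented guarantee.

```python
import sqlite3

# Reduced, illustrative table: only the columns needed for this check.
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ("id" INTEGER PRIMARY KEY, "length" INTEGER, '
    '"start" INTEGER, "end" INTEGER)'
)

# (id, length, start, end) for the first two introns of transcript 720820.
sample_rows = [
    (3822867, 48, 11987920, 11987967),
    (3822868, 49, 11987671, 11987719),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", sample_rows)

# Rows whose stored length disagrees with the inclusive coordinate span.
mismatches = conn.execute(
    'SELECT "id" FROM introns WHERE "length" != "end" - "start" + 1'
).fetchall()
print(mismatches)
```

An empty result means every sampled row is consistent with inclusive start and end coordinates.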
27 rows where transcript_id = 720820
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822867 | GT-AG | 0 | 0.002047910156917 | 48 | rna-gnl|I4U23|003869-T1 720820 | 1 | 11987920 | 11987967 | Adineta vaga 104782 | TAT|GTATGTAAAA...TTACTTTTGAAA/ATTTATGTTACT...CCTAG|GTA | 0 | 1 | 4.446 |
3822868 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl|I4U23|003869-T1 720820 | 2 | 11987671 | 11987719 | Adineta vaga 104782 | TGA|GTAAGTCAAA...CAGTTCATAATG/TTTCAGTTCATA...AATAG|ATA | 2 | 1 | 8.506 |
3822869 | GT-AG | 0 | 0.015790039287878 | 52 | rna-gnl|I4U23|003869-T1 720820 | 3 | 11987490 | 11987541 | Adineta vaga 104782 | TGC|GTATGTGTAA...TTCGCTTTGAAT/TTGAATGTGATT...TTTAG|GTA | 2 | 1 | 11.125 |
3822870 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|003869-T1 720820 | 4 | 11987338 | 11987397 | Adineta vaga 104782 | CAG|GTAAAACATT...TGTTTATTAATT/TAATTATTCATT...TATAG|GTT | 1 | 1 | 12.992 |
3822871 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003869-T1 720820 | 5 | 11987189 | 11987242 | Adineta vaga 104782 | TTG|GTTTGACATT...TAGATCTTTTTT/TATTTAGTGAAT...AATAG|ATT | 0 | 1 | 14.921 |
3822872 | GT-AG | 0 | 0.0514669443228313 | 47 | rna-gnl|I4U23|003869-T1 720820 | 6 | 11987009 | 11987055 | Adineta vaga 104782 | AAA|GTATGTTGAG...GTTTTCTTGACA/GTTTTCTTGACA...TTTAG|GTT | 1 | 1 | 17.621 |
3822873 | GT-AG | 0 | 0.0611172548616978 | 60 | rna-gnl|I4U23|003869-T1 720820 | 7 | 11986846 | 11986905 | Adineta vaga 104782 | AAA|GTATGTTTTG...ATTCATTTAACA/TTTTGCTTCATC...TTTAG|TCC | 2 | 1 | 19.712 |
3822874 | GT-AG | 0 | 0.0001570829386354 | 51 | rna-gnl|I4U23|003869-T1 720820 | 8 | 11986774 | 11986824 | Adineta vaga 104782 | AAG|GTATAGATTT...ACTTCTTTGTTT/TTTTTGTTCATA...TTTAG|CTT | 2 | 1 | 20.138 |
3822875 | GT-AG | 0 | 0.0003760623930878 | 57 | rna-gnl|I4U23|003869-T1 720820 | 9 | 11986638 | 11986694 | Adineta vaga 104782 | TTA|GTAAACGAAA...TTTCTTTTAACT/TTTCTTTTAACT...GATAG|GTC | 0 | 1 | 21.742 |
3822876 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl|I4U23|003869-T1 720820 | 10 | 11986345 | 11986400 | Adineta vaga 104782 | AAA|GTAAAATAGT...TTCGTTTTAAAT/AAGTTTTTCATT...TTTAG|AAA | 0 | 1 | 26.553 |
3822877 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl|I4U23|003869-T1 720820 | 11 | 11986064 | 11986130 | Adineta vaga 104782 | AAA|GTCAGTTATT...CTCTTCTCAAAA/TAAAATCTCATT...TTCAG|ATG | 1 | 1 | 30.897 |
3822878 | GT-AG | 0 | 1.7343178892165522e-05 | 47 | rna-gnl|I4U23|003869-T1 720820 | 12 | 11985916 | 11985962 | Adineta vaga 104782 | CAA|GTAAGTTCTA...ATTGTTTCGACA/GACAAACTTACA...TCTAG|GCA | 0 | 1 | 32.948 |
3822879 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003869-T1 720820 | 13 | 11985547 | 11985600 | Adineta vaga 104782 | TTG|GTAAATAATT...ATATCGTTTGCA/ATAAAAGTTATA...TATAG|TTA | 0 | 1 | 39.342 |
3822880 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|003869-T1 720820 | 14 | 11985352 | 11985405 | Adineta vaga 104782 | CAG|GTAAGTGAAA...TCATTCGTAATG/TAAAGATTCATT...TATAG|TTA | 0 | 1 | 42.205 |
3822881 | GT-AG | 0 | 0.0002108471498215 | 56 | rna-gnl|I4U23|003869-T1 720820 | 15 | 11985216 | 11985271 | Adineta vaga 104782 | TAT|GTAAGTTTAT...CAAATTTTGATC/CAAATTTTGATC...CAAAG|GAT | 2 | 1 | 43.829 |
3822882 | GT-AG | 0 | 3.867855759566874e-05 | 51 | rna-gnl|I4U23|003869-T1 720820 | 16 | 11985047 | 11985097 | Adineta vaga 104782 | GTG|GTAAGTTAAA...GATTTCTTATTC/AGATTTCTTATT...CATAG|ATA | 0 | 1 | 46.224 |
3822883 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|003869-T1 720820 | 17 | 11984504 | 11984554 | Adineta vaga 104782 | CTA|GTAAGTTGAT...AGAATGATAAAA/AAGAAACTAAGA...CAAAG|GCT | 0 | 1 | 56.212 |
3822884 | GT-AG | 0 | 9.826249842544672e-05 | 51 | rna-gnl|I4U23|003869-T1 720820 | 18 | 11984156 | 11984206 | Adineta vaga 104782 | CGA|GTATGAAGAA...CATTTCTTGTTT/TTTCTTCTAATC...AATAG|ATT | 0 | 1 | 62.241 |
3822885 | GT-AG | 0 | 0.0001865888299375 | 55 | rna-gnl|I4U23|003869-T1 720820 | 19 | 11983831 | 11983885 | Adineta vaga 104782 | GTT|GTAAGTTAGA...ATATCTTTGCTT/ATTAAATACAAG...AATAG|GCA | 0 | 1 | 67.722 |
3822886 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna-gnl|I4U23|003869-T1 720820 | 20 | 11983731 | 11983776 | Adineta vaga 104782 | GAT|GTAAAGAAAA...TCATCTTTGGAT/GGATTATTTATA...TATAG|ATA | 0 | 1 | 68.819 |
3822887 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|003869-T1 720820 | 21 | 11983538 | 11983592 | Adineta vaga 104782 | CTG|GTAAGTTTTT...AAGAGATTGATG/AAGAGATTGATG...TTTAG|TTA | 0 | 1 | 71.62 |
3822888 | GT-AG | 0 | 0.0083671417037809 | 54 | rna-gnl|I4U23|003869-T1 720820 | 22 | 11983373 | 11983426 | Adineta vaga 104782 | AGA|GTATGTTCTT...ACATATTTAGAA/GACATATTTAGA...TCTAG|TTT | 0 | 1 | 73.873 |
3822889 | GT-AG | 0 | 1.8719571955117645e-05 | 67 | rna-gnl|I4U23|003869-T1 720820 | 23 | 11983139 | 11983205 | Adineta vaga 104782 | TAC|GTAAGTAAAA...GTGTTTTTGATC/GTGTTTTTGATC...ATTAG|ATT | 2 | 1 | 77.263 |
3822890 | GT-AG | 0 | 0.0018232842183891 | 46 | rna-gnl|I4U23|003869-T1 720820 | 24 | 11982989 | 11983034 | Adineta vaga 104782 | TTG|GTATGTTGTT...TTATTTGTATCT/TTGTATCTAATA...TGTAG|AAC | 1 | 1 | 79.375 |
3822891 | GT-AG | 0 | 0.0001212117629994 | 55 | rna-gnl|I4U23|003869-T1 720820 | 25 | 11982832 | 11982886 | Adineta vaga 104782 | AAG|GTTTGTTCTT...AATCTTTTAGTT/TTCAACTTTATA...TACAG|GAG | 1 | 1 | 81.445 |
3822892 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|003869-T1 720820 | 26 | 11982404 | 11982460 | Adineta vaga 104782 | GTG|GTAAGTATGA...TTGTTTTTGAAA/TCTAGTCTTACT...TAAAG|GTT | 0 | 1 | 88.977 |
3822893 | GT-AG | 0 | 1.321956713436986e-05 | 49 | rna-gnl|I4U23|003869-T1 720820 | 27 | 11982292 | 11982340 | Adineta vaga 104782 | CAA|GTAAATCTAT...ATTTTGTTATTA/TCATGTTTGATT...TTTAG|GGT | 0 | 1 | 90.256 |
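The rows above can be summarized with the same filter the page applies (transcript_id = 720820). The following is a minimal sketch, assuming Python's standard sqlite3 module and a reduced in-memory table; the phase values are transcribed from the table above in ordinal_index order, and the three-column table is an illustration, not the full schema.

```python
import sqlite3

# Phase values for ordinal_index 1..27, transcribed from the table above.
phases = [0, 2, 2, 1, 0, 1, 2, 2, 0, 0, 1, 0, 0, 0,
          2, 0, 0, 0, 0, 0, 0, 0, 2, 1, 1, 0, 0]

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns (transcript_id INTEGER, ordinal_index INTEGER, phase INTEGER)"
)
conn.executemany(
    "INSERT INTO introns VALUES (720820, ?, ?)",
    list(enumerate(phases, start=1)),
)

# Count introns per phase for the transcript shown on this page.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns WHERE transcript_id = ? GROUP BY phase",
        (720820,),
    ).fetchall()
)
print(phase_counts)
```

For this transcript the tally is 16 introns in phase 0, 5 in phase 1, and 6 in phase 2, which sums to the 27 rows listed.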
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
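The schema above can be loaded into an empty database to see how the transcript_id filter used on this page is served. The following is a minimal sketch, assuming the database engine is SQLite (the bracket-quoted identifiers suggest this, but the page does not say so) and using Python's standard sqlite3 module. Only one of the seven indexes is recreated to keep the example short, and the referenced transcripts and genomes tables are deliberately left out, which SQLite permits at table-creation time.

```python
import sqlite3

# Table definition copied from the schema above; one index kept for brevity.
SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
    "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
    "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER,
    "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would execute the page's filter; the human-readable
# description is the fourth column of each EXPLAIN QUERY PLAN row.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (720820,),
).fetchall()
plan_text = " ".join(row[3] for row in plan)
print(plan_text)
```

If the plan text names idx_introns_transcript_id, the lookup is an index search and not a full table scan. The exact wording of the plan varies between SQLite versions.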