introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
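As a rough illustration of how these columns are typically queried together, the sketch below uses Python's standard-library sqlite3 module to build an in-memory table holding a subset of the columns and to list one transcript's coding-sequence introns in order. The three sample rows are copied from the rows listed on this page for transcript 720732; the helper name `introns_for_transcript` and the reduced column set are assumptions made for the example, not part of the database itself.

```python
import sqlite3

# In-memory stand-in for the real database (illustrative only).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        start INTEGER,
        "end" INTEGER,   -- quoted because END is an SQL keyword
        phase INTEGER,
        in_cds INTEGER
    )
    """
)

# Values copied from the first three rows shown for transcript 720732.
rows = [
    (3821182, "GT-AG", 0, 1.5296213248749707e-05, 413, 720732, 1, 8821098, 8821510, 0, 1),
    (3821183, "GT-AG", 0, 0.0012605535531556, 70, 720732, 2, 8820926, 8820995, 0, 1),
    (3821184, "GT-AG", 0, 0.0940907370169635, 62, 720732, 3, 8820832, 8820893, 2, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)", rows)


def introns_for_transcript(connection, transcript_id):
    """Return (ordinal_index, dinucleotide_pair, length, phase) for the
    coding-sequence introns of one transcript, ordered by position."""
    query = """
        SELECT ordinal_index, dinucleotide_pair, length, phase
        FROM introns
        WHERE transcript_id = ? AND in_cds = 1
        ORDER BY ordinal_index
    """
    return connection.execute(query, (transcript_id,)).fetchall()


ordered = introns_for_transcript(conn, 720732)
for ordinal, pair, length, phase in ordered:
    print(ordinal, pair, length, phase)
```

Filtering on `in_cds = 1` matters when `phase` is used, since the description above says phase is null for introns outside the coding sequence.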
45 rows where transcript_id = 720732
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821182 | GT-AG | 0 | 1.5296213248749707e-05 | 413 | rna-gnl|I4U23|002801-T2 720732 | 1 | 8821098 | 8821510 | Adineta vaga 104782 | TTC|GTAAGTAATA...TGTCTATTGATT/TGTCTATTGATT...TTTAG|TCT | 0 | 1 | 0.303 |
3821183 | GT-AG | 0 | 0.0012605535531556 | 70 | rna-gnl|I4U23|002801-T2 720732 | 2 | 8820926 | 8820995 | Adineta vaga 104782 | GCT|GTATGTGAAA...CTTCTCTTTTCT/TCTTTTCTTATG...TTTAG|AAT | 0 | 1 | 1.094 |
3821184 | GT-AG | 0 | 0.0940907370169635 | 62 | rna-gnl|I4U23|002801-T2 720732 | 3 | 8820832 | 8820893 | Adineta vaga 104782 | AAA|GTAACTTCTT...TATATTTTATTT/GTATATTTTATT...TCTAG|AGA | 2 | 1 | 1.342 |
3821185 | GT-AG | 0 | 1.000000099473604e-05 | 72 | rna-gnl|I4U23|002801-T2 720732 | 4 | 8820693 | 8820764 | Adineta vaga 104782 | AAG|GTCAGTTTAA...ATAATCTTATTG/TATAATCTTATT...TCTAG|AAA | 0 | 1 | 1.862 |
3821186 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|002801-T2 720732 | 5 | 8820499 | 8820562 | Adineta vaga 104782 | CAC|GTGAGTATTG...TAATTATTGATA/TAATTATTGATA...TCTAG|AGG | 1 | 1 | 2.87 |
3821187 | GT-AG | 0 | 0.0001788540626004 | 57 | rna-gnl|I4U23|002801-T2 720732 | 6 | 8820374 | 8820430 | Adineta vaga 104782 | GAG|GTAAACATTG...ATTGTTTTGATC/ATTGTTTTGATC...TCAAG|GAT | 0 | 1 | 3.398 |
3821188 | GT-AG | 0 | 2.014930759671719e-05 | 60 | rna-gnl|I4U23|002801-T2 720732 | 7 | 8820125 | 8820184 | Adineta vaga 104782 | AAG|GTATGTAGAG...CAATTTTTCATG/CAATTTTTCATG...ATTAG|GAA | 0 | 1 | 4.864 |
3821189 | GT-AG | 0 | 0.0003635363596306 | 354 | rna-gnl|I4U23|002801-T2 720732 | 8 | 8819741 | 8820094 | Adineta vaga 104782 | AAG|GTTTGTTTAT...TTCGCTTTATTT/TTTCGCTTTATT...ATTAG|TCC | 0 | 1 | 5.097 |
3821190 | GT-AG | 0 | 1.000000099473604e-05 | 369 | rna-gnl|I4U23|002801-T2 720732 | 9 | 8819273 | 8819641 | Adineta vaga 104782 | AGT|GTAAGAAAAG...GTTTTCTTTCCT/TTAATACTAAGA...TGTAG|CAA | 0 | 1 | 5.865 |
3821191 | GT-AG | 0 | 0.0003201722469177 | 52 | rna-gnl|I4U23|002801-T2 720732 | 10 | 8819112 | 8819163 | Adineta vaga 104782 | TCA|GTAAGTTTGA...GATTTGTTATTC/TGATTTGTTATT...ATTAG|CAC | 1 | 1 | 6.71 |
3821192 | GT-AG | 0 | 0.0012399981306171 | 66 | rna-gnl|I4U23|002801-T2 720732 | 11 | 8818932 | 8818997 | Adineta vaga 104782 | TGG|GTATGTAATA...TTTGTTTTATTT/TTTTGTTTTATT...TTTAG|CTG | 1 | 1 | 7.594 |
3821193 | GT-AG | 0 | 0.0001058633095445 | 59 | rna-gnl|I4U23|002801-T2 720732 | 12 | 8818796 | 8818854 | Adineta vaga 104782 | TCG|GTTCGTTTAT...CTTTCTTTATTT/TCTTTCTTTATT...TAAAG|GAT | 0 | 1 | 8.192 |
3821194 | GT-AG | 0 | 0.0226045957412433 | 61 | rna-gnl|I4U23|002801-T2 720732 | 13 | 8818141 | 8818201 | Adineta vaga 104782 | AAA|GTATGTTTTG...ATTGTTTTTGCG/ATCATATTGAAA...TCTAG|AGA | 0 | 1 | 12.8 |
3821195 | GT-AG | 0 | 1.4488530731568208e-05 | 65 | rna-gnl|I4U23|002801-T2 720732 | 14 | 8817977 | 8818041 | Adineta vaga 104782 | AAA|GTAAGTCTTT...TTAGTTTGAATT/ATTAGTTTGAAT...TCTAG|AAT | 0 | 1 | 13.568 |
3821196 | GT-AG | 0 | 1.000000099473604e-05 | 178 | rna-gnl|I4U23|002801-T2 720732 | 15 | 8817403 | 8817580 | Adineta vaga 104782 | CTT|GTAAGTGAAT...ATATATTTCATT/ATATATTTCATT...TCTAG|AGT | 0 | 1 | 16.64 |
3821197 | GT-AG | 0 | 0.0117798283093386 | 59 | rna-gnl|I4U23|002801-T2 720732 | 16 | 8817245 | 8817303 | Adineta vaga 104782 | AAC|GTATAAATGA...TTTTTCTTATTT/ATTTTTCTTATT...TTAAG|CTT | 0 | 1 | 17.407 |
3821198 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl|I4U23|002801-T2 720732 | 17 | 8817095 | 8817145 | Adineta vaga 104782 | AAA|GTAAAAAAAA...TTTCTCTTAATG/GTTTCTCTTAAT...TCTAG|CAT | 0 | 1 | 18.175 |
3821199 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|002801-T2 720732 | 18 | 8816912 | 8816968 | Adineta vaga 104782 | AAG|GTAGAGATTC...TATGCTTTCGTC/GATAATTTTATG...TATAG|ATT | 0 | 1 | 19.153 |
3821200 | GT-AG | 0 | 9.78067806787894e-05 | 66 | rna-gnl|I4U23|002801-T2 720732 | 19 | 8816785 | 8816850 | Adineta vaga 104782 | CAG|GTAAATTTTC...TATTTCGTAAAA/AACATATTGATT...AAAAG|AAA | 1 | 1 | 19.626 |
3821201 | GT-AG | 0 | 1.000000099473604e-05 | 181 | rna-gnl|I4U23|002801-T2 720732 | 20 | 8816475 | 8816655 | Adineta vaga 104782 | AAG|GTAGTCGAAT...GGTTATTTATTG/CGGTTATTTATT...TTTAG|ATA | 1 | 1 | 20.627 |
3821202 | GT-AG | 0 | 1.000000099473604e-05 | 68 | rna-gnl|I4U23|002801-T2 720732 | 21 | 8816300 | 8816367 | Adineta vaga 104782 | AAG|GTAAATAAAA...TTCTTTTTATAT/TTTCTTTTTATA...TAAAG|GAA | 0 | 1 | 21.457 |
3821203 | GT-AG | 0 | 1.000000099473604e-05 | 129 | rna-gnl|I4U23|002801-T2 720732 | 22 | 8816028 | 8816156 | Adineta vaga 104782 | AGG|GTGAGTAATC...CTTTTTTTATAC/TCTTTTTTTATA...TCTAG|GGA | 2 | 1 | 22.566 |
3821204 | GT-AG | 0 | 0.0012409994739191 | 130 | rna-gnl|I4U23|002801-T2 720732 | 23 | 8815877 | 8816006 | Adineta vaga 104782 | ATC|GTAAGTATTT...TGTTTCTTATCT/TTTGTATTTATT...TATAG|ATT | 2 | 1 | 22.729 |
3821205 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-gnl|I4U23|002801-T2 720732 | 24 | 8815506 | 8815651 | Adineta vaga 104782 | CGG|GTAAAGAAAA...CGAACCTTATTT/ATTTGATTTATT...TTTAG|ACC | 2 | 1 | 24.474 |
3821206 | GT-AG | 0 | 2.153997629047449e-05 | 54 | rna-gnl|I4U23|002801-T2 720732 | 25 | 8815291 | 8815344 | Adineta vaga 104782 | AAG|GTAATATTCA...TTTTTTTTAAAA/TTTTTTTTTAAA...TCTAG|ATT | 1 | 1 | 25.723 |
3821207 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|002801-T2 720732 | 26 | 8814696 | 8814752 | Adineta vaga 104782 | TCG|GTAAGAGTTT...TCTATTTTGACT/GGTTTTTTCATT...CGTAG|ATT | 2 | 1 | 29.897 |
3821208 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl|I4U23|002801-T2 720732 | 27 | 8814083 | 8814139 | Adineta vaga 104782 | AAA|GTAAGAAATT...TTTTCTTTTGTT/CTTTTGTTTATT...CCTAG|GAT | 0 | 1 | 34.21 |
3821209 | GT-AG | 0 | 0.0001380225987922 | 50 | rna-gnl|I4U23|002801-T2 720732 | 28 | 8814021 | 8814070 | Adineta vaga 104782 | GAA|GTAAAATTTT...AATATTTTGATT/AATATTTTGATT...TTTAG|AAT | 0 | 1 | 34.303 |
3821210 | GT-AG | 0 | 1.837519383458241e-05 | 55 | rna-gnl|I4U23|002801-T2 720732 | 29 | 8813911 | 8813965 | Adineta vaga 104782 | AAA|GTAAACAGAT...GAATTTTCAATA/GGAATTTTCAAT...TTCAG|GTT | 1 | 1 | 34.73 |
3821211 | GT-AG | 0 | 1.000000099473604e-05 | 221 | rna-gnl|I4U23|002801-T2 720732 | 30 | 8813486 | 8813706 | Adineta vaga 104782 | CAG|GTAAGAAATA...TTGTTTTTTTTT/ATTTGTGTCACA...CTTAG|GAA | 1 | 1 | 36.312 |
3821212 | GT-AG | 0 | 2.86598881938793 | 59 | rna-gnl|I4U23|002801-T2 720732 | 31 | 8813260 | 8813318 | Adineta vaga 104782 | CGT|GTATGCTATA...TCATTTTTAACT/TTGATTTTCATT...TATAG|TTG | 0 | 1 | 37.608 |
3821213 | GT-AG | 0 | 1.000000099473604e-05 | 58 | rna-gnl|I4U23|002801-T2 720732 | 32 | 8813144 | 8813201 | Adineta vaga 104782 | CTA|GTAAGTAGAA...AATTGTTTATCG/CAGTTTTTCATG...GAAAG|ATT | 1 | 1 | 38.058 |
3821214 | GT-AG | 0 | 1.000000099473604e-05 | 87 | rna-gnl|I4U23|002801-T2 720732 | 33 | 8812808 | 8812894 | Adineta vaga 104782 | TAG|GTTCATAACT...GTGTTGTTATTT/TGTGTTGTTATT...TAAAG|GTG | 1 | 1 | 39.989 |
3821215 | GT-AG | 0 | 0.0114462475451545 | 93 | rna-gnl|I4U23|002801-T2 720732 | 34 | 8812448 | 8812540 | Adineta vaga 104782 | TTT|GTTTGTATTG...CTCTTCTTAAAT/CTCTTCTTAAAT...TTTAG|CTG | 1 | 1 | 42.06 |
3821216 | GT-AG | 0 | 0.0188650121016754 | 51 | rna-gnl|I4U23|002801-T2 720732 | 35 | 8812310 | 8812360 | Adineta vaga 104782 | TGG|GTATGTTGTC...TTATTTTTAGTT/TTTATTTTTAGT...TATAG|TGA | 1 | 1 | 42.735 |
3821217 | GT-AG | 0 | 1.3427807270594932e-05 | 1133 | rna-gnl|I4U23|002801-T2 720732 | 36 | 8811153 | 8812285 | Adineta vaga 104782 | CAA|GTAGAAATTA...ATTTCTTTTGTT/ACGAAGATGATA...TTTAG|AAT | 1 | 1 | 42.921 |
3821218 | GT-AG | 0 | 0.0003618751926116 | 94 | rna-gnl|I4U23|002801-T2 720732 | 37 | 8810867 | 8810960 | Adineta vaga 104782 | TTC|GTAAGTTGAA...TTGTTTTTAATC/TTGTTTTTAATC...TCCAG|CAC | 1 | 1 | 44.411 |
3821219 | GT-AG | 0 | 0.0001302379738036 | 182 | rna-gnl|I4U23|002801-T2 720732 | 38 | 8809122 | 8809303 | Adineta vaga 104782 | AAG|GTAGTCAACA...TTTTTTTTGATA/TTTTTTTTGATA...CATAG|CGT | 1 | 1 | 56.536 |
3821220 | GT-AG | 0 | 2.551587595014115e-05 | 127 | rna-gnl|I4U23|002801-T2 720732 | 39 | 8808881 | 8809007 | Adineta vaga 104782 | AAG|GTTTGCGTAT...TCTTTCTTTTTT/TGAATCCTCACT...TTTAG|ATA | 1 | 1 | 57.42 |
3821221 | GT-AG | 0 | 3.156523942719496e-05 | 966 | rna-gnl|I4U23|002801-T2 720732 | 40 | 8807870 | 8808835 | Adineta vaga 104782 | AAG|GTAAGCATAA...TTTTCCTTGTTT/TTGTTTTCTATT...TACAG|CTA | 1 | 1 | 57.769 |
3821222 | GT-AG | 0 | 3.7451622229773305e-05 | 53 | rna-gnl|I4U23|002801-T2 720732 | 41 | 8807571 | 8807623 | Adineta vaga 104782 | TTA|GTAGATAAAC...GCTACTTCAACT/CTAATACTCAAC...GAAAG|AGT | 1 | 1 | 59.677 |
3821223 | GT-AG | 0 | 1.000000099473604e-05 | 90 | rna-gnl|I4U23|002801-T2 720732 | 42 | 8804677 | 8804766 | Adineta vaga 104782 | GAG|GTAAGAAGAT...TATCTTTTTATT/TATCTTTTTATT...TTTAG|TCT | 0 | 1 | 81.429 |
3821224 | GT-AG | 0 | 3.721614480513704e-05 | 61 | rna-gnl|I4U23|002801-T2 720732 | 43 | 8804401 | 8804461 | Adineta vaga 104782 | ATC|GTCGACAACA...ATCGCCATGAAG/AAAGAATTTATA...GCAAG|TTA | 2 | 1 | 83.097 |
3821225 | GT-AG | 0 | 1.000000099473604e-05 | 64 | rna-gnl|I4U23|002801-T2 720732 | 44 | 8803174 | 8803237 | Adineta vaga 104782 | AAA|GTGTTACGAC...TCGATATTATTC/ATATTATTCAAA...ATTAG|TTC | 1 | 1 | 92.119 |
3821226 | GT-AG | 0 | 0.000340592321637 | 52 | rna-gnl|I4U23|002801-T2 720732 | 45 | 8802723 | 8802774 | Adineta vaga 104782 | ATA|GTCCACAAAA...TATGCTTTATTC/GCTTTATTCATC...AAAAG|CAG | 1 | 1 | 95.214 |
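The rows above appear internally consistent in two ways that can be checked mechanically: `length` matches `end - start + 1`, and `dinucleotide_pair` matches the first and last two bases of the intron portion of `scored_motifs`. The Python sketch below tests both on the first and last rows of the table. It assumes, based only on the rows shown here, that the text between the first and last `|` in `scored_motifs` is the intron and that the outer triplets are flanking exon bases; the function names are hypothetical.

```python
def span_length(start: int, end: int) -> int:
    """Length of an inclusive genomic interval."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a 'XX-YY' pair from a scored_motifs string.

    Assumption: the intron sequence is the text between the first and
    last '|' characters (inferred from the rows on this page).
    """
    first = scored_motifs.index("|")
    last = scored_motifs.rindex("|")
    intron = scored_motifs[first + 1:last]
    return f"{intron[:2]}-{intron[-2:]}"


# (length, start, end, dinucleotide_pair, scored_motifs) copied from
# intron ids 3821182 and 3821226 in the table above.
samples = [
    (413, 8821098, 8821510, "GT-AG",
     "TTC|GTAAGTAATA...TGTCTATTGATT/TGTCTATTGATT...TTTAG|TCT"),
    (52, 8802723, 8802774, "GT-AG",
     "ATA|GTCCACAAAA...TATGCTTTATTC/GCTTTATTCATC...AAAAG|CAG"),
]

checks = [
    (span_length(start, end) == length, terminal_dinucleotides(motifs) == pair)
    for length, start, end, pair, motifs in samples
]
print(checks)
```

A check like this is only a sanity test on the displayed values; it does not say anything about how the database computes these fields.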
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
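To see how the `transcript_id` index in this schema would serve a filter like the one on this page, the following Python sketch loads the schema into an empty in-memory SQLite database and asks for the query plan. It is a minimal sketch: the table is empty, the referenced `transcripts` and `genomes` tables are not created (SQLite does not require them for this query), and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the "rows where transcript_id = 720732" filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id, score FROM introns WHERE transcript_id = ?",
    (720732,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)
```

If the plan mentions `idx_introns_transcript_id`, the lookup is an index search rather than a full table scan, which is the purpose of that index for per-transcript views such as this one.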