introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
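
The dinucleotide_pair and scored_motifs columns appear to be related. As a minimal sketch, assuming the text between the two `|` characters in scored_motifs is the intron shown 5' to 3' (an interpretation the column description does not confirm), the terminal dinucleotides can be read off its two ends. The helper name `terminal_dinucleotides` is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return a string such as 'GT-AG' from a scored_motifs value.

    Assumption (not stated by the schema): the value has the shape
    '<upstream exon>|<intron motifs>|<downstream exon>', and the middle
    part starts and ends with the intron's terminal bases.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron = parts[1]
    # First two and last two characters of the intron part.
    return f"{intron[:2]}-{intron[-2:]}"


# Value copied from the row with id 3821965 on this page.
example = "AGC|GTAGGTTTTT...TCTCTCTCGACA/TTGAAACTTATT...TCTAG|ACG"
print(terminal_dinucleotides(example))  # GT-AG
```

For this row the result matches the stored dinucleotide_pair value, which is consistent with the assumption but does not prove it for every row.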
26 rows where transcript_id = 720762
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821965 | GT-AG | 0 | 0.001814525502575 | 61 | rna-gnl\|I4U23\|003240-T1 720762 | 1 | 10168983 | 10169043 | Adineta vaga 104782 | AGC\|GTAGGTTTTT...TCTCTCTCGACA/TTGAAACTTATT...TCTAG\|ACG | 0 | 1 | 1.935 |
3821966 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl\|I4U23\|003240-T1 720762 | 2 | 10169252 | 10169311 | Adineta vaga 104782 | CAG\|GTTAATATAC...GTTTCTTTAGAA/CTTATATTAATA...CATAG\|GAG | 1 | 1 | 5.056 |
3821967 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl\|I4U23\|003240-T1 720762 | 3 | 10169455 | 10169510 | Adineta vaga 104782 | AAA\|GTAAAAATCT...ATTTTCTTGCCG/ACTCTATTCATT...AATAG\|GAA | 0 | 1 | 7.201 |
3821968 | GT-AG | 0 | 0.0001001711061875 | 52 | rna-gnl\|I4U23\|003240-T1 720762 | 4 | 10169802 | 10169853 | Adineta vaga 104782 | CGT\|GTAAGTTACT...TTTTCTTTCTTT/ATATTCTTCAAT...TTTAG\|TCA | 0 | 1 | 11.566 |
3821969 | GT-AG | 0 | 0.0001163671793352 | 66 | rna-gnl\|I4U23\|003240-T1 720762 | 5 | 10169992 | 10170057 | Adineta vaga 104782 | ACA\|GTATGACAAT...TTTTTCTCAAAT/TTTTTTCTCAAA...TTTAG\|ATT | 0 | 1 | 13.636 |
3821970 | GT-AG | 0 | 0.002974624650426 | 53 | rna-gnl\|I4U23\|003240-T1 720762 | 6 | 10170220 | 10170272 | Adineta vaga 104782 | CAG\|GTATATCTTC...TGCTTGTTGATC/GATCATTTCATT...TTTAG\|ATG | 0 | 1 | 16.067 |
3821971 | GT-AG | 0 | 0.002624940422659 | 55 | rna-gnl\|I4U23\|003240-T1 720762 | 7 | 10170434 | 10170488 | Adineta vaga 104782 | TTC\|GTACGTATTT...ACTCTTTTGAAA/ACTCTTTTGAAA...TATAG\|GAA | 2 | 1 | 18.482 |
3821972 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl\|I4U23\|003240-T1 720762 | 8 | 10170627 | 10170685 | Adineta vaga 104782 | ACG\|GTAAATCAAT...TTTTTCTTGTTA/CAAACATTCATA...TTTAG\|TGG | 2 | 1 | 20.552 |
3821973 | GT-AG | 0 | 1.000000099473604e-05 | 46 | rna-gnl\|I4U23\|003240-T1 720762 | 9 | 10170972 | 10171017 | Adineta vaga 104782 | TTC\|GTAAGAAATT...TAGTTTTTATTT/CTAGTTTTTATT...TTTAG\|GAT | 0 | 1 | 24.842 |
3821974 | GT-AG | 0 | 1.7030656732058663e-05 | 60 | rna-gnl\|I4U23\|003240-T1 720762 | 10 | 10171175 | 10171234 | Adineta vaga 104782 | AAA\|GTAATTGCAA...AAATCTTTAACT/TTACAATTCACT...TTTAG\|TGG | 1 | 1 | 27.198 |
3821975 | GT-AG | 0 | 0.0257996972970079 | 58 | rna-gnl\|I4U23\|003240-T1 720762 | 11 | 10171375 | 10171432 | Adineta vaga 104782 | GAA\|GTATGTTTAA...TGCTCCTTCTTT/CCTTCTTTCAAT...GTTAG\|GTA | 0 | 1 | 29.298 |
3821976 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl\|I4U23\|003240-T1 720762 | 12 | 10172322 | 10172370 | Adineta vaga 104782 | CAG\|GTAAGATCGG...ATTTTCTTTTCG/GTTTGATTAATT...TTTAG\|ATA | 1 | 1 | 42.634 |
3821977 | GT-AG | 0 | 0.0004105114337488 | 62 | rna-gnl\|I4U23\|003240-T1 720762 | 13 | 10172445 | 10172506 | Adineta vaga 104782 | TTG\|GTTTGTATTA...TTGTTCTTGAAG/TCTTTTGTAAAT...TATAG\|CAT | 0 | 1 | 43.744 |
3821978 | GT-AG | 0 | 3.250668931559501e-05 | 82 | rna-gnl\|I4U23\|003240-T1 720762 | 14 | 10173267 | 10173348 | Adineta vaga 104782 | CAG\|GTAGAGTTTT...TTTTTCTTTTCT/GATTGAAAAACT...ATTAG\|GTG | 1 | 1 | 55.146 |
3821979 | GT-AG | 0 | 1.75481469174023e-05 | 56 | rna-gnl\|I4U23\|003240-T1 720762 | 15 | 10173658 | 10173713 | Adineta vaga 104782 | TTG\|GTAAAATTGA...TGTTTTTTATTA/TTGTTTTTTATT...CATAG\|AAC | 1 | 1 | 59.781 |
3821980 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl\|I4U23\|003240-T1 720762 | 16 | 10173811 | 10173865 | Adineta vaga 104782 | ATC\|GTTAGTTAAA...GAAATTTTGATT/GAAATTTTGATT...TTTAG\|GGA | 2 | 1 | 61.236 |
3821981 | GT-AG | 0 | 0.0002850757172168 | 70 | rna-gnl\|I4U23\|003240-T1 720762 | 17 | 10173963 | 10174032 | Adineta vaga 104782 | ACT\|GTAAGTTATT...ACTTTCTGAAAC/CTGAAACTAAAT...AAAAG\|TGT | 0 | 1 | 62.691 |
3821982 | GT-AG | 0 | 0.0002689065026278 | 61 | rna-gnl\|I4U23\|003240-T1 720762 | 18 | 10174058 | 10174118 | Adineta vaga 104782 | CAT\|GTATGGATAG...TTATTCTAAATA/ATTATTCTAAAT...TATAG\|ATT | 1 | 1 | 63.066 |
3821983 | GT-AG | 0 | 1.000000099473604e-05 | 464 | rna-gnl\|I4U23\|003240-T1 720762 | 19 | 10174653 | 10175116 | Adineta vaga 104782 | GAA\|GTAAGTATAC...CAACTATTATCA/ACTGCGTTCAAC...GCCAG\|ACT | 1 | 1 | 71.077 |
3821984 | GT-AG | 0 | 0.003026008738137 | 56 | rna-gnl\|I4U23\|003240-T1 720762 | 20 | 10175926 | 10175981 | Adineta vaga 104782 | GAC\|GTATGTGTAG...TAAATCTTATAC/TTAAATCTTATA...TGTAG\|ACA | 0 | 1 | 83.213 |
3821985 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-gnl\|I4U23\|003240-T1 720762 | 21 | 10176083 | 10176149 | Adineta vaga 104782 | ACG\|GTACGAATTT...GAGCTATTAAAT/TTCATGTTCACT...TTTAG\|TGC | 2 | 1 | 84.728 |
3821986 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl\|I4U23\|003240-T1 720762 | 22 | 10176397 | 10176453 | Adineta vaga 104782 | AAT\|GTTAGTAAAA...ATAATTTTATTT/AATAATTTTATT...TTTAG\|ATA | 0 | 1 | 88.434 |
3821987 | GT-AG | 0 | 1.000000099473604e-05 | 61 | rna-gnl\|I4U23\|003240-T1 720762 | 23 | 10176617 | 10176677 | Adineta vaga 104782 | ATT\|GTAAGAATCC...ATGTATTTAAAT/TATGTATTCAAA...TATAG\|ATT | 1 | 1 | 90.879 |
3821988 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl\|I4U23\|003240-T1 720762 | 24 | 10176788 | 10176839 | Adineta vaga 104782 | AAG\|GTCGGTACCA...TCATTCTAAATC/TTCATTCTAAAT...TTTAG\|TTG | 0 | 1 | 92.529 |
3821989 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl\|I4U23\|003240-T1 720762 | 25 | 10177090 | 10177144 | Adineta vaga 104782 | TTG\|GTAAGAATGA...AGAATCTTTTCT/GATAAAATAATC...TTTAG\|CAT | 1 | 1 | 96.28 |
3821990 | GT-AG | 0 | 1.000000099473604e-05 | 113 | rna-gnl\|I4U23\|003240-T1 720762 | 26 | 10177259 | 10177371 | Adineta vaga 104782 | ATA\|GTGAACAATT...GATTTCTTTTCA/CGATTTTTCATT...GATAG\|AAA | 1 | 1 | 97.99 |
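
The start, end, and length values in the rows above are consistent with inclusive coordinates, that is, length = end - start + 1. A minimal sketch that checks this on three rows copied from the table (the names `rows` and `inclusive_length` are illustrative only):

```python
# (id, start, end, length) copied from the rows with ordinal_index 1, 19, and 26.
rows = [
    (3821965, 10168983, 10169043, 61),
    (3821983, 10174653, 10175116, 464),
    (3821990, 10177259, 10177371, 113),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions both lie inside it."""
    return end - start + 1


# Collect the ids of any rows where the stored length disagrees.
mismatches = [
    intron_id
    for intron_id, start, end, length in rows
    if inclusive_length(start, end) != length
]
print(mismatches)  # []
```

An empty list means the stored lengths agree with the inclusive formula for the sampled rows. Whether positions are 1-based is not stated on this page.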
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
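
As a rough sketch of how the schema above supports the filtered view on this page, the snippet below builds a reduced copy of the table in an in-memory SQLite database with Python's standard `sqlite3` module, loads two rows copied from the table, and runs the transcript_id = 720762 filter. The reduced column set, the omitted foreign keys, and the function name `load_demo` are simplifications for illustration and are not part of the database itself.

```python
import sqlite3


def load_demo() -> sqlite3.Connection:
    """Create an in-memory table with a subset of the introns columns."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE introns (
            id INTEGER PRIMARY KEY,
            dinucleotide_pair TEXT,
            is_minor INTEGER,
            score REAL,
            length INTEGER,
            transcript_id INTEGER,
            ordinal_index INTEGER
        );
        CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
        """
    )
    # Two rows copied from the table on this page.
    conn.executemany(
        "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)",
        [
            (3821965, "GT-AG", 0, 0.001814525502575, 61, 720762, 1),
            (3821966, "GT-AG", 0, 1.000000099473604e-05, 60, 720762, 2),
        ],
    )
    return conn


conn = load_demo()
query = "SELECT id, length FROM introns WHERE transcript_id = ? ORDER BY id"
result = conn.execute(query, (720762,)).fetchall()
print(result)  # [(3821965, 61), (3821966, 60)]

# Ask SQLite how it would run the query; the wording varies by version.
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (720762,)).fetchall()
print(plan)
```

The query plan is printed so that use of idx_introns_transcript_id can be checked by eye. Its exact text depends on the SQLite version, so no expected output is shown for it.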