introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
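As a quick consistency check on the coordinate columns, the sketch below compares `length` with `end - start + 1` for three rows copied from this page. The `Intron` class and its `span` method are hypothetical helpers, and the inclusive-coordinate convention is an assumption inferred from the listed rows; the column descriptions do not state it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Intron:
    """Hypothetical record holding a subset of the introns columns."""
    id: int
    start: int
    end: int
    length: int

    def span(self) -> int:
        # Assumption: start and end are both inclusive genome positions.
        return self.end - self.start + 1


# Three rows copied from this page: (id, start, end, length).
rows = [
    Intron(3822186, 16470145, 16470928, 784),
    Intron(3822187, 16471175, 16471566, 392),
    Intron(3822188, 16471609, 16471656, 48),
]

# Collect ids whose stored length disagrees with the coordinate span.
mismatches = [r.id for r in rows if r.span() != r.length]
print(mismatches)  # → []
```

An empty list here only shows that these three rows fit the inclusive convention; it is not a guarantee for the whole database.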
16 rows where transcript_id = 720775
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822186 | GT-AG | 0 | 1.000000099473604e-05 | 784 | rna-gnl\|I4U23\|005320-T1 720775 | 1 | 16470145 | 16470928 | Adineta vaga 104782 | TAG\|GTTGGACGCA...AACTCTTTATAT/TTTTTGTTGATC...TAAAG\|TAG | 1 | 1 | 0.508 |
3822187 | GT-AG | 0 | 4.574621952298371e-05 | 392 | rna-gnl\|I4U23\|005320-T1 720775 | 2 | 16471175 | 16471566 | Adineta vaga 104782 | CTG\|GTGTAGTTTT...TTTCTTTTAAAA/TTAATTCTCATT...TTTAG\|GTG | 1 | 1 | 4.537 |
3822188 | GT-AG | 0 | 1.1600010132462044e-05 | 48 | rna-gnl\|I4U23\|005320-T1 720775 | 3 | 16471609 | 16471656 | Adineta vaga 104782 | AAG\|GTTTGATTTC...CTTTTCTCATTT/TCTCATTTCAAA...TATAG\|TAA | 1 | 1 | 5.225 |
3822189 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl\|I4U23\|005320-T1 720775 | 4 | 16471687 | 16471741 | Adineta vaga 104782 | AAA\|GTAAAAGTTT...TTTACTTTAGTT/TTAGCATTAATC...TTTAG\|ATG | 1 | 1 | 5.717 |
3822190 | GT-AG | 0 | 2.1625419568727404e-05 | 54 | rna-gnl\|I4U23\|005320-T1 720775 | 5 | 16471763 | 16471816 | Adineta vaga 104782 | AAA\|GTTAGTTTCA...AAATCTTTATTT/CTTTATTTTATT...AAAAG\|CAT | 1 | 1 | 6.061 |
3822191 | GT-AG | 0 | 0.0026517526843884 | 57 | rna-gnl\|I4U23\|005320-T1 720775 | 6 | 16472293 | 16472349 | Adineta vaga 104782 | ACG\|GTTTGTTTTC...TCTTCTTTATTT/TTCTTCTTTATT...TTTAG\|GAT | 0 | 1 | 13.857 |
3822192 | GT-AG | 0 | 7.554426364828019e-05 | 54 | rna-gnl\|I4U23\|005320-T1 720775 | 7 | 16472622 | 16472675 | Adineta vaga 104782 | TGG\|GTTTGATTAT...GTTTTTTTATCC/TGTTTTTTTATC...TTTAG\|TTG | 2 | 1 | 18.313 |
3822193 | GT-AG | 0 | 2.943830407858717e-05 | 54 | rna-gnl\|I4U23\|005320-T1 720775 | 8 | 16472870 | 16472923 | Adineta vaga 104782 | ATG\|GTAAATTAAA...TTTTTTTTATTC/ATTTTTTTTATT...AAAAG\|GTC | 1 | 1 | 21.491 |
3822194 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl\|I4U23\|005320-T1 720775 | 9 | 16473179 | 16473238 | Adineta vaga 104782 | ATT\|GTAAGAAGAA...ATTTTCTTTTTT/TCTTTTGTAAAT...TTTAG\|ATT | 1 | 1 | 25.667 |
3822195 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl\|I4U23\|005320-T1 720775 | 10 | 16473533 | 16473581 | Adineta vaga 104782 | AAA\|GTAAGAGATA...AATTTTTCGATA/AATTTTTCGATA...TTTAG\|ATG | 1 | 1 | 30.483 |
3822196 | GT-AG | 0 | 1.000000099473604e-05 | 45 | rna-gnl\|I4U23\|005320-T1 720775 | 11 | 16473965 | 16474009 | Adineta vaga 104782 | ACA\|GTAAGAAATA...TAGTTCTCGATT/TCTCGATTAAAT...TCTAG\|GGT | 0 | 1 | 36.757 |
3822197 | GT-AG | 0 | 0.0018074303563577 | 52 | rna-gnl\|I4U23\|005320-T1 720775 | 12 | 16474125 | 16474176 | Adineta vaga 104782 | TAA\|GTAAATTTCT...GAAATTTTAATC/GAAATTTTAATC...TTTAG\|AAT | 1 | 1 | 38.64 |
3822198 | GT-AG | 0 | 5.911433135839387e-05 | 52 | rna-gnl\|I4U23\|005320-T1 720775 | 13 | 16476342 | 16476393 | Adineta vaga 104782 | AAA\|GTAAAGATTT...TTCTCTTTGATA/TTCTCTTTGATA...AATAG\|ATT | 0 | 1 | 74.103 |
3822199 | GT-AG | 0 | 2.6440732422454786e-05 | 254 | rna-gnl\|I4U23\|005320-T1 720775 | 14 | 16477286 | 16477539 | Adineta vaga 104782 | ACG\|GTAAAATCTT...CTTTCCTCAATC/ACTTTCCTCAAT...AAAAG\|ATA | 1 | 1 | 88.714 |
3822200 | GT-AG | 0 | 0.0039090661449094 | 50 | rna-gnl\|I4U23\|005320-T1 720775 | 15 | 16478074 | 16478123 | Adineta vaga 104782 | CAG\|GTAACTTTTA...TTCTATTTGAAA/TTCTATTTGAAA...TTTAG\|GTT | 1 | 1 | 97.461 |
3822201 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl\|I4U23\|005320-T1 720775 | 16 | 16478249 | 16478298 | Adineta vaga 104782 | CGA\|GTAAGATATT...TCACTATTAAAA/TTATTAATCACT...TTTAG\|ATG | 0 | 1 | 99.509 |
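The phase distribution for this transcript can be tallied with a grouped query. The sketch below is a minimal illustration using an in-memory SQLite table that holds only three of the columns; the (ordinal_index, phase) pairs are copied from the 16 rows above, and the reduced table layout is an assumption made to keep the example short.

```python
import sqlite3

# (ordinal_index, phase) pairs copied from the 16 rows listed above.
phases = [
    (1, 1), (2, 1), (3, 1), (4, 1), (5, 1), (6, 0), (7, 2), (8, 1),
    (9, 1), (10, 1), (11, 0), (12, 1), (13, 0), (14, 1), (15, 1), (16, 0),
]

conn = sqlite3.connect(":memory:")
# Reduced, illustrative version of the introns table (not the full schema).
conn.execute(
    "CREATE TABLE introns (ordinal_index INTEGER, phase INTEGER, transcript_id INTEGER)"
)
conn.executemany("INSERT INTO introns VALUES (?, ?, 720775)", phases)

# Count introns per phase for the transcript shown on this page.
phase_counts = dict(
    conn.execute(
        "SELECT phase, COUNT(*) FROM introns "
        "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
        (720775,),
    ).fetchall()
)
print(phase_counts)  # → {0: 4, 1: 11, 2: 1}
```

The same `GROUP BY phase` query should work unchanged against the full table, since `phase` and `transcript_id` are both real columns there.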
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
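To see how the `transcript_id` index supports the filter used on this page, the sketch below loads the table definition and one index into an empty in-memory SQLite database and asks for the query plan. The referenced `transcripts` and `genomes` tables are not created here; this relies on foreign key enforcement being off, which is the default for Python's `sqlite3` connections. The exact plan wording varies between SQLite versions, so no output is shown.

```python
import sqlite3

# Table definition and one index, copied from the schema above.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL,
  "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER,
  "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER,
  "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the filter shown on this page.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (720775,),
).fetchall()

# The last column of each plan row is the human-readable detail string.
plan_text = " ".join(row[-1] for row in plan)
print(plan_text)
```

If the plan text mentions `idx_introns_transcript_id`, the lookup is served by the index rather than a full table scan.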