introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, 1 if the intron is a minor intron, 0 otherwise
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, 1 if the intron is within the coding sequence, 0 otherwise (e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
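The column descriptions do not say whether `start` and `end` are inclusive. In the rows listed for transcript 720763, `length` equals `end - start + 1`, which suggests closed (inclusive) coordinates. The following is a minimal Python sketch of that check, using four (start, end, length) triples copied from those rows; the helper name `is_inclusive_length` is hypothetical and not part of the database.

```python
# (start, end, length) copied from four rows of transcript 720763.
SAMPLE_ROWS = [
    (13681402, 13681451, 50),
    (13681563, 13681631, 69),
    (13688878, 13691968, 3091),
    (13692006, 13692056, 51),
]


def is_inclusive_length(start: int, end: int, length: int) -> bool:
    """Return True if length matches closed-interval coordinates [start, end]."""
    return end - start + 1 == length


# True only if every sampled row follows length = end - start + 1.
all_consistent = all(is_inclusive_length(*row) for row in SAMPLE_ROWS)
print(all_consistent)
```

This only tests a sample of one transcript; it is a sanity check, not a statement about how the whole database was built.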
27 rows where transcript_id = 720763
id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3821991 | GT-AG | 0 | 4.397803697776211 | 50 | rna-gnl\|I4U23\|004462-T1 720763 | 1 | 13681402 | 13681451 | Adineta vaga 104782 | TGA\|GTATCTTGAG...AACTTCATGATA/ATGATACTTATG...TTCAG\|GTT | 1 | 1 | 5.869 |
3821992 | GT-AG | 0 | 3.607884895783881e-05 | 69 | rna-gnl\|I4U23\|004462-T1 720763 | 2 | 13681563 | 13681631 | Adineta vaga 104782 | ATT\|GTTCCAGATC...GGTTACTAAATG/TGGTTACTAAAT...GCCAG\|TTC | 1 | 1 | 7.544 |
3821993 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-gnl\|I4U23\|004462-T1 720763 | 3 | 13681913 | 13681969 | Adineta vaga 104782 | AAT\|GTAAGAGATG...GTTTTTTCAAAT/AAAATTCTCATT...CATAG\|GAT | 0 | 1 | 11.783 |
3821994 | GT-AG | 0 | 0.0006166124601568 | 55 | rna-gnl\|I4U23\|004462-T1 720763 | 4 | 13682260 | 13682314 | Adineta vaga 104782 | TGC\|GTAAATAATT...TATTCTTTATCA/TTATTCTTTATC...TTTAG\|TTG | 2 | 1 | 16.159 |
3821995 | GT-AG | 0 | 0.0240504054658082 | 55 | rna-gnl\|I4U23\|004462-T1 720763 | 5 | 13682527 | 13682581 | Adineta vaga 104782 | TTG\|GTATGTTTTT...TCTTTTTAAATT/TTTAAATTGATA...TATAG\|GTA | 1 | 1 | 19.357 |
3821996 | GT-AG | 0 | 0.0002078165812054 | 62 | rna-gnl\|I4U23\|004462-T1 720763 | 6 | 13682837 | 13682898 | Adineta vaga 104782 | ATG\|GTAAATATTC...TTTTCTTTGATA/TTTTCTTTGATA...TTCAG\|TTG | 1 | 1 | 23.205 |
3821997 | GT-AG | 0 | 1.000000099473604e-05 | 62 | rna-gnl\|I4U23\|004462-T1 720763 | 7 | 13683265 | 13683326 | Adineta vaga 104782 | CTA\|GTAAGATAAT...TTTCTCTTTATA/TTTCTCTTTATA...TTAAG\|ATG | 1 | 1 | 28.727 |
3821998 | GT-AG | 0 | 1.9869703766866268e-05 | 57 | rna-gnl\|I4U23\|004462-T1 720763 | 8 | 13683575 | 13683631 | Adineta vaga 104782 | TCG\|GTAAATCATA...CTTTTTTTATAA/TCTTTTTTTATA...TAAAG\|TTG | 0 | 1 | 32.468 |
3821999 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-gnl\|I4U23\|004462-T1 720763 | 9 | 13683758 | 13683813 | Adineta vaga 104782 | ACT\|GTAAGAGTTT...CAAATATTAAAT/ATTAAATTCATC...TTTAG\|GCT | 0 | 1 | 34.369 |
3822000 | GT-AG | 0 | 0.0001558861579124 | 59 | rna-gnl\|I4U23\|004462-T1 720763 | 10 | 13683929 | 13683987 | Adineta vaga 104782 | ATA\|GTAAATCTAT...TATTTCATAATT/TTTTTATTGAAC...AATAG\|AAT | 1 | 1 | 36.104 |
3822001 | GT-AG | 0 | 0.0005441433094466 | 63 | rna-gnl\|I4U23\|004462-T1 720763 | 11 | 13684065 | 13684127 | Adineta vaga 104782 | CAA\|GTATGATCAA...TGTTTCTTTGTT/TCGTACATGATG...AATAG\|GCA | 0 | 1 | 37.266 |
3822002 | GT-AG | 0 | 0.000219549516788 | 58 | rna-gnl\|I4U23\|004462-T1 720763 | 12 | 13684371 | 13684428 | Adineta vaga 104782 | CGT\|GTAAATCATT...AAATCTTTGAAT/TTTCAATTCATA...TATAG\|TCA | 0 | 1 | 40.932 |
3822003 | GT-AG | 0 | 0.0335811069039294 | 53 | rna-gnl\|I4U23\|004462-T1 720763 | 13 | 13684832 | 13684884 | Adineta vaga 104782 | TGG\|GTATGTTCTT...TTCTTCTTGAAT/TTCTTCTTGAAT...TGTAG\|AAT | 1 | 1 | 47.013 |
3822004 | GT-AG | 0 | 0.0007699486031618 | 52 | rna-gnl\|I4U23\|004462-T1 720763 | 14 | 13685142 | 13685193 | Adineta vaga 104782 | ATT\|GTAAGTTTGA...TGAATTTTGATT/TGAATTTTGATT...TGAAG\|GTT | 0 | 1 | 50.89 |
3822005 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-gnl\|I4U23\|004462-T1 720763 | 15 | 13685422 | 13685470 | Adineta vaga 104782 | AAG\|GTATGACAAA...AGATCTTTGAAC/CATATGTTTACT...TTCAG\|TTG | 0 | 1 | 54.33 |
3822006 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl\|I4U23\|004462-T1 720763 | 16 | 13685573 | 13685625 | Adineta vaga 104782 | AAA\|GTAAGAGAAA...GATCTTTTGACT/GATCTTTTGACT...TTCAG\|GTG | 0 | 1 | 55.869 |
3822007 | GT-AG | 0 | 0.0066112756149843 | 58 | rna-gnl\|I4U23\|004462-T1 720763 | 17 | 13685742 | 13685799 | Adineta vaga 104782 | AAA\|GTATTTCATT...ATATTCTTTGCA/TCTTTGCACATT...TTCAG\|ATC | 2 | 1 | 57.619 |
3822008 | GT-AG | 0 | 1.0306851171304396 | 61 | rna-gnl\|I4U23\|004462-T1 720763 | 18 | 13685966 | 13686026 | Adineta vaga 104782 | ATT\|GTATGTTTTA...CATTTTTTATTC/ACATTTTTTATT...AATAG\|ATT | 0 | 1 | 60.124 |
3822009 | GT-AG | 0 | 9.241883292235864e-05 | 58 | rna-gnl\|I4U23\|004462-T1 720763 | 19 | 13686681 | 13686738 | Adineta vaga 104782 | AAA\|GTAAATCATT...TTTACTTTAGCG/TAGCGATTGATC...TATAG\|ATT | 0 | 1 | 69.991 |
3822010 | GT-AG | 0 | 0.0007900602124227 | 57 | rna-gnl\|I4U23\|004462-T1 720763 | 20 | 13686746 | 13686802 | Adineta vaga 104782 | TTT\|GTTTGTCAAT...TATTCTTTATAT/CCAATTATGACA...AATAG\|CTG | 1 | 1 | 70.097 |
3822011 | GT-AG | 0 | 4.1647088070159634e-05 | 57 | rna-gnl\|I4U23\|004462-T1 720763 | 21 | 13686874 | 13686930 | Adineta vaga 104782 | GAG\|GTTGATTTTC...TTATTTTTATTA/GTTATTTTTATT...ATTAG\|ATT | 0 | 1 | 71.168 |
3822012 | GT-AG | 0 | 1.594044247566649e-05 | 58 | rna-gnl\|I4U23\|004462-T1 720763 | 22 | 13687189 | 13687246 | Adineta vaga 104782 | AAT\|GTAAGAATCA...TATATCTTGACG/TATATCTTGACG...TGTAG\|ATA | 0 | 1 | 75.06 |
3822013 | GT-AG | 0 | 9.631220032686144e-05 | 46 | rna-gnl\|I4U23\|004462-T1 720763 | 23 | 13687419 | 13687464 | Adineta vaga 104782 | GAG\|GTTTGTTTAT...TATCTGTTATTT/ATATCTGTTATT...TGTAG\|ATC | 1 | 1 | 77.655 |
3822014 | GT-AG | 0 | 1.000000099473604e-05 | 59 | rna-gnl\|I4U23\|004462-T1 720763 | 24 | 13687906 | 13687964 | Adineta vaga 104782 | AAG\|GTAAAATTCC...TTATCTATATTT/CTATATTTGAAT...TTTAG\|ATT | 1 | 1 | 84.309 |
3822015 | GT-AG | 0 | 5.300130599116179e-05 | 53 | rna-gnl\|I4U23\|004462-T1 720763 | 25 | 13688712 | 13688764 | Adineta vaga 104782 | CAG\|GTCTATTGAA...TTTTTTTCGATT/ATTTTAGTTATT...TTCAG\|GAA | 1 | 1 | 95.579 |
3822016 | GT-AG | 0 | 1.000000099473604e-05 | 3091 | rna-gnl\|I4U23\|004462-T1 720763 | 26 | 13688878 | 13691968 | Adineta vaga 104782 | CGA\|GTAAGTGCTC...AAAGTTATACTA/ACAAAAGTTATA...TAAAG\|ATA | 0 | 1 | 97.284 |
3822017 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-gnl\|I4U23\|004462-T1 720763 | 27 | 13692006 | 13692056 | Adineta vaga 104782 | ATA\|GTGATGTTGA...ACTTCTTTACAT/AACTTCTTTACA...GTCAG\|AGC | 1 | 1 | 97.842 |
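The 27 rows above can be summarized with ordinary SQL aggregates. The sketch below is a hedged illustration using Python's standard `sqlite3` module: it loads only `ordinal_index`, `length`, and `phase` (copied from the rows above) into an in-memory stand-in table, then counts introns per phase and finds the longest intron. The reduced three-column table is an assumption made to keep the example short; the real `introns` table has all fourteen columns.

```python
import sqlite3

# (ordinal_index, length, phase) for the 27 introns of transcript 720763,
# copied from the rows listed above.
ROWS = [
    (1, 50, 1), (2, 69, 1), (3, 57, 0), (4, 55, 2), (5, 55, 1), (6, 62, 1),
    (7, 62, 1), (8, 57, 0), (9, 56, 0), (10, 59, 1), (11, 63, 0), (12, 58, 0),
    (13, 53, 1), (14, 52, 0), (15, 49, 0), (16, 53, 0), (17, 58, 2), (18, 61, 0),
    (19, 58, 0), (20, 57, 1), (21, 57, 0), (22, 58, 0), (23, 46, 1), (24, 59, 1),
    (25, 53, 1), (26, 3091, 0), (27, 51, 1),
]

conn = sqlite3.connect(":memory:")
# Reduced stand-in for the introns table: only the three columns used here.
conn.execute(
    "CREATE TABLE introns (ordinal_index INTEGER, length INTEGER, phase INTEGER)"
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?)", ROWS)

# Number of introns in each phase, as {phase: count}.
phase_counts = dict(
    conn.execute("SELECT phase, COUNT(*) FROM introns GROUP BY phase")
)

# The single longest intron, as (ordinal_index, length).
longest = conn.execute(
    "SELECT ordinal_index, length FROM introns ORDER BY length DESC LIMIT 1"
).fetchone()

print(phase_counts, longest)
```

For these rows the counts are 13 introns in phase 0, 12 in phase 1, and 2 in phase 2, and intron 26 (3091 bp) stands out against the other 26 introns, which are all between 46 and 69 bp long.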
CREATE TABLE "introns" (
    "id" INTEGER,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "taxonomy_id" INTEGER,
    "scored_motifs" TEXT,
    "phase" INTEGER,
    "in_cds" INTEGER,
    "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase] ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score] ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);
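The schema indexes `transcript_id`, so a filter such as `transcript_id = 720763` should not need a full table scan. The following Python sketch, assuming a local SQLite build via the standard `sqlite3` module, recreates the table and that one index in memory and asks SQLite for its query plan. The referenced `transcripts` and `genomes` tables are deliberately not created here; SQLite accepts the foreign key declarations without them, and nothing in this sketch inserts data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Table definition and one index, taken from the schema above.
conn.executescript("""
CREATE TABLE "introns" (
    "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER,
    "score" REAL, "length" INTEGER, "transcript_id" INTEGER,
    "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER,
    "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER,
    "in_cds" INTEGER, "relative_position" REAL,
    PRIMARY KEY ([id]),
    FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
    FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]);
""")

# Ask SQLite how it would run the per-transcript lookup.
plan = conn.execute(
    "EXPLAIN QUERY PLAN "
    "SELECT * FROM introns WHERE transcript_id = ? ORDER BY ordinal_index",
    (720763,),
).fetchall()

# The human-readable step description is the fourth column of each plan row.
plan_text = " / ".join(row[3] for row in plan)
print(plan_text)
```

The exact wording of the plan varies between SQLite versions, but it should name `idx_introns_transcript_id` as the index used for the lookup.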