introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
17 rows where transcript_id = 720813
This data as json, CSV (advanced)
Suggested facets: score, length, phase
id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3822763 | GT-AG | 0 | 0.5027524801137827 | 53 | rna-gnl|I4U23|004107-T1 720813 | 1 | 12691930 | 12691982 | Adineta vaga 104782 | TTC|GTATGTTTTA...ACTCTTTTATCG/GTATTACTAATT...TTTAG|TGG | 2 | 1 | 7.153 |
3822764 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|004107-T1 720813 | 2 | 12691694 | 12691753 | Adineta vaga 104782 | CAG|GTAATACACT...TATTTCTTTTTC/AATAGGATGATA...GTTAG|ATC | 1 | 1 | 10.602 |
3822765 | GT-AG | 0 | 1.000000099473604e-05 | 614 | rna-gnl|I4U23|004107-T1 720813 | 3 | 12690927 | 12691540 | Adineta vaga 104782 | CAG|GTAAGAGAAC...TTTTTTCTATTT/TTTCGATTCAGT...ATTAG|CAA | 1 | 1 | 13.6 |
3822766 | GT-AG | 0 | 0.0001479333667723 | 54 | rna-gnl|I4U23|004107-T1 720813 | 4 | 12690584 | 12690637 | Adineta vaga 104782 | TTC|GTAAGTTTAA...TCGTACTTACTT/TTTATATTCATT...TTTAG|TGG | 2 | 1 | 19.263 |
3822767 | GT-AG | 0 | 1.000000099473604e-05 | 60 | rna-gnl|I4U23|004107-T1 720813 | 5 | 12690348 | 12690407 | Adineta vaga 104782 | CAG|GTAATAAACT...ATTTCTTTGTCC/AATAGGATGATA...GCTAG|ATC | 1 | 1 | 22.712 |
3822768 | GT-AG | 0 | 1.000000099473604e-05 | 375 | rna-gnl|I4U23|004107-T1 720813 | 6 | 12689820 | 12690194 | Adineta vaga 104782 | CAG|GTAAGAATTG...TTTCTTTTACCA/TTTTCTTTTACC...ATTAG|TAA | 1 | 1 | 25.71 |
3822769 | GT-AG | 0 | 0.0016809491236336 | 55 | rna-gnl|I4U23|004107-T1 720813 | 7 | 12689091 | 12689145 | Adineta vaga 104782 | GCA|GTAAACTATT...ATATCTTTCTCT/CTGTGTCTCACA...TTTAG|GAG | 0 | 1 | 38.918 |
3822770 | GT-AG | 0 | 0.0002223880310992 | 63 | rna-gnl|I4U23|004107-T1 720813 | 8 | 12688933 | 12688995 | Adineta vaga 104782 | CTC|GTAAATATAA...TCTTTTTTGAAC/ATAATATTGATA...ATTAG|GCG | 2 | 1 | 40.78 |
3822771 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-gnl|I4U23|004107-T1 720813 | 9 | 12688746 | 12688795 | Adineta vaga 104782 | AGG|GTTAATTGAA...ACATTCATAATC/ATCATATTTATC...TATAG|ATA | 1 | 1 | 43.465 |
3822772 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-gnl|I4U23|004107-T1 720813 | 10 | 12688588 | 12688641 | Adineta vaga 104782 | CAG|GTAATACTTG...ATTTTCATAACT/TTTATTTTCATA...TGTAG|TTG | 0 | 1 | 45.503 |
3822773 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|004107-T1 720813 | 11 | 12688450 | 12688504 | Adineta vaga 104782 | CTG|GTTAGTATTA...TTTTTCTCATTC/CTTTTTCTCATT...TCAAG|GAA | 2 | 1 | 47.129 |
3822774 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-gnl|I4U23|004107-T1 720813 | 12 | 12688174 | 12688225 | Adineta vaga 104782 | AAA|GTAAGAAAAT...GTTTTTTCAATA/TGTTTTTTCAAT...TACAG|AAG | 1 | 1 | 51.519 |
3822775 | GT-AG | 0 | 3.5877972930700844e-05 | 75 | rna-gnl|I4U23|004107-T1 720813 | 13 | 12687895 | 12687969 | Adineta vaga 104782 | CAA|GTAAGTTCTT...CTATGTTTAATG/CTATGTTTAATG...TATAG|TCA | 1 | 1 | 55.516 |
3822776 | GT-AG | 0 | 2.208330516512853e-05 | 54 | rna-gnl|I4U23|004107-T1 720813 | 14 | 12687295 | 12687348 | Adineta vaga 104782 | CAG|GTAAAATTTC...AATATCTTAATC/AATATCTTAATC...GCTAG|TAG | 1 | 1 | 66.216 |
3822777 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-gnl|I4U23|004107-T1 720813 | 15 | 12686693 | 12686745 | Adineta vaga 104782 | GAG|GTAAGAAAAT...AGTTCTATATCG/TTTCGTTTCAAT...TCTAG|TGG | 1 | 1 | 76.974 |
3822778 | GT-AG | 0 | 1.000000099473604e-05 | 55 | rna-gnl|I4U23|004107-T1 720813 | 16 | 12686092 | 12686146 | Adineta vaga 104782 | CAG|GTAAGATCTT...ATATTCATATAA/TATATACTAACT...TTTAG|ATG | 1 | 1 | 87.674 |
3822779 | GT-AG | 0 | 3.4091169929491495e-05 | 55 | rna-gnl|I4U23|004107-T1 720813 | 17 | 12685971 | 12686025 | Adineta vaga 104782 | CAG|GTTTGTTATA...AGGTCGTTAATA/TTAATATTCACT...TTTAG|CTG | 1 | 1 | 88.967 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" ( "id" INTEGER, "dinucleotide_pair" TEXT, "is_minor" INTEGER, "score" REAL, "length" INTEGER, "transcript_id" INTEGER, "ordinal_index" INTEGER, "start" INTEGER, "end" INTEGER, "taxonomy_id" INTEGER, "scored_motifs" TEXT, "phase" INTEGER, "in_cds" INTEGER, "relative_position" REAL ,PRIMARY KEY ([id]), FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]), FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id]) ); CREATE INDEX [idx_introns_transcript_id] ON [introns] ([transcript_id]); CREATE INDEX [idx_introns_taxonomy_id] ON [introns] ([taxonomy_id]); CREATE INDEX [idx_introns_phase] ON [introns] ([phase]); CREATE INDEX [idx_introns_is_minor] ON [introns] ([is_minor]); CREATE INDEX [idx_introns_dinucleotide_pair] ON [introns] ([dinucleotide_pair]); CREATE INDEX [idx_introns_score] ON [introns] ([score]); CREATE INDEX [idx_introns_in_cds] ON [introns] ([in_cds]);