introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
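In the rows shown on this page, `length` appears to equal `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The sketch below checks that relation on three rows copied from this page. It is an observation about these rows only, not a documented guarantee of the database, and the in-memory table is a reduced stand-in for the real `introns` table.

```python
import sqlite3

# Three rows copied from this page: (id, length, start, end).
ROWS = [
    (20494601, 77691, 28232899, 28310589),
    (20494602, 39521, 28310636, 28350156),
    (20494603, 1730, 28350682, 28352411),
]

conn = sqlite3.connect(":memory:")
# Assumption: a reduced stand-in table holding only the columns needed here.
# "end" is an SQL keyword, so both coordinate columns are quoted.
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, length INTEGER, "start" INTEGER, "end" INTEGER)'
)
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", ROWS)

# Count rows whose length differs from the inclusive span end - start + 1.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE length != "end" - "start" + 1'
).fetchone()[0]
print(mismatches)
```

A result of zero means every sampled row is consistent with inclusive coordinates; running the same query against the full table would show whether the relation holds more widely.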
29 rows where transcript_id = 3982004
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20494601 | GT-AG | 0 | 1.000000099473604e-05 | 77691 | rna-XM_036863344.1 3982004 | 1 | 28232899 | 28310589 | Balaenoptera musculus 9771 | AGG\|GTAGGTGAGG...TTTTTCTTTTTT/ATGATAATAATG...TTCAG\|TTT | 0 | 1 | 0.839 |
| 20494602 | GT-AG | 0 | 8.415838957318387e-05 | 39521 | rna-XM_036863344.1 3982004 | 2 | 28310636 | 28350156 | Balaenoptera musculus 9771 | ACT\|GTAAGTACCG...TCATCTTTAGCA/CAGTATTTTACT...TTAAG\|GTG | 1 | 1 | 1.828 |
| 20494603 | GT-AG | 0 | 1.000000099473604e-05 | 1730 | rna-XM_036863344.1 3982004 | 3 | 28350682 | 28352411 | Balaenoptera musculus 9771 | CAG\|GTAATAATTG...TTTGTTTTGTTT/TTGTTATTCATA...AATAG\|CTA | 1 | 1 | 13.118 |
| 20494604 | GT-AG | 0 | 4.168619785152384e-05 | 356 | rna-XM_036863344.1 3982004 | 4 | 28352474 | 28352829 | Balaenoptera musculus 9771 | AGG\|GTAAGTTTTC...TAATCCTTTATA/TATTTATTAATC...TTTAG\|GAT | 0 | 1 | 14.452 |
| 20494605 | GT-AG | 0 | 0.0011885434142229 | 6852 | rna-XM_036863344.1 3982004 | 5 | 28353001 | 28359852 | Balaenoptera musculus 9771 | CAA\|GTAAACATTT...TTTTCTTTACAT/TATTTATTTATT...AAAAG\|GAG | 0 | 1 | 18.129 |
| 20494606 | GT-AG | 0 | 1.000000099473604e-05 | 231 | rna-XM_036863344.1 3982004 | 6 | 28359938 | 28360168 | Balaenoptera musculus 9771 | TTG\|GTAAGAAGCA...TTTTTTTCAATT/TTTTTTTTCAAT...TAAAG\|CTC | 1 | 1 | 19.957 |
| 20494607 | GT-AG | 0 | 0.0003355315546992 | 3845 | rna-XM_036863344.1 3982004 | 7 | 28360425 | 28364269 | Balaenoptera musculus 9771 | AAA\|GTATAGCAGA...CTTATTTTATCT/TTTGTTGTCATT...TTTAG\|AGA | 2 | 1 | 25.462 |
| 20494608 | GT-AG | 0 | 0.0004484195540219 | 111 | rna-XM_036863344.1 3982004 | 8 | 28364445 | 28364555 | Balaenoptera musculus 9771 | GTT\|GTAAGTTATC...CAGCTTTTGATG/TTGTTTCTAAGT...CCTAG\|ATT | 0 | 1 | 29.226 |
| 20494609 | GT-AG | 0 | 1.561518629540321e-05 | 3535 | rna-XM_036863344.1 3982004 | 9 | 28364769 | 28368303 | Balaenoptera musculus 9771 | CAG\|GTACATGACC...GTTATTTTATTC/TGTTATTTTATT...AATAG\|AAA | 0 | 1 | 33.806 |
| 20494610 | GT-AG | 0 | 1.000000099473604e-05 | 498 | rna-XM_036863344.1 3982004 | 10 | 28368455 | 28368952 | Balaenoptera musculus 9771 | CAG\|GTAATACTGG...TTTATTTTGATT/TTTATTTTGATT...CTAAG\|GTG | 1 | 1 | 37.054 |
| 20494611 | GT-AG | 0 | 3.532061411889399e-05 | 430 | rna-XM_036863344.1 3982004 | 11 | 28369069 | 28369498 | Balaenoptera musculus 9771 | CAG\|GTATGGCTTC...CTTGTTTGAACA/AGTAGATTGATC...TTCAG\|AAT | 0 | 1 | 39.548 |
| 20494612 | GT-AG | 0 | 1.000000099473604e-05 | 173 | rna-XM_036863344.1 3982004 | 12 | 28369775 | 28369947 | Balaenoptera musculus 9771 | AAG\|GTAGAGTATA...TAGTACTTAATG/AAATGTTTAATT...TGCAG\|GGT | 0 | 1 | 45.484 |
| 20494613 | GT-AG | 0 | 1.000000099473604e-05 | 800 | rna-XM_036863344.1 3982004 | 13 | 28370099 | 28370898 | Balaenoptera musculus 9771 | AAG\|GTAAAGATAG...TTAATTTTATCT/TAGATACTTATT...TTTAG\|GTT | 1 | 1 | 48.731 |
| 20494614 | GT-AG | 0 | 1.000000099473604e-05 | 783 | rna-XM_036863344.1 3982004 | 14 | 28371072 | 28371854 | Balaenoptera musculus 9771 | CAG\|GTAAATATAA...TGTTTTTTACTT/TTGTTTTTTACT...TTCAG\|ATG | 0 | 1 | 52.452 |
| 20494615 | GT-AG | 0 | 1.9423321904300487e-05 | 555 | rna-XM_036863344.1 3982004 | 15 | 28372006 | 28372560 | Balaenoptera musculus 9771 | CAG\|GTAAAATTAA...ATTTTCTTATTT/GATTTTCTTATT...ATCAG\|CAA | 1 | 1 | 55.699 |
| 20494616 | GT-AG | 0 | 0.0001293367152945 | 332 | rna-XM_036863344.1 3982004 | 16 | 28372702 | 28373033 | Balaenoptera musculus 9771 | AAG\|GTATGACTTG...ATGTTTTTATTT/TATGTTTTTATT...ATCAG\|GTG | 1 | 1 | 58.731 |
| 20494617 | GT-AG | 0 | 0.0002211331981918 | 960 | rna-XM_036863344.1 3982004 | 17 | 28373072 | 28374031 | Balaenoptera musculus 9771 | GAT\|GTAAGTTTTA...TTTCTTGTAATT/TGGTTTCTGACT...ACTAG\|GCA | 0 | 1 | 59.548 |
| 20494618 | GT-AG | 0 | 2.8814514439424303e-05 | 2808 | rna-XM_036863344.1 3982004 | 18 | 28374177 | 28376984 | Balaenoptera musculus 9771 | CAG\|GTTTGTTCCT...TTTTTTTTTTCT/TCGTTTCTAACC...ATTAG\|AAG | 1 | 1 | 62.667 |
| 20494619 | GT-AG | 0 | 1.000000099473604e-05 | 710 | rna-XM_036863344.1 3982004 | 19 | 28377104 | 28377813 | Balaenoptera musculus 9771 | CCA\|GTGAGTAATC...ATAGTTTTGATG/ATAGTTTTGATG...TTCAG\|ATG | 0 | 1 | 65.226 |
| 20494620 | GT-AG | 0 | 1.000000099473604e-05 | 2956 | rna-XM_036863344.1 3982004 | 20 | 28377986 | 28380941 | Balaenoptera musculus 9771 | GAG\|GTTGGTATAA...ATTGACTTATTA/AATTGACTTATT...TTTAG\|GAT | 1 | 1 | 68.925 |
| 20494621 | GT-AG | 0 | 1.4859476270361512e-05 | 413 | rna-XM_036863344.1 3982004 | 21 | 28380996 | 28381408 | Balaenoptera musculus 9771 | AAG\|GTTTGTAATT...AGCGTCTTATCT/TCTTATCTCATT...TTCAG\|ACC | 1 | 1 | 70.086 |
| 20494622 | GT-AG | 0 | 1.000000099473604e-05 | 417 | rna-XM_036863344.1 3982004 | 22 | 28381589 | 28382005 | Balaenoptera musculus 9771 | CAG\|GTTTGTGCAG...TTTCTCTTGGTT/ATTTATCTCACA...CGTAG\|AAT | 1 | 1 | 73.957 |
| 20494623 | GT-AG | 0 | 0.000100648372662 | 11621 | rna-XM_036863344.1 3982004 | 23 | 28382101 | 28393721 | Balaenoptera musculus 9771 | AGA\|GTAAGCAGTA...AAATTTTTGAAA/CTAAATTTTATG...TACAG\|GGA | 0 | 1 | 76.0 |
| 20494624 | GT-AG | 0 | 1.000000099473604e-05 | 184 | rna-XM_036863344.1 3982004 | 24 | 28393891 | 28394074 | Balaenoptera musculus 9771 | GTG\|GTAAATACAC...TTCTCATTAATT/AAACTTCTCATT...GTCAG\|GAG | 1 | 1 | 79.634 |
| 20494625 | GT-AG | 0 | 1.000000099473604e-05 | 2216 | rna-XM_036863344.1 3982004 | 25 | 28394171 | 28396386 | Balaenoptera musculus 9771 | GAG\|GTAAAACTTT...ATGTTATTATTT/AATGTTATTATT...CACAG\|TCC | 1 | 1 | 81.699 |
| 20494626 | GT-AG | 0 | 1.000000099473604e-05 | 2580 | rna-XM_036863344.1 3982004 | 26 | 28396539 | 28399118 | Balaenoptera musculus 9771 | CAG\|GTATGAAATT...TTTTTTGTATCA/TTTTGTATCAAC...TGTAG\|TAC | 0 | 1 | 84.968 |
| 20494627 | GT-AG | 0 | 1.000000099473604e-05 | 1485 | rna-XM_036863344.1 3982004 | 27 | 28399231 | 28400715 | Balaenoptera musculus 9771 | ATG\|GTAGGATGAT...GTTTCTTTTTCC/CAGGAGCTTAGT...TTCAG\|GTT | 1 | 1 | 87.376 |
| 20494628 | GT-AG | 0 | 1.000000099473604e-05 | 8832 | rna-XM_036863344.1 3982004 | 28 | 28400965 | 28409796 | Balaenoptera musculus 9771 | CAG\|GTGTGTGGTG...AATACTTTGATA/ACTTTTCTAATA...TCAAG\|ATA | 1 | 1 | 92.731 |
| 20494629 | GT-AG | 0 | 1.000000099473604e-05 | 872 | rna-XM_036863344.1 3982004 | 29 | 28409927 | 28410798 | Balaenoptera musculus 9771 | ATG\|GTAAGTGATT...TTTTTCTTTTTT/TACTCTCTGATC...GATAG\|TTC | 2 | 1 | 95.527 |
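The filter behind this view (`transcript_id = 3982004`) can be combined with a `GROUP BY` to summarize a column such as `phase`. The following is a minimal sketch using Python's standard `sqlite3` module; the phase values are copied from the 29 rows above, while the reduced in-memory table is an assumption made only to keep the example self-contained.

```python
import sqlite3

TRANSCRIPT_ID = 3982004
# Phase of each intron in ordinal_index order (1..29), copied from the table above.
PHASES = [0, 1, 1, 0, 0, 1, 2, 0, 0, 1, 0, 0, 1, 0, 1,
          1, 0, 1, 0, 1, 1, 1, 0, 1, 1, 0, 1, 1, 2]

conn = sqlite3.connect(":memory:")
# Assumption: a reduced stand-in for the introns table with only four columns.
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, phase INTEGER)"
)
# The ids on this page run from 20494601 to 20494629, matching ordinal_index 1..29.
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [(20494600 + i, TRANSCRIPT_ID, i, p) for i, p in enumerate(PHASES, start=1)],
)

# Same filter as this page, grouped by phase.
phase_counts = conn.execute(
    "SELECT phase, COUNT(*) FROM introns "
    "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
    (TRANSCRIPT_ID,),
).fetchall()
print(phase_counts)  # [(0, 12), (1, 15), (2, 2)]
```

Against the real database the same `SELECT` should work unchanged, since `transcript_id` and `phase` are both columns of `introns`.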
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
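To see how these indexes affect a lookup, the schema can be loaded into an in-memory SQLite database and inspected with `EXPLAIN QUERY PLAN`. The sketch below is illustrative only: it recreates the table and just one of the indexes, leaves the referenced `transcripts` and `genomes` tables undefined (SQLite accepts this at table-creation time), and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

# Schema copied from this page, with only the transcript_id index kept for brevity.
SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Ask SQLite how it would run the filter used by this page.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (3982004,),
).fetchall()
# The last column of each plan row holds the human-readable description.
details = [row[-1] for row in plan]
print(details)
```

If the plan text mentions `idx_introns_transcript_id`, the lookup is served by the index rather than by a full table scan.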