introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
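The value ranges given in the column descriptions above can be turned into a small sanity check. The following is only a sketch: the function name `column_problems` is hypothetical, and the rule that a null phase and `in_cds = 0` always occur together is an assumption read off the descriptions, not something the database is known to enforce.

```python
from typing import List, Optional


def column_problems(is_minor: int, score: float,
                    phase: Optional[int], in_cds: int) -> List[str]:
    """Return readable violations of the documented column ranges (hypothetical helper)."""
    problems = []
    if is_minor not in (0, 1):
        problems.append("is_minor must be 0 or 1")
    if not 0.0 <= score <= 100.0:
        problems.append("score must lie in 0-100")
    if in_cds not in (0, 1):
        problems.append("in_cds must be 0 or 1")
    if phase is not None and phase not in (0, 1, 2):
        problems.append("phase must be 0, 1, 2, or null")
    # Assumption: phase is null exactly for introns outside the coding sequence.
    if (phase is None) != (in_cds == 0):
        problems.append("phase should be null exactly when in_cds is 0")
    return problems


# Values taken from intron 3437002 (transcript 623764): not minor, phase 2, in CDS.
print(column_problems(is_minor=0, score=0.0004386836627068, phase=2, in_cds=1))
```

An empty list means the row is consistent with the descriptions; each returned string names one rule that was broken.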
27 rows where transcript_id = 623764
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3437002 | GT-AG | 0 | 0.0004386836627068 | 52 | rna-EDS130_LOCUS117 623764 | 1 | 310639 | 310690 | Adineta ricciae 249248 | ACC\|GTACGTGTTA...CGTGTTTTTATA/CGTGTTTTTATA...TTTAG\|ATT | 2 | 1 | 0.759 |
| 3437003 | GT-AG | 0 | 0.0002737600548136 | 54 | rna-EDS130_LOCUS117 623764 | 2 | 310729 | 310782 | Adineta ricciae 249248 | AAA\|GTTTGTTGAC...ATTTTCTTTTCG/TATCAAATAATC...TTTAG\|ACG | 1 | 1 | 1.582 |
| 3437004 | GT-AG | 0 | 1.6950104850610713e-05 | 59 | rna-EDS130_LOCUS117 623764 | 3 | 310930 | 310988 | Adineta ricciae 249248 | CAG\|GTACTTCATT...TCATTCATGAAC/TGGATTTTCATT...CAAAG\|AAA | 1 | 1 | 4.768 |
| 3437005 | GT-AG | 0 | 4.846485938839936e-05 | 226 | rna-EDS130_LOCUS117 623764 | 4 | 311081 | 311306 | Adineta ricciae 249248 | GAG\|GTAAGCTTCG...TCACAGTTAACT/AGTTAACTAATT...TTTAG\|AAA | 0 | 1 | 6.762 |
| 3437006 | GT-AG | 0 | 2.050829690329376e-05 | 49 | rna-EDS130_LOCUS117 623764 | 5 | 311391 | 311439 | Adineta ricciae 249248 | TTG\|GTAAGTTTGA...ATTTTTCTATTG/CATTTTTCTATT...TTCAG\|TTG | 0 | 1 | 8.583 |
| 3437007 | GT-AG | 0 | 1.000000099473604e-05 | 43 | rna-EDS130_LOCUS117 623764 | 6 | 311490 | 311532 | Adineta ricciae 249248 | GCG\|GTAAGATATT...TTATCATTGTCT/GTGGTTATCATT...TTTAG\|ATG | 2 | 1 | 9.666 |
| 3437008 | GT-AG | 0 | 1.1538363574772494e-05 | 62 | rna-EDS130_LOCUS117 623764 | 7 | 311615 | 311676 | Adineta ricciae 249248 | AAA\|GTAATTGTGA...TGATCTTTCGTT/GAGAGACTGATC...ATTAG\|ATT | 0 | 1 | 11.443 |
| 3437009 | GT-AG | 0 | 1.000000099473604e-05 | 65 | rna-EDS130_LOCUS117 623764 | 8 | 311818 | 311882 | Adineta ricciae 249248 | AAA\|GTGAGTGATC...TCCATTTGAATT/GAATTGCTAATT...TCTAG\|TCA | 0 | 1 | 14.499 |
| 3437010 | GT-AG | 0 | 1.1788929723321394e-05 | 532 | rna-EDS130_LOCUS117 623764 | 9 | 312026 | 312557 | Adineta ricciae 249248 | TCG\|GTAAGCATAT...AAAAATTTATCT/TAAAAATTTATC...TTTAG\|TGT | 2 | 1 | 17.599 |
| 3437011 | GT-AG | 0 | 1.000000099473604e-05 | 57 | rna-EDS130_LOCUS117 623764 | 10 | 312761 | 312817 | Adineta ricciae 249248 | GTG\|GTTTGTAATT...AAAGTTTCAATC/GAAAGTTTCAAT...CTTAG\|TAT | 1 | 1 | 21.998 |
| 3437012 | GT-AG | 0 | 1.9264510212346897e-05 | 55 | rna-EDS130_LOCUS117 623764 | 11 | 313077 | 313131 | Adineta ricciae 249248 | AAG\|GTTTGTTCGG...CATTTCTCGAAA/ATGATTGTGATC...TCTAG\|TTA | 2 | 1 | 27.612 |
| 3437013 | GT-AG | 0 | 1.000000099473604e-05 | 51 | rna-EDS130_LOCUS117 623764 | 12 | 313224 | 313274 | Adineta ricciae 249248 | TGG\|GTAAAAAATT...CTTTTGTTGACC/CTTTTGTTGACC...TTCAG\|AGG | 1 | 1 | 29.606 |
| 3437014 | GT-AG | 0 | 2.544667348220073e-05 | 60 | rna-EDS130_LOCUS117 623764 | 13 | 313381 | 313440 | Adineta ricciae 249248 | GTC\|GTAAATAAAT...AAGTCTTTTTCC/ATGGAGATAAAC...CATAG\|ATT | 2 | 1 | 31.903 |
| 3437015 | GT-AG | 0 | 1.000000099473604e-05 | 67 | rna-EDS130_LOCUS117 623764 | 14 | 313527 | 313593 | Adineta ricciae 249248 | GAT\|GTGAGTTAAA...CAATTTTTCGTA/AAGATCGTCAAT...TTTAG\|ATC | 1 | 1 | 33.767 |
| 3437016 | GT-AG | 0 | 1.000000099473604e-05 | 54 | rna-EDS130_LOCUS117 623764 | 15 | 314002 | 314055 | Adineta ricciae 249248 | TGG\|GTAAAATGGA...GAATTTTTTGCA/ATTTTTTGCAAT...TTTAG\|GTC | 1 | 1 | 42.609 |
| 3437017 | GT-AG | 0 | 1.000000099473604e-05 | 49 | rna-EDS130_LOCUS117 623764 | 16 | 314255 | 314303 | Adineta ricciae 249248 | ACG\|GTAAATGTAA...ACTACTTGAAAA/GTATGGATCAAT...TTTAG\|TAT | 2 | 1 | 46.922 |
| 3437018 | GT-AG | 0 | 1.000000099473604e-05 | 53 | rna-EDS130_LOCUS117 623764 | 17 | 314456 | 314508 | Adineta ricciae 249248 | AAA\|GTTCGTACAT...CGAAATTTGACA/ATCTCGCTCATC...TTCAG\|AAA | 1 | 1 | 50.217 |
| 3437019 | GT-AG | 0 | 1.000000099473604e-05 | 52 | rna-EDS130_LOCUS117 623764 | 18 | 314771 | 314822 | Adineta ricciae 249248 | CGA\|GTAAGAATTT...TTATTTTGAAAA/TTGTGAATCATA...ATTAG\|TGA | 2 | 1 | 55.895 |
| 3437020 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-EDS130_LOCUS117 623764 | 19 | 314884 | 314933 | Adineta ricciae 249248 | CAA\|GTGAGAAATG...ATCTCTTTCATT/ATCTCTTTCATT...TACAG\|GGT | 0 | 1 | 57.217 |
| 3437021 | GT-AG | 0 | 0.0001916852256475 | 52 | rna-EDS130_LOCUS117 623764 | 20 | 315131 | 315182 | Adineta ricciae 249248 | TCA\|GTTCGTTGGA...CTTTCTTCGACT/ATAGTAATAATT...TTTAG\|ACA | 2 | 1 | 61.487 |
| 3437022 | GT-AG | 0 | 0.0407120969485439 | 52 | rna-EDS130_LOCUS117 623764 | 21 | 315358 | 315409 | Adineta ricciae 249248 | GGA\|GTATGTTTCG...TCTATCTTCGAG/CCGGTGTATATT...TTTAG\|TTA | 0 | 1 | 65.28 |
| 3437023 | GT-AG | 0 | 0.0008733099487158 | 56 | rna-EDS130_LOCUS117 623764 | 22 | 315623 | 315678 | Adineta ricciae 249248 | CGT\|GTATGATCAT...GCTGCCATAGTT/AAATTTGTTATT...TGTAG\|CCA | 0 | 1 | 69.896 |
| 3437024 | GT-AG | 0 | 0.0001271960630099 | 52 | rna-EDS130_LOCUS117 623764 | 23 | 315741 | 315792 | Adineta ricciae 249248 | TAC\|GTAAGATATT...TTCCCCTTAACA/TTTGTTCCTATT...TTCAG\|AGT | 2 | 1 | 71.24 |
| 3437025 | GT-AG | 0 | 1.000000099473604e-05 | 50 | rna-EDS130_LOCUS117 623764 | 24 | 316013 | 316062 | Adineta ricciae 249248 | CAA\|GTGAGAGAAA...AAACTTTTGATA/AAACTTTTGATA...TTTAG\|TAT | 0 | 1 | 76.008 |
| 3437026 | GT-AG | 0 | 5.2660552849570506e-05 | 59 | rna-EDS130_LOCUS117 623764 | 25 | 316166 | 316224 | Adineta ricciae 249248 | ATT\|GTAAATATGC...TCATTCTCGAAT/GTTCATTTCATT...TTTAG\|TCC | 1 | 1 | 78.24 |
| 3437027 | GT-AG | 0 | 1.000000099473604e-05 | 56 | rna-EDS130_LOCUS117 623764 | 26 | 316720 | 316775 | Adineta ricciae 249248 | GAG\|GTAAAGCACT...TTTATTTCGATC/TTCGATCTCACA...TCAAG\|GTC | 1 | 1 | 88.968 |
| 3437028 | GT-AG | 0 | 0.0001023264586409 | 47 | rna-EDS130_LOCUS117 623764 | 27 | 316888 | 316934 | Adineta ricciae 249248 | ACG\|GTTTGTTCCA...TGGATCTCGACA/GATTTGTTCATG...TTTAG\|CTA | 2 | 1 | 91.396 |
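In the rows above, `length` appears to equal `end - start + 1`, which suggests that both coordinates are inclusive. The sketch below checks this against the first five rows and, assuming consecutive introns lie on the same strand in ascending genomic order, derives the size of the exon between each pair. The helper names are illustrative and not part of the database.

```python
# (ordinal_index, start, end, length) copied from the first five rows of the table.
rows = [
    (1, 310639, 310690, 52),
    (2, 310729, 310782, 54),
    (3, 310930, 310988, 59),
    (4, 311081, 311306, 226),
    (5, 311391, 311439, 49),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive."""
    return end - start + 1


# Rows whose stored length disagrees with the inclusive-coordinate reading.
mismatches = [r for r in rows if inclusive_length(r[1], r[2]) != r[3]]

# Assumed: the exon between neighbouring introns spans the positions
# strictly between one intron's end and the next intron's start.
exon_lengths = [nxt[1] - prev[2] - 1 for prev, nxt in zip(rows, rows[1:])]

print(mismatches)    # [] for these five rows
print(exon_lengths)  # [38, 147, 92, 84]
```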
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
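To see how the `transcript_id` index supports a filter such as `transcript_id = 623764`, the sketch below builds a reduced copy of the table in an in-memory SQLite database using Python's standard `sqlite3` module. It is not the database's own tooling: only four columns are kept, the foreign keys are dropped, and the third inserted row is invented for contrast.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE introns (
    id INTEGER PRIMARY KEY,
    transcript_id INTEGER,
    ordinal_index INTEGER,
    length INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?)",
    [
        (3437002, 623764, 1, 52),  # from the listing for transcript 623764
        (3437003, 623764, 2, 54),  # from the listing for transcript 623764
        (1, 1, 1, 100),            # hypothetical row for a different transcript
    ],
)

# Filter by parent transcript, in intron order.
rows = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index",
    (623764,),
).fetchall()

# Ask SQLite how it would resolve the equality filter; the plan text
# is the fourth column of each EXPLAIN QUERY PLAN row.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT id FROM introns WHERE transcript_id = ?",
    (623764,),
).fetchall()
plan_text = " ".join(step[3] for step in plan)

print(rows)
print(plan_text)
```

The printed plan should name `idx_introns_transcript_id`, indicating an index search rather than a full table scan; the exact wording of the plan varies between SQLite versions.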