introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
26 rows where transcript_id = 10082871
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 55388187 | GT-AG | 0 | 1.000000099473604e-05 | 276 | rna-XM_023122857.1 10082871 | 1 | 2605091 | 2605366 | Cucurbita maxima 3661 | CAG\|GTGAGAACAT...CTTTGTTTATCT/GTTTATCTGATA...AGCAG\|GTT | 2 | 1 | 9.773 |
| 55388188 | GT-AG | 0 | 1.000000099473604e-05 | 92 | rna-XM_023122857.1 10082871 | 2 | 2604821 | 2604912 | Cucurbita maxima 3661 | AAG\|GTGGTGTATC...TGTTCTTTATTC/TCTTTATTCATT...TGCAG\|TTA | 0 | 1 | 12.5 |
| 55388189 | GT-AG | 0 | 1.000000099473604e-05 | 1496 | rna-XM_023122857.1 10082871 | 3 | 2603211 | 2604706 | Cucurbita maxima 3661 | AAG\|GTAGTGCTAA...TCTTTCTTAAAA/ATCTTTCTTAAA...TTCAG\|GAT | 0 | 1 | 14.246 |
| 55388190 | GT-AG | 0 | 0.0965682506859832 | 89 | rna-XM_023122857.1 10082871 | 4 | 2602987 | 2603075 | Cucurbita maxima 3661 | GAG\|GTATGCTATT...TTAACCTTATTT/CTAATGTTAACC...AATAG\|GCA | 0 | 1 | 16.314 |
| 55388191 | GC-AG | 0 | 1.000000099473604e-05 | 582 | rna-XM_023122857.1 10082871 | 5 | 2601941 | 2602522 | Cucurbita maxima 3661 | AAG\|GCAAGTTATT...ATTTTTTTACCT/GATTTTTTTACC...TTCAG\|GCG | 2 | 1 | 23.422 |
| 55388192 | GT-AG | 0 | 0.0001425644395606 | 81 | rna-XM_023122857.1 10082871 | 6 | 2601751 | 2601831 | Cucurbita maxima 3661 | AAG\|GTTGCCACAA...TTTTTCTTTTCT/TCTTTTCTTATT...CACAG\|AGG | 0 | 1 | 25.092 |
| 55388193 | GT-AG | 0 | 0.0002736258837599 | 790 | rna-XM_023122857.1 10082871 | 7 | 2600835 | 2601624 | Cucurbita maxima 3661 | GAG\|GTACGTTTTT...CTCGCCATATTT/ATATTTCTCATA...TACAG\|GCC | 0 | 1 | 27.022 |
| 55388194 | GT-AG | 0 | 0.0002334973096813 | 75 | rna-XM_023122857.1 10082871 | 8 | 2600466 | 2600540 | Cucurbita maxima 3661 | AAG\|GTAACGTCAA...CTGATTTTAATA/CTGATTTTAATA...CTCAG\|ATG | 0 | 1 | 31.526 |
| 55388195 | GT-AG | 0 | 1.000000099473604e-05 | 95 | rna-XM_023122857.1 10082871 | 9 | 2600254 | 2600348 | Cucurbita maxima 3661 | CAG\|GTTAAGTTAT...AATTGTTTGATA/ATAGTGTTTACT...TACAG\|GAA | 0 | 1 | 33.318 |
| 55388196 | GT-AG | 0 | 1.530684321557883e-05 | 125 | rna-XM_023122857.1 10082871 | 10 | 2599907 | 2600031 | Cucurbita maxima 3661 | ATG\|GTAAGTTTGT...TGTTTTTGAGTG/ATACAATTGACA...AACAG\|AAT | 0 | 1 | 36.719 |
| 55388197 | GT-AG | 0 | 1.000000099473604e-05 | 176 | rna-XM_023122857.1 10082871 | 11 | 2599620 | 2599795 | Cucurbita maxima 3661 | AAG\|GTGTTAAACG...TATTTTTTATTT/TTATTTTTTATT...AGTAG\|CAA | 0 | 1 | 38.419 |
| 55388198 | GT-AG | 0 | 2.952851290556449e-05 | 100 | rna-XM_023122857.1 10082871 | 12 | 2599376 | 2599475 | Cucurbita maxima 3661 | AAG\|GTCTGTCTCT...ATTATTTTATTA/AATTATTTTATT...TTCAG\|GGT | 0 | 1 | 40.625 |
| 55388199 | GT-AG | 0 | 0.0482492392794343 | 95 | rna-XM_023122857.1 10082871 | 13 | 2598912 | 2599006 | Cucurbita maxima 3661 | CAT\|GTATTTCTCT...TTTTTCTTATGT/CTTTTTCTTATG...TGCAG\|ATT | 0 | 1 | 46.278 |
| 55388200 | GT-AG | 0 | 1.000000099473604e-05 | 1610 | rna-XM_023122857.1 10082871 | 14 | 2597212 | 2598821 | Cucurbita maxima 3661 | GAG\|GTAAGTCCAT...ATCTTCTAATTA/CATCTTCTAATT...AATAG\|GCT | 0 | 1 | 47.656 |
| 55388201 | GT-AG | 0 | 2.1293553769117155e-05 | 95 | rna-XM_023122857.1 10082871 | 15 | 2596853 | 2596947 | Cucurbita maxima 3661 | CAG\|GTACAACTTT...ACTTCCATAACA/ACATATTTCATA...TCCAG\|GTT | 0 | 1 | 51.7 |
| 55388202 | GT-AG | 0 | 6.521105489243334e-05 | 91 | rna-XM_023122857.1 10082871 | 16 | 2596638 | 2596728 | Cucurbita maxima 3661 | TAG\|GTATATAATA...TATTGTTTGTCA/TTGTTTGTCAAA...GGCAG\|GAA | 1 | 1 | 53.6 |
| 55388203 | GT-AG | 0 | 0.0014670203504164 | 339 | rna-XM_023122857.1 10082871 | 17 | 2595564 | 2595902 | Cucurbita maxima 3661 | TAG\|GTATCAACAA...ATTATTTTCACT/ATTATTTTCACT...CCTAG\|GTA | 1 | 1 | 64.859 |
| 55388204 | GT-AG | 0 | 0.0023849583565228 | 438 | rna-XM_023122857.1 10082871 | 18 | 2594888 | 2595325 | Cucurbita maxima 3661 | CAA\|GTATGTATAG...ATTCCCTTTGCT/TTGCTGTTTACT...GGCAG\|GAT | 2 | 1 | 68.505 |
| 55388205 | GT-AG | 0 | 0.0001444868358032 | 534 | rna-XM_023122857.1 10082871 | 19 | 2594251 | 2594784 | Cucurbita maxima 3661 | AAG\|GTCTATAGCT...TTTCTCTTAATG/TTAATTTTGATT...AATAG\|GTT | 0 | 1 | 70.083 |
| 55388206 | GT-AG | 0 | 0.0004046251391569 | 217 | rna-XM_023122857.1 10082871 | 20 | 2593836 | 2594052 | Cucurbita maxima 3661 | AAG\|GTACGCTGTT...ATGGATTTAACG/TATGGGCTCATA...TTCAG\|GGT | 0 | 1 | 73.116 |
| 55388207 | GT-AG | 0 | 1.718102915361481e-05 | 81 | rna-XM_023122857.1 10082871 | 21 | 2593717 | 2593797 | Cucurbita maxima 3661 | TAG\|GTAAATAGCC...AAATTCTTAACC/AAATTCTTAACC...TGTAG\|AAG | 2 | 1 | 73.698 |
| 55388208 | GT-AG | 0 | 1.000000099473604e-05 | 86 | rna-XM_023122857.1 10082871 | 22 | 2593401 | 2593486 | Cucurbita maxima 3661 | AAG\|GTACTGTAGC...GTATCCTAAGTT/AATGTGCTCAAT...GGTAG\|AAG | 1 | 1 | 77.221 |
| 55388209 | GT-AG | 0 | 6.157245961721789e-05 | 179 | rna-XM_023122857.1 10082871 | 23 | 2592978 | 2593156 | Cucurbita maxima 3661 | GGG\|GTAGGTTACG...CATCACTTAACC/CTTTCCATCACT...AACAG\|AAC | 2 | 1 | 80.959 |
| 55388210 | GT-AG | 0 | 0.0194238923677001 | 153 | rna-XM_023122857.1 10082871 | 24 | 2592425 | 2592577 | Cucurbita maxima 3661 | CAG\|GTACCTTCAT...AATTGTTTAGTC/TGTTTAGTCATT...ACCAG\|ATG | 0 | 1 | 87.086 |
| 55388211 | GT-AG | 0 | 0.0014976647772509 | 122 | rna-XM_023122857.1 10082871 | 25 | 2592063 | 2592184 | Cucurbita maxima 3661 | GAG\|GTAAGCTTTT...TGATCCTTATCA/TTTTTTTTTAAT...AACAG\|GGG | 0 | 1 | 90.763 |
| 55388212 | GT-AG | 0 | 2.357717747363768e-05 | 113 | rna-XM_023122857.1 10082871 | 26 | 2591862 | 2591974 | Cucurbita maxima 3661 | GTG\|GTAAATTCTC...TACTCTTTTGCT/CTTTTGCTAACG...TACAG\|GAG | 1 | 1 | 92.111 |
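
The rows above suggest how `start`, `end`, `length`, and `ordinal_index` relate to one another. The following is a minimal sketch in Python that checks this against the first three rows of the table. The helper name `span` and the inclusive-coordinate reading are assumptions made for illustration; the source does not state the coordinate convention. The observation that coordinates decrease as `ordinal_index` increases is consistent with a transcript on the reverse strand, but that is an inference, not something the table says.

```python
# (ordinal_index, start, end, length) copied from the first three table rows
rows = [
    (1, 2605091, 2605366, 276),
    (2, 2604821, 2604912, 92),
    (3, 2603211, 2604706, 1496),
]

def span(start, end):
    """Length of an interval, ASSUMING both endpoints are inclusive."""
    return end - start + 1

# True if the stored length matches end - start + 1 for every sampled row
consistent = all(span(s, e) == n for _, s, e, n in rows)

# True if genomic start positions fall as ordinal_index rises
descending = all(a[1] > b[1] for a, b in zip(rows, rows[1:]))

print(consistent, descending)
```

If both flags are true for the sampled rows, sorting by `ordinal_index` rather than by `start` is the safer way to list introns in transcript order.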
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
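
As an illustration of how the schema and the `transcript_id` index might be used, here is a hedged sketch using Python's standard `sqlite3` module against an in-memory database. It is not the database's own tooling. The table definition is deliberately reduced: the foreign keys and the `taxonomy_id`, `scored_motifs`, and `relative_position` columns are omitted so that the example is self-contained. The three inserted rows are copied from the table above (introns 1, 2, and 5).

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced copy of the schema: foreign keys and some columns are omitted (assumption
# made only to keep the sketch standalone; the real table has more columns).
conn.executescript("""
CREATE TABLE introns (
    "id" INTEGER PRIMARY KEY,
    "dinucleotide_pair" TEXT,
    "is_minor" INTEGER,
    "score" REAL,
    "length" INTEGER,
    "transcript_id" INTEGER,
    "ordinal_index" INTEGER,
    "start" INTEGER,
    "end" INTEGER,
    "phase" INTEGER,
    "in_cds" INTEGER
);
CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
""")

# Three rows taken from the table above
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    [
        (55388187, "GT-AG", 0, 1.000000099473604e-05, 276, 10082871, 1, 2605091, 2605366, 2, 1),
        (55388188, "GT-AG", 0, 1.000000099473604e-05, 92, 10082871, 2, 2604821, 2604912, 0, 1),
        (55388191, "GC-AG", 0, 1.000000099473604e-05, 582, 10082871, 5, 2601941, 2602522, 2, 1),
    ],
)

query = """
SELECT ordinal_index, dinucleotide_pair, length
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""

# Same filter as the page: all introns of one transcript, in transcript order
result = conn.execute(query, (10082871,)).fetchall()
print(result)

# Ask SQLite how it plans to run the query; the last column of each plan row
# is a human-readable description of the step
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (10082871,)).fetchall()
for step in plan:
    print(step[3])
```

The exact wording of the query plan varies between SQLite versions, so the sketch prints it rather than assuming a fixed string. The equality filter on `transcript_id` is the kind of lookup that `idx_introns_transcript_id` exists to serve.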