introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron (e.g., GT-AG)
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability that the intron is a minor intron, expressed as a percentage (0-100)
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
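The column list above maps naturally onto a typed record. The following is a minimal Python sketch, not part of the database itself: the names `Intron` and `intron_from_row` are hypothetical, and it assumes a row arrives as a 14-item tuple in the column order listed above, with the two flag columns stored as 0/1 integers.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """One row of the introns table (hypothetical helper type)."""
    id: int
    dinucleotide_pair: str
    is_minor: bool             # stored as INTEGER 0/1
    score: float               # percentage, 0-100
    length: int                # base pairs
    transcript_id: int
    ordinal_index: int
    start: int
    end: int
    taxonomy_id: int
    scored_motifs: str
    phase: Optional[int]       # null for introns outside the coding sequence
    in_cds: bool               # stored as INTEGER 0/1
    relative_position: Optional[float]  # nullability is an assumption


def intron_from_row(row: tuple) -> Intron:
    """Convert a raw 14-column tuple into an Intron, decoding the 0/1 flags."""
    (id_, pair, is_minor, score, length, transcript_id, ordinal_index,
     start, end, taxonomy_id, motifs, phase, in_cds, rel_pos) = row
    return Intron(id_, pair, bool(is_minor), score, length, transcript_id,
                  ordinal_index, start, end, taxonomy_id, motifs, phase,
                  bool(in_cds), rel_pos)


# First row of the listing for transcript_id = 21436516.
example = intron_from_row((
    115633147, "GC-AG", 0, 1.000000099473604e-05, 9432, 21436516, 1,
    24160628, 24170059, 13146,
    "GCG|GCGGCGAAAG...GCTCTCTAGAAA/AAGTTTGTGAGC...TGCAG|ATA",
    1, 1, 11.946,
))
print(example.is_minor, example.in_cds, example.phase)
```

Decoding `is_minor` and `in_cds` to booleans at the boundary keeps later filtering code from comparing against magic integers, while `phase` stays optional so that introns outside the coding sequence are representable.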
32 rows where transcript_id = 21436516
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 115633147 | GC-AG | 0 | 1.000000099473604e-05 | 9432 | rna-XM_034060693.1 21436516 | 1 | 24160628 | 24170059 | Melopsittacus undulatus 13146 | GCG\|GCGGCGAAAG...GCTCTCTAGAAA/AAGTTTGTGAGC...TGCAG\|ATA | 1 | 1 | 11.946 |
| 115633148 | GT-AG | 0 | 1.000000099473604e-05 | 464 | rna-XM_034060693.1 21436516 | 2 | 24159969 | 24160432 | Melopsittacus undulatus 13146 | CAG\|GTAGGTCTGT...ACATTCTTGCTG/GCATTTCTCAGA...AATAG\|GTG | 1 | 1 | 14.208 |
| 115633149 | GT-AG | 0 | 7.442043061273612e-05 | 1290 | rna-XM_034060693.1 21436516 | 3 | 24158453 | 24159742 | Melopsittacus undulatus 13146 | AAA\|GTAAGTTCTA...GTTGTTTTGAAG/GTTGTTTTGAAG...TACAG\|AGT | 2 | 1 | 16.829 |
| 115633150 | GT-AG | 0 | 1.000000099473604e-05 | 388 | rna-XM_034060693.1 21436516 | 4 | 24157860 | 24158247 | Melopsittacus undulatus 13146 | GAG\|GTAGGTGGAT...ATTTTCCTAATG/ATTTTCCTAATG...CTCAG\|GTC | 0 | 1 | 19.207 |
| 115633151 | GT-AG | 0 | 1.000000099473604e-05 | 4392 | rna-XM_034060693.1 21436516 | 5 | 24153356 | 24157747 | Melopsittacus undulatus 13146 | AAG\|GTAAGGACCA...CTGTTGTTAAAT/CTGTTGTTAAAT...CACAG\|TGC | 1 | 1 | 20.506 |
| 115633152 | GT-AG | 0 | 0.0007151422789847 | 942 | rna-XM_034060693.1 21436516 | 6 | 24152164 | 24153105 | Melopsittacus undulatus 13146 | CAT\|GTAAGCCAGC...GTTCCCTTATCT/GTTGTTCTGATA...ATCAG\|CTA | 2 | 1 | 23.405 |
| 115633153 | GT-AG | 0 | 1.000000099473604e-05 | 347 | rna-XM_034060693.1 21436516 | 7 | 24151674 | 24152020 | Melopsittacus undulatus 13146 | AAG\|GTAAGGAAAG...TTTGTTTGGACA/AATGAAATGACA...TTTAG\|GTA | 1 | 1 | 25.064 |
| 115633154 | GT-AG | 0 | 1.000000099473604e-05 | 336 | rna-XM_034060693.1 21436516 | 8 | 24151165 | 24151500 | Melopsittacus undulatus 13146 | GAG\|GTAAGATGCC...CTCTGTGTAATA/TTGGGTGTTATT...TCTAG\|TGC | 0 | 1 | 27.07 |
| 115633155 | GT-AG | 0 | 1.000000099473604e-05 | 1213 | rna-XM_034060693.1 21436516 | 9 | 24149783 | 24150995 | Melopsittacus undulatus 13146 | CAG\|GTCAGTAGCT...GGTCCCTTTCTG/TGCAAATTCAGT...CCCAG\|GAG | 1 | 1 | 29.03 |
| 115633156 | GT-AG | 0 | 1.000000099473604e-05 | 3024 | rna-XM_034060693.1 21436516 | 10 | 24146588 | 24149611 | Melopsittacus undulatus 13146 | AGT\|GTGAGTGTTT...ATATCTTTGTCT/GACCTGCTGACA...TGAAG\|ACT | 1 | 1 | 31.014 |
| 115633157 | GT-AG | 0 | 1.000000099473604e-05 | 124 | rna-XM_034060693.1 21436516 | 11 | 24146291 | 24146414 | Melopsittacus undulatus 13146 | CAG\|GTAATGCAGC...ATCTCCTTCATT/ATCTCCTTCATT...CATAG\|ACA | 0 | 1 | 33.02 |
| 115633158 | GT-AG | 0 | 0.0014280401942588 | 897 | rna-XM_034060693.1 21436516 | 12 | 24145226 | 24146122 | Melopsittacus undulatus 13146 | GTG\|GTATGTTAAA...AGCCCCTTGCTT/CCTTGCTTCATG...TTCAG\|ACC | 0 | 1 | 34.969 |
| 115633159 | GT-AG | 0 | 0.0013163504548927 | 1816 | rna-XM_034060693.1 21436516 | 13 | 24143160 | 24144975 | Melopsittacus undulatus 13146 | CAG\|GTAAGCTTCG...ATTTTCTTATCA/CATTTTCTTATC...TGCAG\|ATG | 1 | 1 | 37.868 |
| 115633160 | GT-AG | 0 | 1.000000099473604e-05 | 1154 | rna-XM_034060693.1 21436516 | 14 | 24141878 | 24143031 | Melopsittacus undulatus 13146 | GAG\|GTTTGACCTT...CCACCCTTAACT/CCACCCTTAACT...TTTAG\|GTC | 0 | 1 | 39.353 |
| 115633161 | GT-AG | 0 | 1.000000099473604e-05 | 159 | rna-XM_034060693.1 21436516 | 15 | 24141532 | 24141690 | Melopsittacus undulatus 13146 | CAG\|GTAAAAATAG...TTCCTCTTGTAT/AGTCTGCATATT...CTCAG\|ACA | 1 | 1 | 41.522 |
| 115633162 | GT-AG | 0 | 1.000000099473604e-05 | 4482 | rna-XM_034060693.1 21436516 | 16 | 24136906 | 24141387 | Melopsittacus undulatus 13146 | AGG\|GTGAGTGGAT...TTTGCTTTCTAT/TGAGTTTTCAGA...TGCAG\|GTG | 1 | 1 | 43.192 |
| 115633163 | GT-AG | 0 | 1.000000099473604e-05 | 988 | rna-XM_034060693.1 21436516 | 17 | 24135634 | 24136621 | Melopsittacus undulatus 13146 | AAG\|GTAAAGCAAC...TTTTCTTTTACA/TTTTCTTTTACA...TCCAG\|GTT | 0 | 1 | 46.486 |
| 115633164 | GT-AG | 0 | 1.000000099473604e-05 | 1863 | rna-XM_034060693.1 21436516 | 18 | 24132153 | 24134015 | Melopsittacus undulatus 13146 | ATG\|GTAAATTACT...GAGGTCTGATCG/CGTATACTAATG...TTTAG\|GTA | 1 | 1 | 65.252 |
| 115633165 | GT-AG | 0 | 0.0004229096456918 | 781 | rna-XM_034060693.1 21436516 | 19 | 24131278 | 24132058 | Melopsittacus undulatus 13146 | AAG\|GTATTTGGAG...CATGTCTTGACT/CATGTCTTGACT...TACAG\|GTA | 2 | 1 | 66.342 |
| 115633166 | GT-AG | 0 | 1.000000099473604e-05 | 619 | rna-XM_034060693.1 21436516 | 20 | 24130474 | 24131092 | Melopsittacus undulatus 13146 | AAG\|GTAAGAAATA...AGGCTTTTAACT/AGGCTTTTAACT...TACAG\|ATA | 1 | 1 | 68.488 |
| 115633167 | GT-AG | 0 | 1.000000099473604e-05 | 1003 | rna-XM_034060693.1 21436516 | 21 | 24129238 | 24130240 | Melopsittacus undulatus 13146 | TGG\|GTAAGTACTG...GTTTCCTTTCTG/ATGCAACTGATT...CACAG\|CTG | 0 | 1 | 71.19 |
| 115633168 | GT-AG | 0 | 1.000000099473604e-05 | 1026 | rna-XM_034060693.1 21436516 | 22 | 24127934 | 24128959 | Melopsittacus undulatus 13146 | CAG\|GTTGGTAAAG...CCTTTTGTGACT/CCTTTTGTGACT...AATAG\|ATG | 2 | 1 | 74.414 |
| 115633169 | GT-AG | 0 | 1.000000099473604e-05 | 294 | rna-XM_034060693.1 21436516 | 23 | 24127425 | 24127718 | Melopsittacus undulatus 13146 | TAG\|GTTAGCATCT...GAGTCTTTCCTT/TGGAGTGTGAAC...TGCAG\|AGC | 1 | 1 | 76.908 |
| 115633170 | GC-AG | 0 | 1.000000099473604e-05 | 805 | rna-XM_034060693.1 21436516 | 24 | 24126492 | 24127296 | Melopsittacus undulatus 13146 | AAG\|GCAAGTGTAA...CATCTCTGGATG/GTTGGTTTAAAT...TCTAG\|AAA | 0 | 1 | 78.392 |
| 115633171 | GT-AG | 0 | 1.000000099473604e-05 | 170 | rna-XM_034060693.1 21436516 | 25 | 24126197 | 24126366 | Melopsittacus undulatus 13146 | AAG\|GTAAAAAAAC...CTTTCCTTCCTT/AAGTTGCTGAAT...TTCAG\|CAT | 2 | 1 | 79.842 |
| 115633172 | GT-AG | 0 | 1.9870504215780664e-05 | 82 | rna-XM_034060693.1 21436516 | 26 | 24125982 | 24126063 | Melopsittacus undulatus 13146 | AAG\|GTAGGCCTAC...CTGCTGTTAACT/CTGCTGTTAACT...TCCAG\|GTC | 0 | 1 | 81.385 |
| 115633173 | GT-AG | 0 | 1.000000099473604e-05 | 2502 | rna-XM_034060693.1 21436516 | 27 | 24123299 | 24125800 | Melopsittacus undulatus 13146 | AAG\|GTACTGCAAG...AGATTTTTCACT/AGATTTTTCACT...CTTAG\|TTC | 1 | 1 | 83.484 |
| 115633174 | GT-AG | 0 | 1.000000099473604e-05 | 283 | rna-XM_034060693.1 21436516 | 28 | 24122849 | 24123131 | Melopsittacus undulatus 13146 | GAG\|GTGAGCAGTC...AGGTCTTTATCA/AAGGTCTTTATC...TCTAG\|GTG | 0 | 1 | 85.421 |
| 115633175 | GT-AG | 0 | 1.000000099473604e-05 | 737 | rna-XM_034060693.1 21436516 | 29 | 24121847 | 24122583 | Melopsittacus undulatus 13146 | AAG\|GTAAGTGATA...AAGTTGCTAACT/AAGTTGCTAACT...TGCAG\|GTT | 1 | 1 | 88.495 |
| 115633176 | GT-AG | 0 | 0.0005095695689163 | 913 | rna-XM_034060693.1 21436516 | 30 | 24120753 | 24121665 | Melopsittacus undulatus 13146 | CCG\|GTACATTAAG...TTTGTCTGGATG/TTTGTCTGGATG...TTCAG\|GTG | 2 | 1 | 90.594 |
| 115633177 | GT-AG | 0 | 1.000000099473604e-05 | 913 | rna-XM_034060693.1 21436516 | 31 | 24119586 | 24120498 | Melopsittacus undulatus 13146 | TAG\|GTAGGAAAGC...TGTACTTTGTCA/ATGTCATTTATT...TACAG\|CAC | 1 | 1 | 93.54 |
| 115633178 | GT-AG | 0 | 0.0002276690585562 | 431 | rna-XM_034060693.1 21436516 | 32 | 24119000 | 24119430 | Melopsittacus undulatus 13146 | CAG\|GTATGATGCA...TCTTTTTTATCT/TTCTTTTTTATC...TCCAG\|AAT | 0 | 1 | 95.338 |
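Two relationships between columns can be read off the rows above: `length` equals `end - start + 1` (inclusive coordinates), and the first and last two bases of the middle segment of `scored_motifs` match `dinucleotide_pair`. The Python sketch below checks both on three rows copied from the listing. The helper names are hypothetical, and the reading of `scored_motifs` as three `|`-separated segments (upstream exon flank, intron excerpt, downstream exon flank) is an assumption inferred from the sample values, not something the source documents.

```python
from typing import NamedTuple


class Sample(NamedTuple):
    ordinal_index: int
    dinucleotide_pair: str
    length: int
    start: int
    end: int
    scored_motifs: str


# Rows 1, 2 and 24 of the listing for transcript_id = 21436516.
SAMPLES = [
    Sample(1, "GC-AG", 9432, 24160628, 24170059,
           "GCG|GCGGCGAAAG...GCTCTCTAGAAA/AAGTTTGTGAGC...TGCAG|ATA"),
    Sample(2, "GT-AG", 464, 24159969, 24160432,
           "CAG|GTAGGTCTGT...ACATTCTTGCTG/GCATTTCTCAGA...AATAG|GTG"),
    Sample(24, "GC-AG", 805, 24126492, 24127296,
           "AAG|GCAAGTGTAA...CATCTCTGGATG/GTTGGTTTAAAT...TCTAG|AAA"),
]


def inclusive_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both included."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return 'XX-YY' from the first and last two bases of the intron excerpt.

    Assumes the value has exactly three '|'-separated segments and that
    the middle one begins and ends at the intron boundaries.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"unexpected scored_motifs layout: {scored_motifs!r}")
    intron_excerpt = parts[1]
    return f"{intron_excerpt[:2]}-{intron_excerpt[-2:]}"


checks = [
    (s.ordinal_index,
     inclusive_length(s.start, s.end) == s.length,
     terminal_dinucleotides(s.scored_motifs) == s.dinucleotide_pair)
    for s in SAMPLES
]
print(checks)
```

All three samples pass both checks. Note also that `start` decreases as `ordinal_index` increases in this listing, which would be consistent with a transcript on the reverse strand, although the table itself does not record strand.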
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
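To show how the schema and the `transcript_id` filter fit together, here is a minimal Python sketch using the standard-library `sqlite3` module. It builds the `introns` table in an in-memory database, loads the first three rows of the listing, and runs the same filter the page applies. The in-memory database and the reduced data set are assumptions for illustration; the parent tables `transcripts` and `genomes` are not created, so foreign-key enforcement is switched off explicitly.

```python
import sqlite3

SCHEMA = """
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
  ON [introns] ([transcript_id]);
"""

# First three rows of the listing, with foreign keys as plain integers.
ROWS = [
    (115633147, "GC-AG", 0, 1.000000099473604e-05, 9432, 21436516, 1,
     24160628, 24170059, 13146,
     "GCG|GCGGCGAAAG...GCTCTCTAGAAA/AAGTTTGTGAGC...TGCAG|ATA", 1, 1, 11.946),
    (115633148, "GT-AG", 0, 1.000000099473604e-05, 464, 21436516, 2,
     24159969, 24160432, 13146,
     "CAG|GTAGGTCTGT...ACATTCTTGCTG/GCATTTCTCAGA...AATAG|GTG", 1, 1, 14.208),
    (115633149, "GT-AG", 0, 7.442043061273612e-05, 1290, 21436516, 3,
     24158453, 24159742, 13146,
     "AAA|GTAAGTTCTA...GTTGTTTTGAAG/GTTGTTTTGAAG...TACAG|AGT", 2, 1, 16.829),
]

conn = sqlite3.connect(":memory:")
# The referenced parent tables do not exist in this sketch.
conn.execute("PRAGMA foreign_keys = OFF")
conn.executescript(SCHEMA)
conn.executemany(
    "INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?)",
    ROWS,
)

QUERY = """
SELECT ordinal_index, dinucleotide_pair, length
FROM introns
WHERE transcript_id = ?
ORDER BY ordinal_index
"""
result = conn.execute(QUERY, (21436516,)).fetchall()
print(result)

# Ask SQLite how it plans to run the query; the wording varies by version.
for plan_row in conn.execute("EXPLAIN QUERY PLAN " + QUERY, (21436516,)):
    print(plan_row[-1])
```

Against the full database the same query would return all 32 introns of the transcript. The query-plan output is where one would expect to see `idx_introns_transcript_id` mentioned if SQLite chooses that index for the equality filter.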