introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, score representing the probability (0-100%) of the intron being minor
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
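The column descriptions do not state whether `start` and `end` are inclusive. The rows listed on this page are consistent with inclusive coordinates, i.e. `length = end - start + 1`. The following is a minimal sketch of that check, assuming an in-memory SQLite database and a reduced, illustrative table definition (only four of the columns); the three rows are copied from the table on this page.

```python
import sqlite3

# Reduced, illustrative copy of the introns table (assumption: only the
# columns needed for the check are created here).
conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE introns ('
    'id INTEGER PRIMARY KEY, length INTEGER, "start" INTEGER, "end" INTEGER)'
)

# Three rows copied from this page: (id, length, start, end).
rows = [
    (106496802, 368, 30118457, 30118824),
    (106496803, 76765, 30041626, 30118390),
    (106496804, 27337, 30014232, 30041568),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?)", rows)

# Count rows whose stored length disagrees with inclusive coordinates.
# "end" is an SQL keyword, so both coordinate columns are quoted.
mismatches = conn.execute(
    'SELECT COUNT(*) FROM introns WHERE length <> "end" - "start" + 1'
).fetchone()[0]
print(mismatches)
```

A result of zero mismatches on these rows supports the inclusive reading, but it is only a spot check and not a statement about the whole database.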
32 rows where transcript_id = 19905852
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 106496802 | GT-AG | 0 | 1.000000099473604e-05 | 368 | rna-XM_021538420.2 19905852 | 1 | 30118457 | 30118824 | Lonchura striata 40157 | CGG\|GTGAGTGCCG...CTTCTCTTCTCC/CCTTTCCTCCCC...CCTAG\|GAC | 0 | 1 | 1.373 |
| 106496803 | GT-AG | 0 | 1.000000099473604e-05 | 76765 | rna-XM_021538420.2 19905852 | 2 | 30041626 | 30118390 | Lonchura striata 40157 | AAG\|GTTTGTGTCA...TGTTCTCTCTCT/CTGGTGCTGACT...CTCAG\|GGT | 0 | 1 | 2.962 |
| 106496804 | GT-AG | 0 | 1.000000099473604e-05 | 27337 | rna-XM_021538420.2 19905852 | 3 | 30014232 | 30041568 | Lonchura striata 40157 | GAG\|GTGAGTACAA...TTTGTTTTATTT/GTTTGTTTTATT...CAAAG\|GAT | 0 | 1 | 4.335 |
| 106496805 | GT-AG | 0 | 5.139996607956314e-05 | 670 | rna-XM_021538420.2 19905852 | 4 | 30013436 | 30014105 | Lonchura striata 40157 | TGG\|GTAAGCAGAT...CTATCTTTAATG/CTATCTTTAATG...TCCAG\|CTT | 0 | 1 | 7.37 |
| 106496806 | GT-AG | 0 | 1.000000099473604e-05 | 2731 | rna-XM_021538420.2 19905852 | 5 | 30010594 | 30013324 | Lonchura striata 40157 | CGG\|GTAAGGAAAA...TGACTCTGAACA/AAAAGCCTGACT...TGCAG\|GGG | 0 | 1 | 10.043 |
| 106496807 | GT-AG | 0 | 1.000000099473604e-05 | 1546 | rna-XM_021538420.2 19905852 | 6 | 30008957 | 30010502 | Lonchura striata 40157 | TTG\|GTAAGTAGGT...CCCCTCTTATTT/TCTTATTTCATT...CACAG\|TGG | 1 | 1 | 12.235 |
| 106496808 | GT-AG | 0 | 1.000000099473604e-05 | 2268 | rna-XM_021538420.2 19905852 | 7 | 30006558 | 30008825 | Lonchura striata 40157 | AGA\|GTAAGAGTCC...TCTCTCTTTTTC/CACCTACTAACA...TGCAG\|AGT | 0 | 1 | 15.39 |
| 106496809 | GT-AG | 0 | 1.1371100777254871e-05 | 2341 | rna-XM_021538420.2 19905852 | 8 | 30004162 | 30006502 | Lonchura striata 40157 | CCC\|GTAAGTGATC...GTGTCTCTATCT/GAGCAACTAAAC...TTCAG\|CAC | 1 | 1 | 16.715 |
| 106496810 | GT-AG | 0 | 0.0021074967499372 | 2164 | rna-XM_021538420.2 19905852 | 9 | 30001919 | 30004082 | Lonchura striata 40157 | ATG\|GTATGTATTA...TTTTCCTTCTTG/CTGAGTTTCACC...CTCAG\|GTC | 2 | 1 | 18.618 |
| 106496811 | GT-AG | 0 | 0.0013543697268574 | 664 | rna-XM_021538420.2 19905852 | 10 | 30001079 | 30001742 | Lonchura striata 40157 | AAG\|GTATGCAGCT...TGTTTCTTTCCT/CTAAAATTAAAG...GTTAG\|ATG | 1 | 1 | 22.856 |
| 106496812 | GT-AG | 0 | 1.000000099473604e-05 | 454 | rna-XM_021538420.2 19905852 | 11 | 30000552 | 30001005 | Lonchura striata 40157 | AAG\|GTAATAATTG...AAGGCTTTAACT/AAGGCTTTAACT...TGCAG\|TTC | 2 | 1 | 24.615 |
| 106496813 | GT-AG | 0 | 1.000000099473604e-05 | 7280 | rna-XM_021538420.2 19905852 | 12 | 29993061 | 30000340 | Lonchura striata 40157 | GAG\|GTAGAAACAG...CTTTCTTTGATA/CTTTCTTTGATA...TGTAG\|CAA | 0 | 1 | 29.697 |
| 106496814 | GT-AG | 0 | 1.000000099473604e-05 | 2100 | rna-XM_021538420.2 19905852 | 13 | 29990799 | 29992898 | Lonchura striata 40157 | CAG\|GTGAGTCCAT...TTCCTTTTGAAT/AATAAATTTATT...GACAG\|GAG | 0 | 1 | 33.598 |
| 106496815 | GT-AG | 0 | 1.000000099473604e-05 | 656 | rna-XM_021538420.2 19905852 | 14 | 29990056 | 29990711 | Lonchura striata 40157 | CTG\|GTAAAGCACT...CATGTCTTGAAT/GAATTCCTCACC...CAAAG\|GAA | 0 | 1 | 35.694 |
| 106496816 | GT-AG | 0 | 1.000000099473604e-05 | 3277 | rna-XM_021538420.2 19905852 | 15 | 29986560 | 29989836 | Lonchura striata 40157 | GAG\|GTAAGTCCTG...ATTGCTTTACTG/GCTTTACTGACT...CTTAG\|GTG | 0 | 1 | 40.968 |
| 106496817 | GT-AG | 0 | 1.000000099473604e-05 | 2021 | rna-XM_021538420.2 19905852 | 16 | 29984377 | 29986397 | Lonchura striata 40157 | CAG\|GTAAGTGCCC...TATGTTTTGTTT/GTTATACTGAGC...GTAAG\|GTA | 0 | 1 | 44.87 |
| 106496818 | GT-AG | 0 | 1.000000099473604e-05 | 922 | rna-XM_021538420.2 19905852 | 17 | 29983224 | 29984145 | Lonchura striata 40157 | AGG\|GTAAGGAGTA...ATGGGTTTAACA/ATGGGTTTAACA...CCAAG\|GTA | 0 | 1 | 50.434 |
| 106496819 | GT-AG | 0 | 1.000000099473604e-05 | 930 | rna-XM_021538420.2 19905852 | 18 | 29982187 | 29983116 | Lonchura striata 40157 | GAG\|GTTTGTAAGC...TCTTTCTTCCCT/GTGTGTGTGACA...TATAG\|CAA | 2 | 1 | 53.011 |
| 106496820 | GT-AG | 0 | 2.711661411312046e-05 | 701 | rna-XM_021538420.2 19905852 | 19 | 29981334 | 29982034 | Lonchura striata 40157 | GAT\|GTAAGTCCTT...CCCTGCTTAATC/CCCTGCTTAATC...TGCAG\|CAT | 1 | 1 | 56.671 |
| 106496821 | GT-AG | 0 | 1.000000099473604e-05 | 338 | rna-XM_021538420.2 19905852 | 20 | 29980898 | 29981235 | Lonchura striata 40157 | GCT\|GTGAGTTGCG...TTGTCCTGAATG/TTTGTCCTGAAT...TGCAG\|GGA | 0 | 1 | 59.032 |
| 106496822 | GT-AG | 0 | 1.000000099473604e-05 | 550 | rna-XM_021538420.2 19905852 | 21 | 29980339 | 29980888 | Lonchura striata 40157 | GTG\|GTAAGACCTT...TCAGTTTTAACA/TTCTCTCTTACT...AACAG\|GAT | 0 | 1 | 59.249 |
| 106496823 | GT-AG | 0 | 1.000000099473604e-05 | 391 | rna-XM_021538420.2 19905852 | 22 | 29979772 | 29980162 | Lonchura striata 40157 | AGT\|GTTAGTCCCT...TATTTCTTGTCT/TGTTATCTCACT...AACAG\|GTC | 2 | 1 | 63.487 |
| 106496824 | GT-AG | 0 | 7.555789891736908e-05 | 1158 | rna-XM_021538420.2 19905852 | 23 | 29978493 | 29979650 | Lonchura striata 40157 | CAG\|GTATTGCTTT...AGACTTTTAAGC/AGACTTTTAAGC...TACAG\|AGT | 0 | 1 | 66.402 |
| 106496825 | GT-AG | 0 | 0.0028107547431955 | 2364 | rna-XM_021538420.2 19905852 | 24 | 29975937 | 29978300 | Lonchura striata 40157 | GAG\|GTATGTCTTA...TTGTCATTAATA/TTTATTGTCATT...TGCAG\|ACT | 0 | 1 | 71.026 |
| 106496826 | GT-AG | 0 | 1.000000099473604e-05 | 275 | rna-XM_021538420.2 19905852 | 25 | 29975544 | 29975818 | Lonchura striata 40157 | TGG\|GTAAGTAAAG...TGCACTTTACTT/ATGCACTTTACT...TTCAG\|TAG | 1 | 1 | 73.868 |
| 106496827 | GT-AG | 0 | 1.000000099473604e-05 | 3758 | rna-XM_021538420.2 19905852 | 26 | 29971615 | 29975372 | Lonchura striata 40157 | GGG\|GTAATGCAAA...GGATCTTTGGTT/AAAGTACTAACT...TCTAG\|GAG | 1 | 1 | 77.987 |
| 106496828 | GT-AG | 0 | 1.000000099473604e-05 | 5013 | rna-XM_021538420.2 19905852 | 27 | 29966458 | 29971470 | Lonchura striata 40157 | CTG\|GTAAAGTTAA...TTTTTTTTCATT/TTTTTTTTCATT...GTTAG\|GTA | 1 | 1 | 81.455 |
| 106496829 | GT-AG | 0 | 1.000000099473604e-05 | 2328 | rna-XM_021538420.2 19905852 | 28 | 29963995 | 29966322 | Lonchura striata 40157 | TTG\|GTGAGTAGCA...TTTGGCTTAATT/TTTGGCTTAATT...TTCAG\|TAA | 1 | 1 | 84.706 |
| 106496830 | GT-AG | 0 | 2.4169344394323334e-05 | 711 | rna-XM_021538420.2 19905852 | 29 | 29963183 | 29963893 | Lonchura striata 40157 | AAG\|GTAACAGTAT...TTGTTCTAAGCT/GTGAATCTCACT...CACAG\|TCA | 0 | 1 | 87.139 |
| 106496831 | GT-AG | 0 | 1.000000099473604e-05 | 836 | rna-XM_021538420.2 19905852 | 30 | 29962197 | 29963032 | Lonchura striata 40157 | CAT\|GTAAGAGCTG...CCTCTTTTGTCT/GCAGCATTAAAA...TGTAG\|ATC | 0 | 1 | 90.751 |
| 106496832 | GT-AG | 0 | 1.000000099473604e-05 | 1921 | rna-XM_021538420.2 19905852 | 31 | 29960116 | 29962036 | Lonchura striata 40157 | TTG\|GTAAGTGCCT...CCTCTCCTAATG/CCTCTCCTAATG...CAAAG\|CAT | 1 | 1 | 94.605 |
| 106496833 | GT-AG | 0 | 0.0001475440740009 | 723 | rna-XM_021538420.2 19905852 | 32 | 29959253 | 29959975 | Lonchura striata 40157 | AAG\|GTACATTCCC...AATCTCTTTTCT/ATTAGAATCACT...TGCAG\|GTT | 0 | 1 | 97.977 |
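
The filtered view above corresponds to a `WHERE transcript_id = 19905852` query sorted by `id`. The sketch below shows one way to run that filter and tally introns by `phase`, assuming a local in-memory SQLite database with a reduced, illustrative table definition. Only four of the 32 rows are loaded (copied from the table above), so the counts describe this subset, not the full transcript.

```python
import sqlite3

# Reduced, illustrative table (assumption: only the columns used below).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE introns ("
    "id INTEGER PRIMARY KEY, transcript_id INTEGER, "
    "ordinal_index INTEGER, phase INTEGER, score REAL)"
)

# Four rows copied from the table above:
# (id, transcript_id, ordinal_index, phase, score).
subset = [
    (106496802, 19905852, 1, 0, 1.000000099473604e-05),
    (106496807, 19905852, 6, 1, 1.000000099473604e-05),
    (106496810, 19905852, 9, 2, 0.0021074967499372),
    (106496825, 19905852, 24, 0, 0.0028107547431955),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", subset)

transcript_id = 19905852

# Same filter and sort order as the page view.
ordered = conn.execute(
    "SELECT ordinal_index, phase, score FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (transcript_id,),
).fetchall()

# Number of introns per phase within the transcript (subset only).
phase_counts = conn.execute(
    "SELECT phase, COUNT(*) FROM introns "
    "WHERE transcript_id = ? GROUP BY phase ORDER BY phase",
    (transcript_id,),
).fetchall()
print(phase_counts)
```

Passing the transcript identifier as a bound parameter (`?`) rather than formatting it into the SQL string keeps the query safe to reuse with other identifiers.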
CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
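
The indexes above suggest that lookups by `transcript_id`, `taxonomy_id`, `phase`, and similar columns are expected to be served from an index rather than a full table scan. As a hedged illustration, the sketch below recreates the table and the `transcript_id` index in an empty in-memory SQLite database and asks SQLite for its query plan. It assumes the default Python `sqlite3` settings (foreign key enforcement off, so the referenced `transcripts` and `genomes` tables need not exist); the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Table and one index, as given in the schema above.
conn.executescript(
    """
    CREATE TABLE "introns" (
      "id" INTEGER,
      "dinucleotide_pair" TEXT,
      "is_minor" INTEGER,
      "score" REAL,
      "length" INTEGER,
      "transcript_id" INTEGER,
      "ordinal_index" INTEGER,
      "start" INTEGER,
      "end" INTEGER,
      "taxonomy_id" INTEGER,
      "scored_motifs" TEXT,
      "phase" INTEGER,
      "in_cds" INTEGER,
      "relative_position" REAL,
      PRIMARY KEY ([id]),
      FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
      FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
    );
    CREATE INDEX [idx_introns_transcript_id]
        ON [introns] ([transcript_id]);
    """
)

# EXPLAIN QUERY PLAN returns rows of (id, parent, notused, detail);
# the human-readable description is the last field.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (19905852,),
).fetchall()
plan_details = [row[3] for row in plan]
for detail in plan_details:
    print(detail)
```

If the plan text mentions `idx_introns_transcript_id`, the filter used for this page is resolved through that index.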