introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id: INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair: TEXT, terminal dinucleotide sequences of the intron
- is_minor: INTEGER, indicates whether the intron is a minor intron (1) or not (0)
- score: REAL, probability (0-100%) that the intron is a minor intron
- length: INTEGER, length of the intron in base pairs
- transcript_id: INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index: INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start: INTEGER, start position of the intron in the genome
- end: INTEGER, end position of the intron in the genome
- taxonomy_id: INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for the species
- scored_motifs: TEXT, motifs scored for the intron
- phase: INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
- in_cds: INTEGER, indicates whether the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position: REAL, relative position of the intron within the transcript (as a percentage of coding length)
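To illustrate how `start`, `end`, `length`, and `phase` relate, here is a minimal Python sketch using the first five introns of transcript 32672005. It assumes 1-based, inclusive coordinates and that the gap between consecutive introns is a coding exon; both are inferences from the sample rows, not statements from the database documentation. The `Intron` class and the helper name are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Intron:
    """Illustrative subset of the introns columns (not part of the database)."""
    id: int
    ordinal_index: int
    start: int
    end: int
    length: int
    phase: Optional[int]  # None for introns outside the coding sequence


def length_from_coords(start: int, end: int) -> int:
    # Assumption: start and end are 1-based and inclusive.
    return end - start + 1


# Values copied from the first five rows for transcript_id = 32672005.
introns = [
    Intron(182503839, 1, 4715318, 4716206, 889, 0),
    Intron(182503840, 2, 4716348, 4716898, 551, 0),
    Intron(182503841, 3, 4717056, 4717591, 536, 1),
    Intron(182503842, 4, 4717703, 4718609, 907, 1),
    Intron(182503843, 5, 4718762, 4719017, 256, 0),
]

# The stored length matches the coordinates under the inclusive assumption.
for intron in introns:
    assert length_from_coords(intron.start, intron.end) == intron.length

# Gap between consecutive introns, read here as the length of the exon between them.
exon_lengths = [b.start - a.end - 1 for a, b in zip(introns, introns[1:])]
print(exon_lengths)  # [141, 157, 111, 152]

# For these rows, the phase advances by the exon length modulo 3.
for a, b, exon_len in zip(introns, introns[1:], exon_lengths):
    assert (a.phase + exon_len) % 3 == b.phase
```

The phase check holds for these sample rows; it would not be expected to hold where an exon extends beyond the coding sequence or where `phase` is null.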
35 rows where transcript_id = 32672005
| id | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 182503839 | GT-AG | 0 | 1.000000099473604e-05 | 889 | rna-XM_030231139.1 32672005 | 1 | 4715318 | 4716206 | Serinus canaria 9135 | CAG\|GTAACGCAAT...CTTGCTTTAAGC/AACATCCTGATT...CCTAG\|GAC | 0 | 1 | 3.069 |
| 182503840 | GT-AG | 0 | 1.1683135169796168e-05 | 551 | rna-XM_030231139.1 32672005 | 2 | 4716348 | 4716898 | Serinus canaria 9135 | AAG\|GTACAAGTCC...TTCTCTTTATTC/TCTTTATTCACC...GGCAG\|CTT | 0 | 1 | 5.79 |
| 182503841 | GT-AG | 0 | 1.000000099473604e-05 | 536 | rna-XM_030231139.1 32672005 | 3 | 4717056 | 4717591 | Serinus canaria 9135 | CAG\|GTGAGCATTC...ACTTTTTTGTTT/CTATTACTAACT...TGCAG\|TGG | 1 | 1 | 8.821 |
| 182503842 | GT-AG | 0 | 0.03765417944279 | 907 | rna-XM_030231139.1 32672005 | 4 | 4717703 | 4718609 | Serinus canaria 9135 | AAG\|GTAACTTTTG...TTTTCTTTGAAT/ATATTTGTCATT...TGCAG\|GAG | 1 | 1 | 10.963 |
| 182503843 | GT-AG | 0 | 0.0013095144599406 | 256 | rna-XM_030231139.1 32672005 | 5 | 4718762 | 4719017 | Serinus canaria 9135 | AAG\|GTACATTTTA...TCATCCTCAAAA/CTGTGTTTCATG...CAAAG\|AAA | 0 | 1 | 13.897 |
| 182503844 | GT-AG | 0 | 0.0300739057091715 | 244 | rna-XM_030231139.1 32672005 | 6 | 4719218 | 4719461 | Serinus canaria 9135 | AGG\|GTATGCAGTG...ATTTTTTTAATC/ATTTTTTTAATC...TTCAG\|TGA | 2 | 1 | 17.757 |
| 182503845 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_030231139.1 32672005 | 7 | 4719544 | 4719935 | Serinus canaria 9135 | AAG\|GTGAGTAGGT...TTTTTCTTTTTT/ATGATAATTACA...TGAAG\|GAG | 0 | 1 | 19.34 |
| 182503846 | GT-AG | 0 | 1.000000099473604e-05 | 792 | rna-XM_030231139.1 32672005 | 8 | 4720024 | 4720815 | Serinus canaria 9135 | CAG\|GTAAGGGTTC...TCTGTTTTAATT/TTTGTTTTTATT...CCAAG\|GAA | 1 | 1 | 21.038 |
| 182503847 | GT-AG | 0 | 1.000000099473604e-05 | 752 | rna-XM_030231139.1 32672005 | 9 | 4720990 | 4721741 | Serinus canaria 9135 | CAT\|GTCAGTAGTG...GCAGCCTTTGCT/TGGAGTCTGATT...TGCAG\|TGC | 1 | 1 | 24.397 |
| 182503848 | GT-AG | 0 | 1.000000099473604e-05 | 902 | rna-XM_030231139.1 32672005 | 10 | 4721879 | 4722780 | Serinus canaria 9135 | GGG\|GTGAGTGATT...ATCCATTTAATT/TTTGTTTTCATA...CTTAG\|GAG | 0 | 1 | 27.041 |
| 182503849 | GT-AG | 0 | 1.000000099473604e-05 | 263 | rna-XM_030231139.1 32672005 | 11 | 4722874 | 4723136 | Serinus canaria 9135 | CAG\|GTTAGTGCCA...TTTCTCTTCTCT/AAAACAGTTACA...TTCAG\|GTA | 0 | 1 | 28.836 |
| 182503850 | GT-AG | 0 | 5.210608443036576e-05 | 876 | rna-XM_030231139.1 32672005 | 12 | 4723228 | 4724103 | Serinus canaria 9135 | CAG\|GTTTGTTTCT...ACTATGTTAATA/TAATACCTAATT...TCCAG\|GGC | 1 | 1 | 30.593 |
| 182503851 | GT-AG | 0 | 6.797510973947667e-05 | 1699 | rna-XM_030231139.1 32672005 | 13 | 4724267 | 4725965 | Serinus canaria 9135 | CAC\|GTAGGTAGCT...GATTTTTTAATC/TGTTTATTGACT...TGCAG\|CAA | 2 | 1 | 33.739 |
| 182503852 | GT-AG | 0 | 0.0002500210572979 | 1316 | rna-XM_030231139.1 32672005 | 14 | 4726159 | 4727474 | Serinus canaria 9135 | GAG\|GTATGGCTTG...TTCTCCTTCATC/TTCTCCTTCATC...TGCAG\|CTG | 0 | 1 | 37.464 |
| 182503853 | GT-AG | 0 | 4.052436093560241e-05 | 495 | rna-XM_030231139.1 32672005 | 15 | 4727534 | 4728028 | Serinus canaria 9135 | TTG\|GTAAGCTGAG...TCTACCATATTT/TATTTAATGATT...CAAAG\|TTT | 2 | 1 | 38.603 |
| 182503854 | GT-AG | 0 | 2.002825794440838e-05 | 1271 | rna-XM_030231139.1 32672005 | 16 | 4728204 | 4729474 | Serinus canaria 9135 | AAG\|GTTTGTGTGC...TTGATTTTAATT/AATTGTCTCATT...TTCAG\|CTG | 0 | 1 | 41.98 |
| 182503855 | GT-AG | 0 | 1.1991820464705337e-05 | 223 | rna-XM_030231139.1 32672005 | 17 | 4729569 | 4729791 | Serinus canaria 9135 | TAG\|GTAAAGTTTC...GATTTCTTACAA/TGATTTCTTACA...TCTAG\|AAG | 1 | 1 | 43.795 |
| 182503856 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_030231139.1 32672005 | 18 | 4729923 | 4730230 | Serinus canaria 9135 | AAG\|GTAAGGAGCA...GTTTCCTGCTCT/TCCATTCTGATG...CCTAG\|GGT | 0 | 1 | 46.323 |
| 182503857 | GT-AG | 0 | 1.000000099473604e-05 | 773 | rna-XM_030231139.1 32672005 | 19 | 4730390 | 4731162 | Serinus canaria 9135 | GAG\|GTAAGTCAGT...TTACCCTTGCTT/CAACATTTTACC...CCCAG\|GTA | 0 | 1 | 49.392 |
| 182503858 | GT-AG | 0 | 1.000000099473604e-05 | 243 | rna-XM_030231139.1 32672005 | 20 | 4731303 | 4731545 | Serinus canaria 9135 | AAG\|GTAAGGAGGA...GTTTCATTAACA/TGTGTGTTTATT...TCCAG\|TCA | 2 | 1 | 52.094 |
| 182503859 | GT-AG | 0 | 1.000000099473604e-05 | 1089 | rna-XM_030231139.1 32672005 | 21 | 4731700 | 4732788 | Serinus canaria 9135 | AAG\|GTAAATGCTC...ATGTCCTTTCTC/CATGAACTGAAT...GGCAG\|CTC | 0 | 1 | 55.067 |
| 182503860 | GT-AG | 0 | 1.000000099473604e-05 | 1037 | rna-XM_030231139.1 32672005 | 22 | 4732900 | 4733936 | Serinus canaria 9135 | GTG\|GTAAGGTTAC...TGATTCTTTGCT/ACAAGCTTCATT...TCCAG\|GGT | 0 | 1 | 57.209 |
| 182503861 | GT-AG | 0 | 1.000000099473604e-05 | 350 | rna-XM_030231139.1 32672005 | 23 | 4734068 | 4734417 | Serinus canaria 9135 | CAG\|GTAATATCTG...CATGTCTAAATA/CCATGTCTAAAT...GGCAG\|TGA | 2 | 1 | 59.738 |
| 182503862 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_030231139.1 32672005 | 24 | 4734604 | 4734691 | Serinus canaria 9135 | CCC\|GTGAGTCTTG...TTTTTTTTCATG/TTTTTTTTCATG...CCAAG\|CTA | 2 | 1 | 63.328 |
| 182503863 | GT-AG | 0 | 1.000000099473604e-05 | 1154 | rna-XM_030231139.1 32672005 | 25 | 4734751 | 4735904 | Serinus canaria 9135 | CAG\|GTATGAAAAT...ATTTCCTCAAAA/CATTTCCTCAAA...TTCAG\|AGT | 1 | 1 | 64.466 |
| 182503864 | GT-AG | 0 | 1.000000099473604e-05 | 495 | rna-XM_030231139.1 32672005 | 26 | 4736057 | 4736551 | Serinus canaria 9135 | AAG\|GTAGGTGGGG...GTTATCTTGATT/TTGATTCTAACA...AATAG\|GTT | 0 | 1 | 67.4 |
| 182503865 | GT-AG | 0 | 0.0001296792620622 | 698 | rna-XM_030231139.1 32672005 | 27 | 4736656 | 4737353 | Serinus canaria 9135 | CAG\|GTATAGTTCA...CATGCACTGACA/CATGCACTGACA...TCCAG\|GTT | 2 | 1 | 69.407 |
| 182503866 | GT-AG | 0 | 1.000000099473604e-05 | 597 | rna-XM_030231139.1 32672005 | 28 | 4737425 | 4738021 | Serinus canaria 9135 | TTG\|GTAAGGACGG...TTTTCCTTTCCC/CCCTCACTGACT...TTCAG\|ACA | 1 | 1 | 70.778 |
| 182503867 | GT-AG | 0 | 1.000000099473604e-05 | 1007 | rna-XM_030231139.1 32672005 | 29 | 4738240 | 4739246 | Serinus canaria 9135 | GAG\|GTCAGCTCTT...ACAGTTTTAACC/ACAGTTTTAACC...TGCAG\|CCT | 0 | 1 | 74.986 |
| 182503868 | GT-AG | 0 | 0.0007068554202599 | 1225 | rna-XM_030231139.1 32672005 | 30 | 4739382 | 4740606 | Serinus canaria 9135 | ATA\|GTAAGCTTTG...GTGCACTGAACG/CGTGCACTGAAC...CGCAG\|GTG | 0 | 1 | 77.591 |
| 182503869 | GT-AG | 0 | 1.209881195061617e-05 | 975 | rna-XM_030231139.1 32672005 | 31 | 4740730 | 4741704 | Serinus canaria 9135 | AAG\|GTAAACTGAC...TCAGGTTTAGTG/TCTGTTTCCATC...GGCAG\|CTG | 0 | 1 | 79.965 |
| 182503870 | GT-AG | 0 | 1.0880171337495852e-05 | 87 | rna-XM_030231139.1 32672005 | 32 | 4741800 | 4741886 | Serinus canaria 9135 | CCT\|GTAAGTGGCA...TGCTTCTTTACA/TGCTTCTTTACA...CTCAG\|GTT | 2 | 1 | 81.799 |
| 182503871 | GT-AG | 0 | 1.000000099473604e-05 | 526 | rna-XM_030231139.1 32672005 | 33 | 4741972 | 4742497 | Serinus canaria 9135 | GGA\|GTGAGTATTT...CAGTTCTGAATC/TTAAAACTGACC...CCCAG\|GTT | 0 | 1 | 83.439 |
| 182503872 | GT-AG | 0 | 1.8789491651397592e-05 | 894 | rna-XM_030231139.1 32672005 | 34 | 4742708 | 4743601 | Serinus canaria 9135 | CTG\|GTAGGCTCAA...TTGTCATTTATG/ACAGTTGTCATT...CACAG\|GTG | 0 | 1 | 87.493 |
| 182503873 | GT-AG | 0 | 1.000000099473604e-05 | 948 | rna-XM_030231139.1 32672005 | 35 | 4743764 | 4744711 | Serinus canaria 9135 | CAG\|GTAAGGTCAG...TGGGTTTTATTC/GTGGGTTTTATT...TGCAG\|CTG | 0 | 1 | 90.62 |
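
The `scored_motifs` strings in the table appear to follow a fixed layout, which the Python sketch below uses to recover the `dinucleotide_pair` value. The layout it assumes (upstream exon bases, a pipe, the intron's 5' end, an elided middle with a `/`-separated internal region, the intron's 3' end, a pipe, downstream exon bases) is inferred from the sample rows and is not documented on this page. The function name is hypothetical.

```python
def terminal_dinucleotides(scored_motifs: str) -> str:
    """Return the intron's first and last two bases as 'XX-YY'.

    Assumption: the text between the two '|' characters is intron sequence,
    with exon bases on either side of the pipes.
    """
    parts = scored_motifs.split("|")
    if len(parts) != 3:
        raise ValueError(f"expected exactly two '|' separators: {scored_motifs!r}")
    intron_part = parts[1]
    return f"{intron_part[:2]}-{intron_part[-2:]}"


# scored_motifs values copied from introns 1, 5, and 35 of the table.
samples = [
    "CAG|GTAACGCAAT...CTTGCTTTAAGC/AACATCCTGATT...CCTAG|GAC",
    "AAG|GTACATTTTA...TCATCCTCAAAA/CTGTGTTTCATG...CAAAG|AAA",
    "CAG|GTAAGGTCAG...TGGGTTTTATTC/GTGGGTTTTATT...TGCAG|CTG",
]

pairs = [terminal_dinucleotides(s) for s in samples]
print(pairs)  # ['GT-AG', 'GT-AG', 'GT-AG']
```

Each result agrees with the `dinucleotide_pair` column for the corresponding row, which is consistent with the assumed layout but does not confirm it.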
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL,
PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);
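
To show how the `transcript_id` filter used on this page interacts with `idx_introns_transcript_id`, here is a minimal sketch using Python's standard-library `sqlite3` module. It builds an in-memory table with only a subset of the columns and no foreign keys, loads three rows from the table above plus one made-up row for a different transcript, and asks SQLite for its query plan. The reduced schema and the extra row are illustrative assumptions, not the real database.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Reduced version of the introns schema: a subset of columns, no foreign keys.
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        ordinal_index INTEGER,
        phase INTEGER,
        length INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

rows = [
    (182503839, 32672005, 1, 0, 889),
    (182503840, 32672005, 2, 0, 551),
    (182503841, 32672005, 3, 1, 536),
    (1, 999, 1, 0, 100),  # made-up row for a different transcript
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?)", rows)

query = (
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY ordinal_index"
)
result = conn.execute(query, (32672005,)).fetchall()

# Each EXPLAIN QUERY PLAN row is (id, parent, notused, detail).
plan = conn.execute("EXPLAIN QUERY PLAN " + query, (32672005,)).fetchall()
plan_text = " ".join(row[3] for row in plan)

print(result)
print(plan_text)
conn.close()
```

The exact wording of the plan varies between SQLite versions, so the sketch only prints it; the point to look for is that the plan names `idx_introns_transcript_id` rather than a full table scan.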