introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
28 rows where transcript_id = 15550469
This data as json, CSV (advanced)
Suggested facets: score, length, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 84003804 | GT-AG | 0 | 3.0360758576766144e-05 | 104 | rna-XM_028386865.1 15550469 | 1 | 52050260 | 52050363 | Glycine soja 3848 | ACG|GTCCGCATTT...CGTTTTCTGACT/CGTTTTCTGACT...TGCAG|TTG | 0 | 1 | 0.807 |
| 84003805 | GT-AG | 0 | 0.0028805045467064 | 93 | rna-XM_028386865.1 15550469 | 2 | 52050569 | 52050661 | Glycine soja 3848 | AAG|GTAACCAACC...TGTTTCTTCTCC/AATTCGATCACC...TGCAG|CTC | 1 | 1 | 4.745 |
| 84003806 | GT-AG | 0 | 1.000000099473604e-05 | 102 | rna-XM_028386865.1 15550469 | 3 | 52050751 | 52050852 | Glycine soja 3848 | CAG|GTGAGGTGTT...TCGTTATTAATT/TCGTTATTAATT...TTTAG|GTT | 0 | 1 | 6.455 |
| 84003807 | GT-AG | 0 | 1.000000099473604e-05 | 146 | rna-XM_028386865.1 15550469 | 4 | 52050932 | 52051077 | Glycine soja 3848 | AAG|GTAAAGCTAA...ATGTTTTTACCG/AATGTTTTTACC...TCTAG|GTG | 1 | 1 | 7.973 |
| 84003808 | GT-AG | 0 | 0.0224128413726486 | 1106 | rna-XM_028386865.1 15550469 | 5 | 52051206 | 52052311 | Glycine soja 3848 | GAT|GTATGTATTT...GGAATTTTATTT/TGGAATTTTATT...TACAG|AGA | 0 | 1 | 10.432 |
| 84003809 | GT-AG | 0 | 1.000000099473604e-05 | 88 | rna-XM_028386865.1 15550469 | 6 | 52052411 | 52052498 | Glycine soja 3848 | AGG|GTAATTCTGT...TGCTGTTTGATA/TGCTGTTTGATA...TGTAG|GTT | 0 | 1 | 12.334 |
| 84003810 | GT-AG | 0 | 1.000000099473604e-05 | 298 | rna-XM_028386865.1 15550469 | 7 | 52052608 | 52052905 | Glycine soja 3848 | CAG|GTAAAAGCAT...TCATCATTTACT/CTCTTACTCATC...ATCAG|GCA | 1 | 1 | 14.428 |
| 84003811 | GT-AG | 0 | 7.719109442620272e-05 | 74 | rna-XM_028386865.1 15550469 | 8 | 52052986 | 52053059 | Glycine soja 3848 | CAG|GTATAAGTGT...CTTGTTTTATTA/ACTTGTTTTATT...TGCAG|ATT | 0 | 1 | 15.965 |
| 84003812 | GT-AG | 0 | 0.0004655271937445 | 86 | rna-XM_028386865.1 15550469 | 9 | 52053288 | 52053373 | Glycine soja 3848 | CAG|GTATTTAATG...TACTTCTAAATG/TTGGCATTTACT...TGCAG|TTA | 0 | 1 | 20.346 |
| 84003813 | GT-AG | 0 | 0.0028005251382842 | 584 | rna-XM_028386865.1 15550469 | 10 | 52053440 | 52054023 | Glycine soja 3848 | TTG|GTAACTTGAG...TGTTTCTTTTTT/CCTATAGTTATT...CTCAG|GTT | 0 | 1 | 21.614 |
| 84003814 | GT-AG | 0 | 0.0002917632974684 | 154 | rna-XM_028386865.1 15550469 | 11 | 52054141 | 52054294 | Glycine soja 3848 | CAG|GTATTTACGC...AAGTTCTTGAGT/CCTTTGTTCATT...TTCAG|AGT | 0 | 1 | 23.862 |
| 84003815 | GT-AG | 0 | 1.000000099473604e-05 | 196 | rna-XM_028386865.1 15550469 | 12 | 52054493 | 52054688 | Glycine soja 3848 | AAG|GTCCTGACAG...TATCCTTTATTT/CTATCCTTTATT...TGTAG|ACA | 0 | 1 | 27.666 |
| 84003816 | GT-AG | 0 | 0.0339683510032251 | 118 | rna-XM_028386865.1 15550469 | 13 | 52054806 | 52054923 | Glycine soja 3848 | CAG|GTATTTTGTT...TTTTTTTTAATA/TTTTTTTTAATA...TGAAG|GTT | 0 | 1 | 29.914 |
| 84003817 | GT-AG | 0 | 1.000000099473604e-05 | 308 | rna-XM_028386865.1 15550469 | 14 | 52054984 | 52055291 | Glycine soja 3848 | CAG|GTGAGAAGGT...TTTTTTTTGGTT/CATAATTTAATA...TGCAG|GGT | 0 | 1 | 31.066 |
| 84003818 | GT-AG | 0 | 0.0003855178910277 | 319 | rna-XM_028386865.1 15550469 | 15 | 52055379 | 52055697 | Glycine soja 3848 | CAG|GTTTATTTGC...TATTTCTTATGT/ATATTTCTTATG...TTCAG|AGG | 0 | 1 | 32.738 |
| 84003819 | GT-AG | 0 | 1.446634670385824e-05 | 103 | rna-XM_028386865.1 15550469 | 16 | 52055782 | 52055884 | Glycine soja 3848 | AAG|GTTCTTTTTA...TTTTTTTCAAAT/ATTTTTTTCAAA...TTTAG|GTT | 0 | 1 | 34.352 |
| 84003820 | GT-AG | 0 | 5.724915108715935e-05 | 82 | rna-XM_028386865.1 15550469 | 17 | 52056020 | 52056101 | Glycine soja 3848 | CAG|GTTCTTTGCT...TATTTCTTGAAC/TATTTCTTGAAC...TGCAG|CAT | 0 | 1 | 36.945 |
| 84003821 | GT-AG | 0 | 0.0001153636013427 | 102 | rna-XM_028386865.1 15550469 | 18 | 52056366 | 52056467 | Glycine soja 3848 | CAG|GTACTGTTTC...AGAATCTTACTA/ATCTTACTAATA...TACAG|ACT | 0 | 1 | 42.017 |
| 84003822 | GT-AG | 0 | 0.0040612050379451 | 113 | rna-XM_028386865.1 15550469 | 19 | 52056581 | 52056693 | Glycine soja 3848 | CAG|GTATTTCTTT...TTTTCTTTCAAT/ATTTTTGTAATT...TAAAG|TGA | 2 | 1 | 44.188 |
| 84003823 | GT-AG | 0 | 0.0068416077448169 | 120 | rna-XM_028386865.1 15550469 | 20 | 52056947 | 52057066 | Glycine soja 3848 | AAG|GTAACTTTGT...TAAACTTTGAAC/AAAGTTTTAAAA...TTTAG|GTC | 0 | 1 | 49.049 |
| 84003824 | GT-AG | 0 | 1.000000099473604e-05 | 70 | rna-XM_028386865.1 15550469 | 21 | 52057204 | 52057273 | Glycine soja 3848 | GAG|GTGCGTCTCT...TCTCATTTAATT/TCATTTCTCATT...CATAG|GAA | 2 | 1 | 51.681 |
| 84003825 | GT-AG | 0 | 1.000000099473604e-05 | 98 | rna-XM_028386865.1 15550469 | 22 | 52057553 | 52057650 | Glycine soja 3848 | AAG|GTGTGTACTG...TATTTTCTAATG/TATTTTCTAATG...GTCAG|ATA | 2 | 1 | 57.041 |
| 84003826 | GT-AG | 0 | 1.000000099473604e-05 | 237 | rna-XM_028386865.1 15550469 | 23 | 52058654 | 52058890 | Glycine soja 3848 | CAG|GTGACACCAT...AACCTTTTGACC/CTTTGTCTAACC...TGAAG|GTG | 0 | 1 | 76.311 |
| 84003827 | GT-AG | 0 | 3.823393079582247e-05 | 124 | rna-XM_028386865.1 15550469 | 24 | 52059377 | 52059500 | Glycine soja 3848 | AAG|GTCTGCAATT...TATTCTGTAATA/GGATTTTTTATT...TTCAG|GCT | 0 | 1 | 85.648 |
| 84003828 | GT-AG | 0 | 1.000000099473604e-05 | 118 | rna-XM_028386865.1 15550469 | 25 | 52059672 | 52059789 | Glycine soja 3848 | AAG|GTGAATCTCG...TTATTATTGATT/TTATTATTGATT...AGCAG|ATT | 0 | 1 | 88.934 |
| 84003829 | GT-AG | 0 | 8.604876280678742e-05 | 192 | rna-XM_028386865.1 15550469 | 26 | 52059976 | 52060167 | Glycine soja 3848 | CAG|GTGTTTATTT...TTTCTTTTAATT/TTTCTTTTAATT...TGCAG|GGA | 0 | 1 | 92.507 |
| 84003830 | GT-AG | 0 | 0.0005840948385732 | 87 | rna-XM_028386865.1 15550469 | 27 | 52060291 | 52060377 | Glycine soja 3848 | CAG|GTATGTATTT...AAAATTTTGAAA/TTTTTGCTAAAA...TGAAG|TTT | 0 | 1 | 94.87 |
| 84003831 | GT-AG | 0 | 2.5628389468323163 | 80 | rna-XM_028386865.1 15550469 | 28 | 52060489 | 52060568 | Glycine soja 3848 | CAG|GTACCCTTTT...TATTGTTTGATT/TATTGTTTGATT...TACAG|AAT | 0 | 1 | 97.003 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);