introns
Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)
- id
- INTEGER (primary key), globally unique identifier for each intron
- dinucleotide_pair
- TEXT, terminal dinucleotide sequences of the intron
- is_minor
- INTEGER, indicates if the intron is a minor intron (1) or not (0)
- score
- REAL, score representing the probability (0-100%) of the intron being minor
- length
- INTEGER, length of the intron in base pairs
- transcript_id
- INTEGER (foreign key referencing transcripts(id)), parent transcript
- ordinal_index
- INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
- start
- INTEGER, start position of the intron in the genome
- end
- INTEGER, end position of the intron in the genome
- taxonomy_id
- INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
- scored_motifs
- TEXT, motifs scored for the intron
- phase
- INTEGER, phase of the intron in coding sequence (0, 1, or 2 or null for introns outside of coding sequence)
- in_cds
- INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
- relative_position
- REAL, relative position of the intron within the transcript (as a percentage of coding length)
35 rows where transcript_id = 14614476
This data as json, CSV (advanced)
Suggested facets: score, phase
| id ▼ | dinucleotide_pair | is_minor | score | length | transcript_id | ordinal_index | start | end | taxonomy_id | scored_motifs | phase | in_cds | relative_position |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 78405774 | GT-AG | 0 | 0.0026820257492343 | 1651 | rna-XM_005038275.2 14614476 | 1 | 91000227 | 91001877 | Ficedula albicollis 59894 | GCC|GTAAGCTATG...TCCTCCTTCTCT/CCTTCTCTGACC...CCTAG|GCA | 2 | 1 | 1.924 |
| 78405775 | GT-AG | 0 | 1.000000099473604e-05 | 1042 | rna-XM_005038275.2 14614476 | 2 | 90999001 | 91000042 | Ficedula albicollis 59894 | TCT|GTGAGTATCT...TCCTTCTTACTT/TTCCTTCTTACT...TCCAG|CTT | 0 | 1 | 6.04 |
| 78405776 | GT-AG | 0 | 3.543257338447481e-05 | 883 | rna-XM_005038275.2 14614476 | 3 | 90997958 | 90998840 | Ficedula albicollis 59894 | CAG|GTACCAGGAA...CTGTCTTTCTCC/TGGTTACTGATT...ATCAG|TCC | 1 | 1 | 9.62 |
| 78405777 | GT-AG | 0 | 2.892893917523625e-05 | 587 | rna-XM_005038275.2 14614476 | 4 | 90997318 | 90997904 | Ficedula albicollis 59894 | ATG|GTAAGCATGA...TCTGTCTAAACA/CTCTGTCTAAAC...TCTAG|ATT | 0 | 1 | 10.805 |
| 78405778 | GT-AG | 0 | 1.000000099473604e-05 | 343 | rna-XM_005038275.2 14614476 | 5 | 90996954 | 90997296 | Ficedula albicollis 59894 | GAG|GTATGTACAA...TTGTTCTCACAA/ATTGTTCTCACA...TGTAG|GAT | 0 | 1 | 11.275 |
| 78405779 | GT-AG | 0 | 0.0057748021505573 | 1162 | rna-XM_005038275.2 14614476 | 6 | 90995623 | 90996784 | Ficedula albicollis 59894 | ATG|GTAACTATCA...TATTCCCTATCA/TCTGATGTTATT...TGCAG|TGC | 1 | 1 | 15.056 |
| 78405780 | GT-AG | 0 | 1.000000099473604e-05 | 727 | rna-XM_005038275.2 14614476 | 7 | 90994811 | 90995537 | Ficedula albicollis 59894 | ACT|GTGAGTATCA...TCCCTCTTACAT/TACATTCTCAAT...TGTAG|ATA | 2 | 1 | 16.957 |
| 78405781 | GT-AG | 0 | 1.000000099473604e-05 | 1460 | rna-XM_005038275.2 14614476 | 8 | 90993227 | 90994686 | Ficedula albicollis 59894 | CAG|GTGAGTCAGG...ATGGTCTTGTTT/CATTGTATAAAT...AACAG|ACA | 0 | 1 | 19.732 |
| 78405782 | GT-AG | 0 | 1.922516121757142e-05 | 245 | rna-XM_005038275.2 14614476 | 9 | 90992867 | 90993111 | Ficedula albicollis 59894 | CAG|GTATGGATCA...TAGTTCTTCATG/TAGTTCTTCATG...TGCAG|GAG | 1 | 1 | 22.304 |
| 78405783 | GT-AG | 0 | 1.000000099473604e-05 | 335 | rna-XM_005038275.2 14614476 | 10 | 90992422 | 90992756 | Ficedula albicollis 59894 | CAG|GTAGGGAAAC...TCTTTCTAAACT/CTCTTTCTAAAC...TCCAG|GTG | 0 | 1 | 24.765 |
| 78405784 | GT-AG | 0 | 1.000000099473604e-05 | 1619 | rna-XM_005038275.2 14614476 | 11 | 90990644 | 90992262 | Ficedula albicollis 59894 | AGG|GTAAGTCCTT...ATTTTCTCATTT/CATTTTCTCATT...TCCAG|GCA | 0 | 1 | 28.322 |
| 78405785 | GT-AG | 0 | 1.000000099473604e-05 | 333 | rna-XM_005038275.2 14614476 | 12 | 90990083 | 90990415 | Ficedula albicollis 59894 | CTG|GTAAGTTTGC...TCAGGCTTGGCT/GCTTGGCTAATG...TTCAG|GTG | 0 | 1 | 33.423 |
| 78405786 | GT-AG | 0 | 1.000000099473604e-05 | 485 | rna-XM_005038275.2 14614476 | 13 | 90989534 | 90990018 | Ficedula albicollis 59894 | GTG|GTGAGTTCCT...TGCACCTTAACT/TGCACCTTAACT...GCTAG|ACA | 1 | 1 | 34.855 |
| 78405787 | GT-AG | 0 | 1.000000099473604e-05 | 656 | rna-XM_005038275.2 14614476 | 14 | 90988735 | 90989390 | Ficedula albicollis 59894 | AAG|GTAAAGTGTC...GCATCTGTGACA/GCATCTGTGACA...TACAG|GTG | 0 | 1 | 38.054 |
| 78405788 | GT-AG | 0 | 0.0008006532244841 | 691 | rna-XM_005038275.2 14614476 | 15 | 90987894 | 90988584 | Ficedula albicollis 59894 | TCT|GTAAGTTCTT...TGTTTTGTAACA/TGTTTTGTAACA...CCAAG|GTG | 0 | 1 | 41.409 |
| 78405789 | GT-AG | 0 | 2.6708436479328452e-05 | 617 | rna-XM_005038275.2 14614476 | 16 | 90987094 | 90987710 | Ficedula albicollis 59894 | AAG|GTATAGACCC...TAAACCTCAACA/ACAGAATTCAGT...CATAG|GAA | 0 | 1 | 45.503 |
| 78405790 | GT-AG | 0 | 1.000000099473604e-05 | 1883 | rna-XM_005038275.2 14614476 | 17 | 90985060 | 90986942 | Ficedula albicollis 59894 | GCG|GTAAAGGAAC...ATTTCTTTTGCA/TTGCATCTCACC...TTCAG|GCA | 1 | 1 | 48.881 |
| 78405791 | GT-AG | 0 | 1.000000099473604e-05 | 310 | rna-XM_005038275.2 14614476 | 18 | 90984641 | 90984950 | Ficedula albicollis 59894 | AAG|GTGAGTACCT...CTGTCCTTGCTG/GTAAAATTGATC...ACCAG|CTC | 2 | 1 | 51.32 |
| 78405792 | GT-AG | 0 | 1.000000099473604e-05 | 268 | rna-XM_005038275.2 14614476 | 19 | 90984144 | 90984411 | Ficedula albicollis 59894 | AGG|GTAAGGGAGG...CTTTTCTTAATT/CTTTTCTTAATT...GGCAG|GTC | 0 | 1 | 56.443 |
| 78405793 | GT-AG | 0 | 1.000000099473604e-05 | 628 | rna-XM_005038275.2 14614476 | 20 | 90983389 | 90984016 | Ficedula albicollis 59894 | TAG|GTAAAGGTCT...TGACCCCTGTTT/GGCTGACTGACC...TCCAG|GGC | 1 | 1 | 59.284 |
| 78405794 | GT-AG | 0 | 1.1584630764447777e-05 | 1377 | rna-XM_005038275.2 14614476 | 21 | 90981890 | 90983266 | Ficedula albicollis 59894 | GAG|GTACGTGGGC...TTTTCCTTCATC/TCCTGTCTCACT...CCAAG|CCT | 0 | 1 | 62.013 |
| 78405795 | GT-AG | 0 | 1.000000099473604e-05 | 505 | rna-XM_005038275.2 14614476 | 22 | 90981333 | 90981837 | Ficedula albicollis 59894 | AAG|GTAAAGCCCA...ATTTCTTTGAAC/ATTTCTTTGAAC...CTCAG|GAG | 1 | 1 | 63.177 |
| 78405796 | GT-AG | 0 | 1.000000099473604e-05 | 392 | rna-XM_005038275.2 14614476 | 23 | 90980857 | 90981248 | Ficedula albicollis 59894 | TGG|GTGAGTTAAC...CAGCCCTAAATA/ATATTGCTAACA...CTCAG|GTG | 1 | 1 | 65.056 |
| 78405797 | GT-AG | 0 | 1.000000099473604e-05 | 513 | rna-XM_005038275.2 14614476 | 24 | 90980167 | 90980679 | Ficedula albicollis 59894 | GCG|GTGAGCTGGG...CTTGCCTTATCT/TCTTGCCTTATC...TGCAG|GTT | 1 | 1 | 69.016 |
| 78405798 | GT-AG | 0 | 1.000000099473604e-05 | 1171 | rna-XM_005038275.2 14614476 | 25 | 90978911 | 90980081 | Ficedula albicollis 59894 | CTG|GTGAGATTTC...CTTTCTTTTTCT/GGCCTGCTCAAG...TTCAG|GCT | 2 | 1 | 70.917 |
| 78405799 | GT-AG | 0 | 1.000000099473604e-05 | 272 | rna-XM_005038275.2 14614476 | 26 | 90978482 | 90978753 | Ficedula albicollis 59894 | AAG|GTAACAAGCC...TCTACCTTTGTG/AGTGGGGTCAGC...TGCAG|GGT | 0 | 1 | 74.43 |
| 78405800 | GT-AG | 0 | 0.2874254193434918 | 270 | rna-XM_005038275.2 14614476 | 27 | 90978137 | 90978406 | Ficedula albicollis 59894 | ACT|GTATGTTTCT...CATTTCTAATCA/ACATTTCTAATC...CCCAG|TAC | 0 | 1 | 76.107 |
| 78405801 | GT-AG | 0 | 1.000000099473604e-05 | 941 | rna-XM_005038275.2 14614476 | 28 | 90977030 | 90977970 | Ficedula albicollis 59894 | AGG|GTGAGTGTTC...TATGCAATAGCA/GCAGTATGCAAT...TGCAG|ACG | 1 | 1 | 79.821 |
| 78405802 | GT-AG | 0 | 1.000000099473604e-05 | 366 | rna-XM_005038275.2 14614476 | 29 | 90976440 | 90976805 | Ficedula albicollis 59894 | CAG|GTGAGCTGCT...TGGGACTCATCT/CTGGGACTCATC...TTCAG|GAC | 0 | 1 | 84.832 |
| 78405803 | GT-AG | 0 | 1.000000099473604e-05 | 358 | rna-XM_005038275.2 14614476 | 30 | 90975863 | 90976220 | Ficedula albicollis 59894 | CAG|GTGAGGGTGA...CCACGCTCACAC/CCCACGCTCACA...CCTAG|ACA | 0 | 1 | 89.732 |
| 78405804 | GT-AG | 0 | 1.000000099473604e-05 | 355 | rna-XM_005038275.2 14614476 | 31 | 90975380 | 90975734 | Ficedula albicollis 59894 | CAG|GTGAGTCTGC...CCAGCTTTGAGT/TTTGAGTTGATG...CCCAG|TTA | 2 | 1 | 92.595 |
| 78405805 | GT-AG | 0 | 1.000000099473604e-05 | 1232 | rna-XM_005038275.2 14614476 | 32 | 90974057 | 90975288 | Ficedula albicollis 59894 | AAG|GTAGGAACCT...TTGGTCTTCTTT/AACGCAGTAACA...TTCAG|CTG | 0 | 1 | 94.631 |
| 78405806 | GT-AG | 0 | 1.000000099473604e-05 | 409 | rna-XM_005038275.2 14614476 | 33 | 90973579 | 90973987 | Ficedula albicollis 59894 | AAG|GTGAGGATGG...CCCCCCTGAGCC/GCCCCCCTGAGC...CCCAG|CTG | 0 | 1 | 96.174 |
| 78405807 | GT-AG | 0 | 1.000000099473604e-05 | 574 | rna-XM_005038275.2 14614476 | 34 | 90972902 | 90973475 | Ficedula albicollis 59894 | CGG|GTGAGTGCCG...TCTCCCTTCCCT/TGAGAGCTCACA...TCCAG|ATG | 1 | 1 | 98.479 |
| 78405808 | GT-AG | 0 | 1.000000099473604e-05 | 817 | rna-XM_005038275.2 14614476 | 35 | 90972043 | 90972859 | Ficedula albicollis 59894 | CAG|GTAGGTGCAC...TTGTCCTTTTCC/ATTTGTCTCACT...TCCAG|TCA | 1 | 1 | 99.418 |
Advanced export
JSON shape: default, array, newline-delimited, object
CREATE TABLE "introns" (
"id" INTEGER,
"dinucleotide_pair" TEXT,
"is_minor" INTEGER,
"score" REAL,
"length" INTEGER,
"transcript_id" INTEGER,
"ordinal_index" INTEGER,
"start" INTEGER,
"end" INTEGER,
"taxonomy_id" INTEGER,
"scored_motifs" TEXT,
"phase" INTEGER,
"in_cds" INTEGER,
"relative_position" REAL
,PRIMARY KEY ([id]),
FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
ON [introns] ([in_cds]);