introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)
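
The column descriptions above imply a few relationships that can be checked against the rows on this page. The following is a minimal sketch, assuming that `start` and `end` are 1-based inclusive coordinates (so `length = end - start + 1`) and that the terminal dinucleotides sit immediately inside the `|` separators of `scored_motifs`. Both assumptions hold for the rows shown here but are not stated by the database itself, and the helper names are hypothetical.

```python
def intron_length(start: int, end: int) -> int:
    """Length of an intron, assuming 1-based inclusive coordinates."""
    return end - start + 1


def terminal_dinucleotides(scored_motifs: str) -> str:
    """Derive a pair such as 'GT-AG' from a scored_motifs string.

    Assumes the first '|' marks the 5' exon/intron boundary and the
    last '|' marks the 3' intron/exon boundary (an inference from the
    rows on this page, not a documented format).
    """
    five_prime = scored_motifs.split("|", 1)[1][:2]
    three_prime = scored_motifs.rsplit("|", 1)[0][-2:]
    return f"{five_prime}-{three_prime}"


# Values copied from intron 68111043 (ordinal_index 1) on this page.
row = {
    "start": 22412804,
    "end": 22418808,
    "length": 6005,
    "dinucleotide_pair": "GT-AG",
    "scored_motifs": "GAG|GTGAGTGAGG...AGCACCTTCACG/AGCACCTTCACG...CCAAG|GCT",
}

assert intron_length(row["start"], row["end"]) == row["length"]
assert terminal_dinucleotides(row["scored_motifs"]) == row["dinucleotide_pair"]
```

If either assertion failed for another row, that would suggest the coordinate or motif-layout assumption does not hold in general.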

30 rows where transcript_id = 12801882

id dinucleotide_pair is_minor score length transcript_id (label, id) ordinal_index start end taxonomy_id (species, id) scored_motifs phase in_cds relative_position
68111043 GT-AG 0 1.000000099473604e-05 6005 rna-XM_029507516.1 12801882 1 22412804 22418808 Echeneis naucrates 173247 GAG|GTGAGTGAGG...AGCACCTTCACG/AGCACCTTCACG...CCAAG|GCT 0 1 2.234
68111044 GT-AG 0 0.0001907018753054 118 rna-XM_029507516.1 12801882 2 22418880 22418997 Echeneis naucrates 173247 CGA|GTAAGCTAGT...CAAGTCTAAATA/TCAAGTCTAAAT...CACAG|GCT 2 1 3.589
68111045 GT-AG 0 1.000000099473604e-05 101 rna-XM_029507516.1 12801882 3 22419068 22419168 Echeneis naucrates 173247 CTG|GTAAGGAGAC...TAAACCCTGACT/TAAACCCTGACT...TCTAG|GAC 0 1 4.926
68111046 GT-AG 0 2.853361562954884e-05 88 rna-XM_029507516.1 12801882 4 22419276 22419363 Echeneis naucrates 173247 GAA|GTAAATATGC...TCTGATTTAATC/ACATGTCTGATT...ATCAG|GGA 2 1 6.968
68111047 GT-AG 0 1.000000099473604e-05 102 rna-XM_029507516.1 12801882 5 22419395 22419496 Echeneis naucrates 173247 AAT|GTGAGTTGTT...GCATCTTTGTTT/TTAGATCTCATG...ACCAG|GTG 0 1 7.56
68111048 GT-AG 0 1.000000099473604e-05 641 rna-XM_029507516.1 12801882 6 22419608 22420248 Echeneis naucrates 173247 GTG|GTGAGTTAGC...AAAACCTAAGTT/AAAAACCTAAGT...TTTAG|GAG 0 1 9.679
68111049 GT-AG 0 1.000000099473604e-05 212 rna-XM_029507516.1 12801882 7 22420377 22420588 Echeneis naucrates 173247 CAG|GTGAGAGACG...CTTTTCTTTTCT/GATTAAATAAGT...GACAG|TCT 2 1 12.123
68111050 GT-AG 0 1.000000099473604e-05 939 rna-XM_029507516.1 12801882 8 22420660 22421598 Echeneis naucrates 173247 ATG|GTTAGTTCTC...TTGCTCTTGATT/TACTTTCTAACC...CCCAG|GGC 1 1 13.478
68111051 GT-AG 0 3.4614923263664886e-05 166 rna-XM_029507516.1 12801882 9 22421801 22421966 Echeneis naucrates 173247 CAG|GTAACGGACA...CATCCCTTATCT/TAAAGTTTCATC...TCTAG|TTG 2 1 17.335
68111052 GT-AG 0 1.000000099473604e-05 491 rna-XM_029507516.1 12801882 10 22422165 22422655 Echeneis naucrates 173247 CAG|GTCAGAGAGG...GGATTTTTGACC/GGATTTTTGACC...TCCAG|CCC 2 1 21.115
68111053 GT-AG 0 1.000000099473604e-05 203 rna-XM_029507516.1 12801882 11 22422962 22423164 Echeneis naucrates 173247 CAG|GTTAGCTGCG...TGATGTGTAACA/TGATGTGTAACA...GACAG|TGA 2 1 26.957
68111054 GT-AG 0 0.0004267618050167 1504 rna-XM_029507516.1 12801882 12 22423435 22424938 Echeneis naucrates 173247 CAG|GTGCCCAACA...GTGTCCTCATTT/TGTGTCCTCATT...TACAG|TGA 2 1 32.111
68111055 GT-AG 0 2.3627092658093206e-05 993 rna-XM_029507516.1 12801882 13 22425197 22426189 Echeneis naucrates 173247 CAG|GTAACGACAA...TTATCCTTCATT/TTATCCTTCATT...TACAG|CCA 2 1 37.037
68111056 GT-AG 0 1.000000099473604e-05 109 rna-XM_029507516.1 12801882 14 22426550 22426658 Echeneis naucrates 173247 CAG|GTGGGCTGCA...TGTGTTTTACAT/TGTTTAGTTATT...TATAG|GAG 2 1 43.91
68111057 GT-AG 0 0.0018360462089756 1059 rna-XM_029507516.1 12801882 15 22426998 22428056 Echeneis naucrates 173247 AAG|GTATTTAATG...TACACCTTAGTC/CTTAGTCTGAGT...TTTAG|CAG 2 1 50.382
68111058 GT-AG 0 1.000000099473604e-05 243 rna-XM_029507516.1 12801882 16 22428390 22428632 Echeneis naucrates 173247 AAT|GTGAGATGCC...TTTCCCTTACCT/TTTCCTCTCATT...TTTAG|GAT 2 1 56.739
68111059 GT-AG 0 4.124466815004531e-05 982 rna-XM_029507516.1 12801882 17 22428672 22429653 Echeneis naucrates 173247 AGA|GTAAGTCTTT...TATGCCTTTTCT/CCTTTTCTGATT...TCTAG|GAG 2 1 57.484
68111060 GT-AG 0 1.000000099473604e-05 127 rna-XM_029507516.1 12801882 18 22429784 22429910 Echeneis naucrates 173247 GAT|GTAAGAAACT...CATTATTTGACA/CATTATTTGACA...GGCAG|GAT 0 1 59.966
68111061 GT-AG 0 1.000000099473604e-05 116 rna-XM_029507516.1 12801882 19 22430069 22430184 Echeneis naucrates 173247 GAA|GTAAGGATGC...ACTACCATAACA/TCTCTGCTTATT...TCTAG|GGA 2 1 62.982
68111062 GT-AG 0 1.000000099473604e-05 125 rna-XM_029507516.1 12801882 20 22430846 22430970 Echeneis naucrates 173247 AAG|GTGAGGGTGT...TGATCCTTATCA/GTGATCCTTATC...CTCAG|ATG 0 1 75.601
68111063 GT-AG 0 1.000000099473604e-05 1078 rna-XM_029507516.1 12801882 21 22431304 22432381 Echeneis naucrates 173247 CAG|GTAAGGGTTA...AAAGTGTTACTC/TTCTGGTTTACA...TTTAG|GTC 0 1 81.959
68111064 GT-AG 0 1.999753978186752e-05 177 rna-XM_029507516.1 12801882 22 22432485 22432661 Echeneis naucrates 173247 CAG|GTAAATCTGT...TTTCTTTTAGCA/TTTTAATTTATT...CACAG|ACG 1 1 83.925
68111065 GT-AG 0 1.000000099473604e-05 183 rna-XM_029507516.1 12801882 23 22432688 22432870 Echeneis naucrates 173247 ACT|GTGAGTATCT...GTCCTTTTGGTG/TTGGTGCGTATA...GTTAG|ATT 0 1 84.422
68111066 GT-AG 0 0.0012304785562252 153 rna-XM_029507516.1 12801882 24 22432985 22433137 Echeneis naucrates 173247 CAG|GTACACTCGT...GTGTGTTTAACT/GTGTGTTTAACT...TGCAG|GAG 0 1 86.598
68111067 GT-AG 0 1.000000099473604e-05 122 rna-XM_029507516.1 12801882 25 22433231 22433352 Echeneis naucrates 173247 GAG|GTGCAAGCAC...TTTTCTGTACTT/CATAATATTATA...TCCAG|GAA 0 1 88.373
68111068 GT-AG 0 1.000000099473604e-05 111 rna-XM_029507516.1 12801882 26 22433549 22433659 Echeneis naucrates 173247 AGG|GTGGGCTTGC...TATTCCTTTTTG/ATTTGTTTGATG...TACAG|ATT 1 1 92.115
68111069 GT-AG 0 1.000000099473604e-05 348 rna-XM_029507516.1 12801882 27 22433760 22434107 Echeneis naucrates 173247 AAG|GTGTGTGTGT...TTCTTTTTATTT/TTTCTTTTTATT...TGCAG|GTC 2 1 94.024
68111070 GT-AG 0 7.420550833529969e-05 108 rna-XM_029507516.1 12801882 28 22434183 22434290 Echeneis naucrates 173247 CAG|GTAATCCGCA...TGTATCTTAATT/TGTATCTTAATT...TGTAG|GTG 2 1 95.456
68111071 GT-AG 0 0.0003360245099999 274 rna-XM_029507516.1 12801882 29 22434400 22434673 Echeneis naucrates 173247 AAG|GTATTGTCTT...CTTTCTATGATT/TGGTTGTTCACT...CATAG|TGT 0 1 97.537
68111072 GT-AG 0 1.000000099473604e-05 162 rna-XM_029507516.1 12801882 30 22434777 22434938 Echeneis naucrates 173247 GGG|GTGAGAACAC...CTGATTTTCGTG/GAAACGCTGATT...TGCAG|GCA 1 1 99.504
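
The filter behind this view (`transcript_id = 12801882`) can be reproduced with Python's standard `sqlite3` module. This is only a sketch: it uses an in-memory database holding a subset of the columns and the first three rows from the table above, because the name and location of the real database file are not given on this page. The function name `introns_for_transcript` is hypothetical.

```python
import sqlite3

# In-memory stand-in for the real database (an assumption for illustration).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        score REAL,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

# First three rows of the result table on this page (subset of columns).
sample_rows = [
    (68111043, "GT-AG", 0, 1.000000099473604e-05, 6005, 12801882, 1),
    (68111044, "GT-AG", 0, 0.0001907018753054, 118, 12801882, 2),
    (68111045, "GT-AG", 0, 1.000000099473604e-05, 101, 12801882, 3),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?, ?)", sample_rows)


def introns_for_transcript(conn, transcript_id):
    """Return (id, ordinal_index, length, score) for one transcript, ordered by id."""
    cur = conn.execute(
        "SELECT id, ordinal_index, length, score "
        "FROM introns WHERE transcript_id = ? ORDER BY id",
        (transcript_id,),  # bound parameter, never string-formatted into the SQL
    )
    return cur.fetchall()


rows = introns_for_transcript(conn, 12801882)
for intron_id, ordinal, length, score in rows:
    print(intron_id, ordinal, length, score)
```

Against the full table the same query would return the 30 rows listed above.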

CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY ([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY ([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
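
The indexes above are what allow a filter on a column such as `transcript_id` to avoid a full table scan. One way to check this, sketched here against an empty in-memory table with a reduced column set (an assumption, since the real database file is not referenced on this page), is SQLite's `EXPLAIN QUERY PLAN`. The helper `query_plan` is hypothetical, and the exact wording of the plan text varies between SQLite versions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (id INTEGER PRIMARY KEY, transcript_id INTEGER, score REAL);
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    CREATE INDEX idx_introns_score ON introns (score);
    """
)


def query_plan(conn, sql, params=()):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql, params).fetchall()
    # The detail text is the last column of each plan row.
    return [row[-1] for row in rows]


plan = query_plan(
    conn,
    "SELECT id FROM introns WHERE transcript_id = ?",
    (12801882,),
)
print(plan)
```

A plan line that names `idx_introns_transcript_id` indicates an index search rather than a scan of the whole table.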