introns

Data license: ODbL · Data source: Larue & Roy, 2023 · About: Minor Intron Database (WtMTA)

id
INTEGER (primary key), globally unique identifier for each intron
dinucleotide_pair
TEXT, terminal dinucleotide sequences of the intron
is_minor
INTEGER, indicates if the intron is a minor intron (1) or not (0)
score
REAL, score representing the probability (0-100%) of the intron being minor
length
INTEGER, length of the intron in base pairs
transcript_id
INTEGER (foreign key referencing transcripts(id)), parent transcript
ordinal_index
INTEGER, ordinal position of the intron within the transcript (e.g., 3 for the third intron)
start
INTEGER, start position of the intron in the genome
end
INTEGER, end position of the intron in the genome
taxonomy_id
INTEGER (foreign key referencing genomes(taxonomy_id)), NCBI taxonomy identifier for species
scored_motifs
TEXT, motifs scored for the intron
phase
INTEGER, phase of the intron in the coding sequence (0, 1, or 2; null for introns outside the coding sequence)
in_cds
INTEGER, indicates if the intron is within the coding sequence (1) or not (0; e.g., UTR introns)
relative_position
REAL, relative position of the intron within the transcript (as a percentage of coding length)

30 rows where transcript_id = 720761
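The filter above corresponds to a plain equality query on `transcript_id`. The following is a minimal sketch using Python's standard `sqlite3` module against an in-memory table. The table here is a simplified subset of the real schema (fewer columns, no foreign keys), and the row for transcript 999999 is made up purely to show that the filter excludes it.

```python
import sqlite3

# In-memory stand-in for the real database (assumption: reduced column set).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        dinucleotide_pair TEXT,
        is_minor INTEGER,
        length INTEGER,
        transcript_id INTEGER,
        ordinal_index INTEGER
    )
    """
)

sample_rows = [
    # First three introns of transcript 720761, copied from the listing.
    (3821935, "GT-AG", 0, 48, 720761, 1),
    (3821936, "GT-AG", 0, 51, 720761, 2),
    (3821937, "GT-AG", 0, 58, 720761, 3),
    # Hypothetical row from another transcript (not real data).
    (1, "GT-AG", 0, 100, 999999, 1),
]
conn.executemany("INSERT INTO introns VALUES (?, ?, ?, ?, ?, ?)", sample_rows)

# Parameterized equivalent of "rows where transcript_id = 720761".
rows = conn.execute(
    "SELECT id, ordinal_index, length FROM introns "
    "WHERE transcript_id = ? ORDER BY id",
    (720761,),
).fetchall()

for row in rows:
    print(row)
```

Against the full database the same query would return all 30 introns of the transcript.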

id dinucleotide_pair is_minor score length transcript_id (label, id) ordinal_index start end taxonomy_id (species, id) scored_motifs phase in_cds relative_position
3821935 GT-AG 0 1.000000099473604e-05 48 rna-gnl|I4U23|002624-T1 720761 1 8302655 8302702 Adineta vaga 104782 TGG|GTTTGTTGAA...AGAATTTTGTCG/AAAAATTTCATC...TTCAG|GCT 0 1 0.987
3821936 GT-AG 0 1.000000099473604e-05 51 rna-gnl|I4U23|002624-T1 720761 2 8302797 8302847 Adineta vaga 104782 CTT|GTAAGAAATT...GATACTTTATTT/GAAGTTTTGATA...TTTAG|GTT 1 1 2.393
3821937 GT-AG 0 0.0007777215778802 58 rna-gnl|I4U23|002624-T1 720761 3 8302975 8303032 Adineta vaga 104782 TTC|GTACGTAGAT...TTTTTCTTTGTT/CAATAAATCATT...AAAAG|ATT 2 1 4.292
3821938 GT-AG 0 1.000000099473604e-05 69 rna-gnl|I4U23|002624-T1 720761 4 8303164 8303232 Adineta vaga 104782 ATG|GTAAGAGTAG...ATCTTTTTATCA/TATCTTTTTATC...AAAAG|GTA 1 1 6.251
3821939 GT-AG 0 0.000256114087308 59 rna-gnl|I4U23|002624-T1 720761 5 8303433 8303491 Adineta vaga 104782 TAT|GTAAATCTGA...TTTTTCTTCATT/TTTTTCTTCATT...TCTAG|TCA 0 1 9.242
3821940 GT-AG 0 1.000000099473604e-05 49 rna-gnl|I4U23|002624-T1 720761 6 8303675 8303723 Adineta vaga 104782 ATG|GTAAAAGATA...GAGTGTTTGATC/GAGTGTTTGATC...TTTAG|GTG 0 1 11.978
3821941 GT-AG 0 1.406197478578873e-05 53 rna-gnl|I4U23|002624-T1 720761 7 8303772 8303824 Adineta vaga 104782 CAA|GTAGTTGTTT...TGGGTTTTCATG/TGGGTTTTCATG...TTTAG|ATT 0 1 12.696
3821942 GT-AG 0 3.792094531974254e-05 144 rna-gnl|I4U23|002624-T1 720761 8 8303911 8304054 Adineta vaga 104782 CAA|GTAAGTTGAA...CAGTTTTTGACG/ATTCTATTCATG...TTTAG|TTT 2 1 13.982
3821943 GT-AG 0 0.0003711829352495 62 rna-gnl|I4U23|002624-T1 720761 9 8304838 8304899 Adineta vaga 104782 TCG|GTATGAAGTT...TAGATTTTAACA/TAGATTTTAACA...TGTAG|ATA 2 1 25.692
3821944 GT-AG 0 0.0074242426778751 49 rna-gnl|I4U23|002624-T1 720761 10 8305077 8305125 Adineta vaga 104782 TAA|GTATGTTGAA...TGTTCATTATTG/TCATTATTGATC...TTTAG|ATT 2 1 28.339
3821945 GT-AG 0 1.000000099473604e-05 59 rna-gnl|I4U23|002624-T1 720761 11 8305228 8305286 Adineta vaga 104782 AAA|GTAAAGATCT...AAATTCTTTTCT/GTTTTTTTCAAA...TAAAG|GCG 2 1 29.864
3821946 GT-AG 0 1.000000099473604e-05 78 rna-gnl|I4U23|002624-T1 720761 12 8305408 8305485 Adineta vaga 104782 AAG|GTACGTAGAG...TTTTTTTTCTTC/AGTTAAATCATT...TCTAG|TTT 0 1 31.673
3821947 GT-AG 0 1.000000099473604e-05 55 rna-gnl|I4U23|002624-T1 720761 13 8305616 8305670 Adineta vaga 104782 GTG|GTATGACATT...ATCTTTCTAATT/ATCTTTCTAATT...TTCAG|GAG 1 1 33.617
3821948 GT-AG 0 8.810966256082479e-05 57 rna-gnl|I4U23|002624-T1 720761 14 8305763 8305819 Adineta vaga 104782 GCT|GTAAGATTTT...TGGTTCTCAATT/ATGGTTCTCAAT...TTTAG|TGT 0 1 34.993
3821949 GT-AG 0 4.832439411292482e-05 48 rna-gnl|I4U23|002624-T1 720761 15 8306025 8306072 Adineta vaga 104782 CCG|GTATGTACGA...TTGAATTTGAAT/TTGAATTTGAAT...TCTAG|GTG 1 1 38.059
3821950 GT-AG 0 1.7294310210568202e-05 60 rna-gnl|I4U23|002624-T1 720761 16 8306199 8306258 Adineta vaga 104782 CTG|GTTAGTTTTA...ACCTTCTTGATT/TTAAAACTAATT...TTTAG|ATA 1 1 39.943
3821951 GT-AG 0 0.0001036604457784 56 rna-gnl|I4U23|002624-T1 720761 17 8306581 8306636 Adineta vaga 104782 TGA|GTAAGTTCAA...TTTGCCGTGATG/ATTGATTTGAAT...TTTAG|ATT 2 1 44.758
3821952 GT-AG 0 0.0009286020109971 106 rna-gnl|I4U23|002624-T1 720761 18 8306822 8306927 Adineta vaga 104782 CAG|GTATTTATTT...TCTTTTTCAATA/TTCTTTTTCAAT...CATAG|GTG 1 1 47.525
3821953 GT-AG 0 1.000000099473604e-05 59 rna-gnl|I4U23|002624-T1 720761 19 8307606 8307664 Adineta vaga 104782 GTG|GTAGGAGTTT...ATTTCTTTCATA/ATTTCTTTCATA...TCTAG|ACG 1 1 57.664
3821954 GT-AG 0 1.000000099473604e-05 77 rna-gnl|I4U23|002624-T1 720761 20 8308474 8308550 Adineta vaga 104782 TCA|GTAAGTAAAT...TTTGCTTTACAA/ATAACATTCACT...TATAG|GTC 0 1 69.762
3821955 GT-AG 0 0.0003148876905948 63 rna-gnl|I4U23|002624-T1 720761 21 8308870 8308932 Adineta vaga 104782 AAG|GTAAACATTT...CAAGCTTTAATT/TTTAATTTGATC...TTTAG|GTA 1 1 74.533
3821956 GT-AG 0 0.0014142376563428 62 rna-gnl|I4U23|002624-T1 720761 22 8309359 8309420 Adineta vaga 104782 ACA|GTTTGTTATA...GTGTTTTTAGAG/AAATGTTTCAAT...TTCAG|GTT 1 1 80.903
3821957 GT-AG 0 1.000000099473604e-05 66 rna-gnl|I4U23|002624-T1 720761 23 8309548 8309613 Adineta vaga 104782 TCG|GTAATGATAA...TTTTCCTTGTTG/CAATTCTTTACA...CATAG|AGG 2 1 82.802
3821958 GT-AG 0 1.000000099473604e-05 53 rna-gnl|I4U23|002624-T1 720761 24 8309747 8309799 Adineta vaga 104782 AAA|GTAAAAACAA...GATATTTCAATG/AGATATTTCAAT...TATAG|ATA 0 1 84.791
3821959 GT-AG 0 1.000000099473604e-05 59 rna-gnl|I4U23|002624-T1 720761 25 8310018 8310076 Adineta vaga 104782 CGG|GTAGAAGAAA...TTCTTTTGGATT/TTTGGATTAAGA...TTTAG|TTA 2 1 88.051
3821960 GT-AG 0 4.840599280860761e-05 53 rna-gnl|I4U23|002624-T1 720761 26 8310180 8310232 Adineta vaga 104782 TTT|GTAATTGATT...TTGTTCTTTTTT/AAGATTTTCATA...TTTAG|ATA 0 1 89.592
3821961 GT-AG 0 1.000000099473604e-05 225 rna-gnl|I4U23|002624-T1 720761 27 8310371 8310595 Adineta vaga 104782 GAT|GTAAGAATCA...TTTCCAATGAAT/ACTAAATTGAAT...TTTAG|GAA 0 1 91.655
3821962 GT-AG 0 1.9647196843759483e-05 56 rna-gnl|I4U23|002624-T1 720761 28 8310729 8310784 Adineta vaga 104782 ATC|GTAAATAACA...AAAATTTTGAAA/ACGAAACTTATT...TTTAG|GAC 1 1 93.644
3821963 GT-AG 0 0.0011916318826325 53 rna-gnl|I4U23|002624-T1 720761 29 8310849 8310901 Adineta vaga 104782 AAA|GTCTTTTAAA...TTTCTATTAGCA/TCTTTTTCTATT...TCTAG|ATT 2 1 94.601
3821964 GT-AG 0 3.9718656807373485e-05 52 rna-gnl|I4U23|002624-T1 720761 30 8311252 8311303 Adineta vaga 104782 TTC|GTAAGTGACG...TTTTCCGTAACT/AATATTTTGATC...AGCAG|ATC 1 1 99.836
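In the rows above, `length` matches `end - start + 1`, which suggests that `start` and `end` are inclusive coordinates. The sketch below checks this for the first three introns and also computes the gap between consecutive introns. Treating that gap as the length of the intervening exon is an assumption, not something the page states. The helper `intron_length` is a hypothetical name introduced for illustration.

```python
# (start, end, length) for the first three introns in the listing.
introns = [
    (8302655, 8302702, 48),
    (8302797, 8302847, 51),
    (8302975, 8303032, 58),
]


def intron_length(start: int, end: int) -> int:
    """Length of an interval whose start and end are both inclusive."""
    return end - start + 1


# True if every reported length agrees with the inclusive-coordinate formula.
lengths_ok = all(intron_length(s, e) == n for s, e, n in introns)

# Number of bases strictly between one intron's end and the next intron's start
# (assumed here to be the intervening exon).
gap_lengths = [
    introns[i + 1][0] - introns[i][1] - 1 for i in range(len(introns) - 1)
]

print(lengths_ok)
print(gap_lengths)
```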

CREATE TABLE "introns" (
  "id" INTEGER,
  "dinucleotide_pair" TEXT,
  "is_minor" INTEGER,
  "score" REAL,
  "length" INTEGER,
  "transcript_id" INTEGER,
  "ordinal_index" INTEGER,
  "start" INTEGER,
  "end" INTEGER,
  "taxonomy_id" INTEGER,
  "scored_motifs" TEXT,
  "phase" INTEGER,
  "in_cds" INTEGER,
  "relative_position" REAL,
  PRIMARY KEY ([id]),
  FOREIGN KEY([transcript_id]) REFERENCES [transcripts]([id]),
  FOREIGN KEY([taxonomy_id]) REFERENCES [genomes]([taxonomy_id])
);
CREATE INDEX [idx_introns_transcript_id]
    ON [introns] ([transcript_id]);
CREATE INDEX [idx_introns_taxonomy_id]
    ON [introns] ([taxonomy_id]);
CREATE INDEX [idx_introns_phase]
    ON [introns] ([phase]);
CREATE INDEX [idx_introns_is_minor]
    ON [introns] ([is_minor]);
CREATE INDEX [idx_introns_dinucleotide_pair]
    ON [introns] ([dinucleotide_pair]);
CREATE INDEX [idx_introns_score]
    ON [introns] ([score]);
CREATE INDEX [idx_introns_in_cds]
    ON [introns] ([in_cds]);
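The indexes above let SQLite answer filters such as `transcript_id = 720761` without scanning the whole table. As a rough sketch, `EXPLAIN QUERY PLAN` can confirm which index is chosen. The table below is an empty, reduced stand-in for the real schema (an assumption), and the exact wording of the plan text varies between SQLite versions, so the code only checks for the index name.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE introns (
        id INTEGER PRIMARY KEY,
        transcript_id INTEGER,
        taxonomy_id INTEGER,
        is_minor INTEGER
    );
    CREATE INDEX idx_introns_transcript_id ON introns (transcript_id);
    """
)

# Each plan row is (id, parent, notused, detail); the detail text names the index.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT * FROM introns WHERE transcript_id = ?",
    (720761,),
).fetchall()

plan_details = [row[3] for row in plan]
uses_index = any("idx_introns_transcript_id" in detail for detail in plan_details)

for detail in plan_details:
    print(detail)
```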