Facets (new session)
Description
Metadata
Settings
owl:sameAs
Inference Rule:
b3s
b3sifp
dbprdf-label
facets
http://dbpedia.org/resource/inference/rules/dbpedia#
http://dbpedia.org/resource/inference/rules/opencyc#
http://dbpedia.org/resource/inference/rules/umbel#
http://dbpedia.org/resource/inference/rules/yago#
http://dbpedia.org/schema/property_rules#
http://www.ontologyportal.org/inference/rules/SUMO#
http://www.ontologyportal.org/inference/rules/WordNet#
http://www.w3.org/2002/07/owl#
ldp
oplweb
skos-trans
virtrdf-label
None
About:
Genome re-sequencing and reannotation of the Escherichia coli ER2566 strain and transcriptome sequencing under overexpression conditions
Goto
Sponge
NotDistinct
Permalink
An Entity of Type :
schema:ScholarlyArticle
, within Data Space :
covidontheweb.inria.fr
associated with source
document(s)
Type:
Academic Article
research paper
schema:ScholarlyArticle
New Facet based on Instances of this Class
Attributes
Values
type
Academic Article
research paper
schema:ScholarlyArticle
isDefinedBy
Covid-on-the-Web dataset
title
Genome re-sequencing and reannotation of the Escherichia coli ER2566 strain and transcriptome sequencing under overexpression conditions
Creator
Yu, Hai
Xia, Ningshao
Liu, Liqin
Chen, Tingting
Huang, Yang
Li, Jiajia
Li, Yuqian
Gu, Ying
Li, Shaowei
Zheng, Qingbing
Zhou, Lizhi
Wang, Yingbin
Ma, Yue
Kong, Zhibo
Wang, Kaihang
source
PMC
abstract
BACKGROUND: The Escherichia coli ER2566 strain (NC_CP014268.2) was developed as a BL21 (DE3) derivative strain and had been widely used in recombinant protein expression. However, like many other current RefSeq annotations, the annotation of the ER2566 strain was incomplete, with missing gene names and miscellaneous RNAs, as well as uncorrected annotations of some pseudogenes. Here, we performed a systematic reannotation of the ER2566 genome by combining multiple annotation tools with manual revision to provide a comprehensive understanding of the E. coli ER2566 strain, and used high-throughput sequencing to explore how the strain adapted under external pressure. RESULTS: The reannotation included noteworthy corrections to all protein-coding genes, led to the exclusion of 190 hypothetical genes or pseudogenes, and resulted in the addition of 237 coding sequences and 230 miscellaneous noncoding RNAs and 2 tRNAs. In addition, we further manually examined all 194 pseudogenes in the Ref-seq annotation and directly identified 123 (63%) as coding genes. We then used whole-genome sequencing and high-throughput RNA sequencing to assess mutational adaptations under consecutive subculture or overexpression burden. Whereas no mutations were detected in response to consecutive subculture, overexpression of the human papillomavirus 16 type capsid led to the identification of a mutation (position 1,094,824 within the 3′ non-coding region) positioned 19-bp away from the lacI gene in the transcribed RNA, which was not detected at the genomic level by Sanger sequencing. CONCLUSION: The ER2566 strain was used by both the general scientific community and the biotechnology industry. Reannotation of the E. coli ER2566 strain not only improved the RefSeq data but uncovered a key site that might be involved in the transcription and translation of genes encoding the lactose operon repressor. We proposed that our pipeline might offer a universal method for the reannotation of other bacterial genomes with high speed and accuracy. This study might facilitate a better understanding of gene function for the ER2566 strain under external burden and provided more clues to engineer bacteria for biotechnological applications.
has issue date
2020-06-16
(
xsd:dateTime
)
bibo:doi
10.1186/s12864-020-06818-1
has license
cc-by
sha1sum (hex)
9f22ea01b862c47c118e1b0d27f233357d7ad0fe
schema:url
https://doi.org/10.1186/s12864-020-06818-1
resource representing a document's title
Genome re-sequencing and reannotation of the Escherichia coli ER2566 strain and transcriptome sequencing under overexpression conditions
has PubMed Central identifier
PMC7296898
schema:publication
BMC Genomics
resource representing a document's body
covid:9f22ea01b862c47c118e1b0d27f233357d7ad0fe#body_text
is
schema:about
of
named entity 'RNAs'
named entity 'hypothetical'
named entity 'current'
named entity 'genome'
named entity 'adaptations'
named entity 'Escherichia coli'
named entity 'NON-'
named entity 'RECOMBINANT PROTEIN EXPRESSION'
named entity 'COMBINING'
named entity 'SUBCULTURE'
named entity 'EXTERNAL'
named entity 'missing'
named entity 'RNAs'
named entity 'type'
named entity 'coding sequences'
named entity 'capsid'
named entity 'RNA'
named entity 'Escherichia coli'
named entity 'noncoding RNAs'
named entity 'human papillomavirus 16'
named entity 'recombinant protein expression'
named entity 'gene'
named entity 'genome'
named entity 'Results'
named entity 'genetic background'
named entity '0.9'
named entity 'GeneMark'
named entity 'NGS'
named entity 'ncRNAs'
named entity 'Escherichia coli'
named entity 'RNA modification'
named entity 'high-quality'
named entity 'ncRNAs'
named entity 'polymorphisms'
named entity 'Sanger'
named entity 'non-coding RNAs'
named entity 'proteogenomics'
named entity 'web server'
named entity 'plasmid'
named entity 'mutation'
named entity 'tRNAs'
named entity 'RNA'
named entity 'sequence variants'
named entity 'spontaneous deamination'
named entity 'Assay'
named entity 'RefSeq'
named entity 'E. coli'
named entity 'rRNA'
named entity 'indels'
named entity 'E. coli'
named entity 'CTAB'
named entity 'mutation'
named entity '2.6'
named entity 'genetic change'
named entity 'lactose operon'
named entity 'protein'
named entity 'DNA-directed RNA polymerase'
named entity 'short-read sequencing'
named entity 'ribosomal RNA'
named entity 'mobile genetic elements'
named entity 'protein homology'
named entity 'gene'
named entity 'GenBank'
named entity 'genome sequence'
named entity 'thymine'
named entity 'transcription'
named entity 'tRNAs'
named entity 'genome'
named entity 'high-throughput'
◂◂ First
◂ Prev
Next ▸
Last ▸▸
Page 1 of 7
Go
Faceted Search & Find service v1.13.91 as of Mar 24 2020
Alternative Linked Data Documents:
Sponger
|
ODE
Content Formats:
RDF
ODATA
Microdata
About
OpenLink Virtuoso
version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2025 OpenLink Software