About: Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified inhabitants of the healthy tissue flora as well as experimental contaminants. Unmapped sequences that co-occur with high statistical significance potentially represent the unknown sequence space where novel pathogens can be identified.

Facets (new session)
Description
Metadata
Settings
- owl:sameAs
- Inference Rule:

About: Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified inhabitants of the healthy tissue flora as well as experimental contaminants. Unmapped sequences that co-occur with high statistical significance potentially represent the unknown sequence space where novel pathogens can be identified. Goto Sponge NotDistinct Permalink

An Entity of Type : fabio:Abstract, within Data Space : covidontheweb.inria.fr associated with source document(s)

Attributes	Values
type	abstract
value	Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified inhabitants of the healthy tissue flora as well as experimental contaminants. Unmapped sequences that co-occur with high statistical significance potentially represent the unknown sequence space where novel pathogens can be identified.
Subject	Virology Infectious diseases DNA Molecular biology Forensic genetics Sequence spaces
part of	Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers
is abstract of	Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers
is hasSource of	covid:ann/target/377bfc19dc4e16e7b12881f905aa15158327010b covid:ann/target/a14a4a7508cd6a8cd1a14a263c0c31a1e8600b55 covid:ann/target/eee5cf60cedfc32b1812ec7c96f633357a715163 covid:ann/target/3a219f08156c9cc5b4f36c999db4e886a20fd8b6 covid:ann/target/f9cd6ed9bc2e4f82d5aa891739dac78c45655e8d covid:ann/target/23363b257b523ae8fe9432b9d9019c78faec9e31 covid:ann/target/90bfe3dd0ae762aa5fc350cebf7ba942dbd0d4b9 covid:ann/target/2f835f7a9714ff97d5e0d622d46951615e873c65 covid:ann/target/8f516202ea6016026da636b7e3d4648b1b45ff29 covid:ann/target/0f197ad8d3c2b5446101c4e067f566e0ea325a6e covid:ann/target/1f48d0e19d073c98d350bb01d43f9eb84f9ebbfb covid:ann/target/d73ec1a7438dc44c5f6632a847d56f739cb48b8c covid:ann/target/82ea19441f08dbdf30ecb1d6bf6daa6b923c1c93 covid:ann/target/2c22e20f5579c6c8502301ac1d13e43ac31d01d6 covid:ann/target/7a2903afe170a57b3739f1f1df2fbaa69359593d covid:ann/target/8a28ac52fd47cb3f0c789257e58cb0f01b76444e covid:ann/target/d6811e0445875e09d6414fc2d116bc5be62ea350 covid:ann/target/dba6019f69bdd7d0d70057c2013d019e748b5290 covid:ann/target/30d135b7851ad282cc1f5130d3f18ec66fa8233e covid:ann/target/58ef524e66625063b9f4a7666b8d966d1333bbd2 covid:ann/target/2fd9b49079fee0be9686c23218fbcd38db20b182 covid:ann/target/3c246b159ade2af0e5161121108be27be423c2fc covid:ann/target/bc8e3c2aa7530cf6e8a0eb93dfdd4fecaee27255 covid:ann/target/c7fe87bc59e809f63a39b4f3a20c02f8192d4be8 covid:ann/target/faeac57656b1cdd8d169a34305d3d6f80f44d6bf covid:ann/target/7faf560707e914359ed7bfb761fbf53e75fa39f8 covid:ann/target/b9a34b31f4b2fd0cfa159ca280c3f0f84c08734b covid:ann/target/0f616f1bbe819d81daac95aef7e4beab97466607 covid:ann/target/fa5d96aa50eb9c4d180c16d9f84e3c907346e7f8 covid:ann/target/cb4ec7e89d6ea9a7f110927f5cab5c43bef1d19f covid:ann/target/582f345c8f67df04b66278743919339505a45714 covid:ann/target/92b98e0a1e0fecd15faaa612a9208d77e70d54b9 covid:ann/target/b1a325924053ab5f3fdd64aae55924f4b62189e9 covid:ann/target/ed7caad440a2482b0eef731ad0fad2783fc8fdfc covid:ann/target/4aa69fe0a2f7bd457a46d4123a5ea4701f3f9bcb covid:ann/target/6c915a5a6bcc21fc6d563b887e7ce6bbca756f6f covid:ann/target/e4ab2fe13b48eb2d357b57b4d41c61b6aa766ab5 covid:ann/target/a10d47a511866735ee8c4e8ea72df9c5f9fe37ed covid:ann/target/08a914f6ed0290206c69888be95aec0444bf1f45 covid:ann/target/5635c896e63c3eb1ceaad63c80ea5e804d37ccc0 covid:ann/target/aa71c4cbd4fc1545e112d57431627c3bd2dca048 covid:ann/target/6b6a396d20e3e6181336134660ed573b4f5dfdff covid:ann/target/12b051aad99c2f5daff7717023177c96de94b90b covid:ann/target/07bab5273ce70528a8a901110baf1680ccaa107e covid:ann/target/2125484742b11d4766105c17fef9f8f477b59e8f covid:ann/target/624cfe4ebfc81cf91e086616f146c4efffb95837 covid:ann/target/6689f5d8eca56091c247cf1772dd9fa87c14d097 covid:ann/target/417250113e5af903d8f3ec5d6354849f8cc92335 covid:ann/target/575d885e348cedd97ddd5d110b393af4af958b0b covid:ann/target/6e7c60cb550b5235e72ddd1c9da239e6515a939a covid:ann/target/ed30de244f90aee36e23ef8ed2833e3cc377a024 covid:ann/target/26d0bf776360a7c20bf4170254593d1e4ea18d12 covid:ann/target/790386d8b83592f8019e6576d0666b55a8a7ce03 covid:ann/target/6da1234fd3b0814f8a5652741ae56a98e9c2823a covid:ann/target/7d99b1224fe42975e722fa6d4085cedd1c8afe1f covid:ann/target/81c3902f28e1bb5b0129233527f35bcfc1176d1e covid:ann/target/b82c12a17f310c920b65530f05e662ea2ac3b5bd covid:ann/target/6464445bbd1cc95254899fcde712a84d22af50db covid:ann/target/bff5fde622337f3aebd35fd01594630bff4e60ba covid:ann/target/7c34eed53b0b1f5a307f3db30381eb3564bd7d2b covid:ann/target/23a2f04ce064c1a9f3b623c3547003bff7209229 covid:ann/target/ff00a6b31bd37ea24b5465d82ab9ae18e4d33427 covid:ann/target/381d8551da1c754cef5696588ca45c06c5f495a4 covid:ann/target/9b68c92ee38b944505c053dab811c781d2b14f24 covid:ann/target/bd3b30da15ea24b5b5749096042eb90cd04e83c0 covid:ann/target/8b84eecb2a157a0b4df4cc29da53b52b2baa4ae1 covid:ann/target/3f6c1357c3ca55d9d7e213f9d9f5741c4f9b2c4e covid:ann/target/9f49f3b71b4a0db9f30d80062258b47cebe31f7b covid:ann/target/4b2eadf3bc46856642062a2941de8520c4730070 covid:ann/target/8398f25b6acab5e6eb79a47b39886d7f10e31196 covid:ann/target/ce4a0ff43e8bdbd873a7aab00e6c3e56b554d34a covid:ann/target/d888a20aef092802842cf04e84c3e7c5b1a95464 covid:ann/target/36f2b1834c548f99fc66bddf775740e19e682c4e covid:ann/target/b5b5d9623240ff167ad7f421a08faed32fc4deaa covid:ann/target/ee2b7c31067a284d4cdc2a15789b2a532a06e935 covid:ann/target/e47263af742df67e6b6d26f1f29f065c14d7b04d covid:ann/target/6138c665c99f92f73770a9f5c2d3b458dc6c2f83 covid:ann/target/d77fc27950609b08beb9d2924e6cceddf1ba8249 covid:ann/target/9e9723c754babf3f612bb6df19e5bbee713131d9 covid:ann/target/61b83c1310d869657e82653bc074f099e9b5898e covid:ann/target/7c82edffa2bad687871d1e1668bf2e17bec97ba0 covid:ann/target/d5c6bbce404d76db35fd2715550674192ece52c6 covid:ann/target/86faa84a3fb5b31128964de6b466d53445c4d71b covid:ann/target/48df9c3894375d69240e992b9b996490e3044712 covid:ann/target/b8ffbb1dd66a0662cdffc2463fb3437c7fd9bad1 covid:ann/target/3a2b0179636eeb3737b9c2e42f7f5938ed2eecec covid:ann/target/0fc715473033421dda0affc84515467fe895cbf2 covid:ann/target/3fed057111a2b065f17dbc3323a04e54ee06d80c covid:ann/target/e6568e9aa77793ffa4d6c36db3e7fab896b79330 covid:ann/target/66c40d8fda9c08ff458cffc309e7218669034152

Faceted Search & Find service v1.13.91 as of Mar 24 2020

Alternative Linked Data Documents: Sponger | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 07.20.3229 as of Jul 10 2020, on Linux (x86_64-pc-linux-gnu), Single-Server Edition (94 GB total memory)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2025 OpenLink Software