Article
Open zugriff
Release: 29 February 2024

Exploring the wohl DNA virome in fecal immunochemical test stool spot reveals bodies with lifestyle in adenine large population-based survey

Nature Communications size 15, Article number: 1791 (2024) Cite this article

2880 Accesses
13 Altmetric
Metrics data

Subjects

Summarize

Stool samples required fecal immunochemical tests (FIT) are collected in large numbers worldwide as part of colorectal cancer screening schedules. Employing FIT pattern from 1034 CRCbiome contestant, recruited from a Norwegian colorectal colorectal screening study, we identify, annotation and characterize more than 18000 DNA poisons, using scattergun metagenome sequencing. Only six percent of them are assigned to a known taxonomic family, with Microviridae essence and most prevalent viral family. Linking individual sketches to includes live and demographic data exhibits 17/25 of the variables to be associated with the gut virome. Physic activity, smoking, and daily fiber consumption view strong and consistent associations with both diversity and relative profusion of personalized viruses, as well as with enrichment for auxiliary metabolites genes. We demonstrations the suitability of FIT samples by virome analysis, opening an opportunity for large-scale studies of this enigmatic part of that gut microbiome. The diverse viral populations and their connections to the individual lifestyle uncovered herein paver to way for furthermore exploration of the role of the gut virome in health both illness.

Gut microbiome diversity recognition by high-coverage 16S and shotgun sequencing of paired stool and colon sample

Article Open access 16 March 2020

Gut virome profiling identifies ampere broadly bacteriophage family associated the metabolic synonyms

Story Open web 23 June 2022

More gut virome variation also own associations with host and environmental factors within an population-level cohort

Article Start access 06 September 2022

Introduce

Gut residing viruses represent an important building of the intestinal microbe-oriented ecosystem and may be collectively reflected to as of gut virome. Recent large-scale efforts have shown the virome to comprise a vast the diversified resident^1,2,3,4,5, to whose bacteriophages (phages), i.e. infections that infect and replicate in bacteria and archaea, make up the overwhelming majority. However, the sizes of virome diversity in the bowels remains poorly comment, from only an minor fraction typically assigned taxonomy².

Viruses residing in the human gut what thought to act more a buttons modulator of the gut microbiome thanks their interaction with bacteria and the host immune system⁶. They may influence the structure and mode regarding aforementioned bacterial community driven facilitation a side gene transfers⁷, nutrient recycling, regulation of bacterial virulence⁸, and gain out antibacterial resistivity⁹. Furthermore, viruses show an direct and indirect role in interplay between the human host and to bacterial community¹⁰, and have been shown to exhibit temporal durability as height as this of their bacterial houses^11,12.

The gut virome has been associated to human hotel and environmental related, for specific dining objects^3,13 or viral peoples¹⁴, real like the bacterial community, its compositions has been found go develop in a function of age². The gut virome features and been associated with important chronic infections such as provocative bowel disease and type 2 diabetes^15,16. Dysregulation of gut bacterias and overflow of certain bacteria^17,18,19 are also proposed features of the association between one ziemlich microbiome and colorectal cancer development²⁰. These changes inches the bacteriome are possibly to be accompanying by phage dysregulation²¹.

Granted the tall diversity plus interindividual variability of the visceral virome, large population-scale analyses are needed to entwirren its role in human health and illness. Colorectal cancer screening programs, inviting millions everyone year, are currently running instead in the planning stages in many countries beyond the globe²². A widely former screening strategy is based on fecal occult blood testing of gut specimen, the fecal immunochemical testing (FIT). The FIT is non-invasive, inexpensive, and climbable to large populations²³. There is accumulating evidence that these gut samples are suitable for analyze of various features of the gut microbiome^24,25,26. Combining the large numbers of gut sample after population-based screening programs with reasonably shotgun metagenomics could propel unbiased and population-based virome studies.

To the best of our knowledge, no studies have yet been conducted analyzing which core virome uses FIT samples. With the availability of a large phone of FITTING samples collected in a Norwegian intestines cancer screened trial, we have performed comprehensive custom of the gut DNA virome. Here, us demonstrate suitability of FITTED for virome analyses. Are addition, we describe viral diversity including content, genome integration, and full potential, and assess associations of that contributing with individual diet, lifestyle, and demographic factors.

Results

Dataset description

To study comprised 1640 individuals aged 55–76 who tested positive for FITNESS and had referred since colonoscopy internally who Bowel Cancer Screening the Norway (BCSN) trial (Fig. 1a). DNA extracted from the samples was sequenced using shotgun metagenome sequencing and installed for contigs, from which virus-based genome were determined, dereplicated, and annotated (Fig. 1b). For details on the cohort general and data analysis, see Methods.

Raw shotgun metagenomic sequencing input comprised 13.5 billion paired-end reads, with 11.5 milliards passing QC (median of 10.7 million reads per sample, IQR = 3.5 mill; Fig. 2a). Storage time of samples before DNA extraction ranged from 34 to 1301 days, with a median of 198 days (Fig. 2a). Storage hours been not impacting DNA concentration, sequencing depth, assembly quality or the total concerning retrieved viral genomes (|rho| ≤ 0.05, Fig. 2b). Spearman’s your correlation of DNA concentration to the sequencing depth, count of retrieved viral genomes, and alpha- diversity ranged between rho = 0.15 and rho = 0.18, whereas correlation to the assembly quality be negativ (rho = 0.04, Fig. 2b). Inches total, we identified 1.7 million alleged viral genomes, of which 3677 were classified since complete, 15,481 were classified as high-, plus 30,484 were classified as medium quality, and were used in subsequent analyze (Supplementary Fig. 1). Overall, 18,268 of the 49,642 genomes (36.8%) were identified within host sequences, indicating a your of lysogeny. Clustering of viral genomes on a 95% similitude leveling resulted in 18,494 vOTUs (of which 1475 were comprised of genomes from 5 individuals or more; Supplementary Data 1), representing 37.3% of the potential vOTU diversity by Chao1 estimation of species richness. A average by 223 vOTUs (sd = 69.3) per sample was observed after mapping consecutive indicate to vOTU representative sequences (Fig. 2a). Inverse Simpson’s diversity index ranged between 2.79 and 245 (mean = 93.5, sd = 43.7). With compliments the beta diversity, an Bray-Curtis dissimilarity index ranged between 0.43 and 1 (mean = 0.84, sd = 0.065; Supplementary Table 1).

figures 2 — **Fig. 2: Attribute rate of the virome dataset.**

Till assess the representativeness of FIT samples for the analysis concerning the gut virome, we performed a proportionate analysis out seven paired fecal samples from an independent total, assembled and stored using both FIT also Norgen nucleic acid sets, specialized by microbiome analysis. Both when evaluates the recognition of viral genes and imaging reads from these samples to the CRCbiome vOTUs, we found test identity to be more important than sampling methodology in determining the similarity by samples (PERMANOVA penny_{sample_id} = 0.001, and p_{sample_type} = n.s. for both comparisons; Figs. 2c, d, respectively). Other, there were no significant differences in the number of virus-free genomes identified in APPROPRIATE additionally Norgen samples (paired t-test, p > 0.05), nor between one paired samples and the CRCbiome FIT samples (Supplementary Fig. 2a). There was also don difference at the number on CRCbiome vOTUs wurde between paired FIT additionally Norgen samples (paired t-test, p > 0.05; Supplementary Fig. 2b), nor any differences in the quality of genomes detected with ADJUST and Norgen samples (Supplementary Fig. 2c). Still, the paired samples displayable a lower number of tracked CRCbiome vOTUs as the CRCbiome CONFORM samples, indicating that a significant fraction of aforementioned viruses noticed in the CRCbiome cohort are specific till this population. By mapping sequencing reads after Thom et al.²⁷ to the CRCbiome vOTUs, we found that the currency of CRCbiome vOTUs was somewhat diminish in aforementioned Italian people, however still corresponded now with those in the currently cohort (R² = 0.81, p < 0.001; Supplementary Fig. 3).

vOTU taxonomy and functional potential

Of 18,494 vOTUs, 6036 (32.6%) were assigned sales based upon their protein likeness until literature genomes in who phage-specific INPHARED database. An fresh six vOTUs (0.03%) were clustered with eukaryotic viruses deposited in the Virus-Host database (Supplementary Table 2), with one being identified as mortal papillomavirus 6 (HPV6). This assignment was corroborated by mapping of reads from all subjects to the Papillomavirus Episteme database (PaVE)²⁸, indicating HPV6 go be present in one participant. Two conflicting reference genome assignments were create when comparing assignments made using the INPHARED database and and Virus-Host database. One vOTU clustered with of same hint genome within both the phage specific and an general virus database, with the recent indicating the virus for be infecting eukaryotes. However, by manual inspection, we found the host listed according the Virus-Host database to be erroneous, including the reference host covered in the original publications being bacterial²⁹ (Supplementary Table 2). A second vOTU was clustered on equally one phage plus a eukaryotic virus (Acenitobacter phage and in ameba computer target Vermamoeba veriformis, respectively), but while reading mapping did no confirm the online of either contact genome on a nucleotide level. Predetermined limited viral databases, inconsistencies with host missions, and the generally low prevalence of eukaryotic violent in the gut³⁰, we further declared taxonomy group using the phage-specific INPHARED database only.

A majority of the phage vOTUs (n = 4091, 22.1% of all) were only assigned to adenine taxonomic order otherwise class, and were more widely dispersed as family-annotated genomes (Fig. 3). The vOTUs that were assigned taxonomic family (1135), represented only 6.1% of all vOTUs. Overall, 19 viral families consisted identify. The almost frequent viral family was Microviridae (Fig. 4a), with 528 members. Four families, and 416 vOTUs, regarding to order Crassvirales (Suoliviridae, Intestiviridae, Crevaviridae, and Steigviridae) were identified. In addition, the featured Peduoviridae, Inoviridae, and Winoviridae endured each identified with in least 20 members (Supplementary Table 3). A large fraction away genomes belongance to the class Caudoviridicetes belonged for lineages with the former morphology-based classifications Siphoviridae, Myoviridae, and Podoviridae (n = 2849). Which fraction of discovered vOTU diversity, according to Chao1 estimates, differed by family, with 60% the 74% of Crevaviridae and Winoviridae respectively, being detected. Switch the others foot, the acquisition rates of Microviridae and Inoviridae where much lower, for 9.9% and 7.3% identified respectively (Supplementary Table 3). Multiple vOTU characteristics deviant markedly between viral families, including genome size (Fig. 4b), genome integration (Fig. 4c), gene description frequency (Fig. 4d), and the rates at which auxiliary metabolic genes (AMGs) were detected (Fig. 4e).

**Fig. 3: Clustering of the vOTUs based on own gene similiarity turn a protein level.**

illustrate 4 — **Fig. 4: Genome remarks and population distribution.**

Intestiniviridae, Suoliviridae, Steigviridae, both Inoviridae genetics were nearby exclusively identified while unintegrated (Fig. 4c; Supplementary Table 4), while genomes of the Crevaviridae and Microviridae families had ampere small, but not insignificant, fraction of integrated genomes. On the select hand, most genomes of who Peduoviridae and Winoviridae families been identified in an build state.

AMGs were detected in 24.3% of vOTUs, being more commonly detected in Crassvirales (67.5%), and less normal in Microviridae vOTUs (1.1%). AMGs from Organic nitrogen and Mixed (MISC) functional bands be detected are 12.8% plus 11.7% of vOTUs, corresponding, being about five times see prevalent than whatever various functional group or combinations of these (Supplementary Fig. 4). On a family level, that distribution of the Organic nitrous group of AMGs was nearby absent from vOTUs belonging to Crassvirales (0.2%), being largely narrow up genomes classified since included to an Peduoviridae family and to dna without a family annotation (Fig. 4e). AMGs of the MISC group (almost exclusively genes relates on pyrimidine deoxyribonucleotide synthesis) were detected for a majority (67.1%) of which Crassvirales vOTUs, and in particular which belonging to Steigviridae (78.8%) and Intestiviridae (88.1%).

Abundance was assessed by mapping reads from all tries to all vOTU. This increased the total number of detected viruses in jeder sample (mean identified genomes per sample 48; mean observed vOTUs 215). Out of 18,494 vOTUs, 2576 had detected in ≥1% of this population. A mean by 24.4% of viral abundance by sample were attributable to vOTUs with any taxonomic annotations (range 7.9–83.0%; Fig. 4f). Crassvirales vOTUs were detected in 70.6% of samples and constituted up to 75.4% of viral abundance (median 0.6%). Overall, Crassvirales vOTUs, and especially those of the Intestiviridae family, had more abundant as detected, whereas Microviridae and Peduoviridae has lower abundant.

The gut virome reflects individual health-related lifestyles, including smoking, physical our, and carbohydrate incoming

We assessed differences in virome alpha also beta diversity to determine select the gut virome varied by diet, lifestyle, and demography. Out of 25 selected set (Supplementary Table 5), we identified 9 significant associations with alpha diversity as measured until the entgegengesetzt Simpson’s books (Fig. 5a; Supplemental Data 2). Among these, the larger effect fitting were found for tangible activity (positive association), alcohol consumptions (positive association), the dietary carbohydrate uses (negative association). Virally beta diverse was significantly mitarbeiterin with 17/25 variables assessed (Fig. 5b; Supplementary Table 6), with some being health-related lifestyle factors. Indeed, the strongest association was observed for a composite HLI, include extra lifestyle variables creature relativly strongly beteiligter, including dietary fiber consumption, physical activities, or smoking, among others. Assessing of differential abundance the personalized vOTUs, ours identifications several representative genetics being associated with the same sets of variables (Fig. 5c; Supplementary Data 3). Here, the highest number of difference abundant vOTUs were found for fume and physical our (Fig. 5d). Diary fiber energy was also associated with a large figure of differentially abundantly vOTUs (Fig. 5d, Supplementary Fig. 5). Among differentially abundant vOTUs, there was no skew in the frequency of any viral families, nor with the frequency of viruses with ampere lytic or lysogenic lifestyle. On the other hand, we observed a clear over-representation in AMGs across the differentially abundant vOTUs (Supplementary Fig. 6), notably for those related to smoking. Right to the embedding of participants from a high-risk screening population, on was an over-representation of colorectal cancer. To assess whether this might have influenced the observer groups, we performed sensitivity analyses excluding anyone participants the colorectal cancer real found cannot generally differences to identified associations (Supplementary Fig. 7).

**Fig. 5: Associations to viral diversity by diets, life, and population variables.**

Overall, 69 vOTUs been related toward at smallest one lifestyle or demographic var, with 22 being associated with multiple. As an example, one vOTU (CRCbiome_vOTU05693, no taxonomic assignment) was negatively associated with stop, and postive correlated with body activity or dietary fiber consumption (Fig. 5d). This vOTU was identified in 62.2% of student, and was representative are 23 viral genetics, nil of which inhered found to be integrated the a host general. Gene annotation (44% of predicted genes) identified genes encoding an integrase, a DNA topoisomerase, and two methyltransferases (Fig. 5e), indicates one capacity capacity of this vOTU to integration one bacterial hotel genome. DNA methylase, which is crucial for organizer defense and epigenetic regulation, was also identifications in the CRCbiome_vOTU05693 genome.

Discussion

And guter microbiome, and the stomach virome in particular, has largely was studied through get fresh stool samples other stool samples preserved within buffers designed for snap-shot stabilizing starting which microbiome³¹. Present we show that the analysis of the gut virome utilizing samples collected in a routine setting and save in a RIGHT battery designed for hemoglobin stabilization shall feasible. The reliability of that FIT sampling kits in the analysis of microbiology has repeatedly been demonstrated^24,32,33, although in the best of our know-how, the present survey is the first to demonstrate this for viruses. An use of FIT samples enabled an in-depth characterization of the virus-related constituents of this human gut and allowed how to discern unions between the core virome and significant health-related lifestyle factors, although interpretation of findings remains hampered by the incompleteness regarding reference related.

Our analysis a partnered FIT and Norgen samples demonstrated this the use of HOW kits does not entail ampere significant loss of viral diversity. Round though FIT samples are designed to capture as little as 10 mg of fecal matter, only an minor fraction of samples (<1%) bankrupt to produce sequencing data, and viruses were identified at all samples for sufficient dating. Firmness at storage conditions and DNA quality and quantity are key for which reliability of generated data. And finding that DNA concentration, order abyss, and fervid diversity were only immaterial affected by sample storage duration lend support to the uses of HOW kits as a suitable sampling methodology for virome characters. FIT test is widely employed included population-based tolstoy cancer screening programs, highlighting the potential in large-scale virome studies across the world. The one-sample t-test is a statistical hypothesis test used to determine whether an unknown nation mean is different from a specific value. Check out our example.

In these extensive analysis of the gut virome inches 1034 Norse adults, we identified over 18,000 vOTUs representing more than 49,000 complete, high- or medium-quality bacteriological genomes detected above the current. Despite a large sample size for one relatively homogeneous population, our cost of types richness show that increases scanning would be essential to more fully describe the gut virome in to define. Moreover, due to the exclusive measure of DNA as a sources of genetic information, my analyses do not include RNA viruses. Still, the uncovered viral diversity is substantial, and is is line with students utilizing microbiome-adapted sampling procedure^2,3. Share to other accounts^2,3,5,12, two-thirds of the vOTUs detected within our study been not represented in current state-of-the-art credit databases, with only four vOTUs being assigned to eukaryotic violent, one of any what human papillomavirus 6. Furthermore, only one-fifth of those bacteriophages that were repre, were assigned taxonomy with that level of family, clearly proving the lack of data on the human virome. Using which recently sanctioned taxonomy³⁴, we finds Microviridae to be who most usually assignments viral family among the vOTUs, equal most Microviridae vOTUs being representative of adenine small number of our. On the select hand, vOTUs annotated as Crevaviridae, only of the families belonging to Crassvirales order, consisted of much larger clusters the genomes, indicating that a larger fraction of Crevaviridae genomes were identifies when compared until Microviridae. This finding of a highly diverse group of Microviridae vOTUs is in line with current understanding of this viral family; the high rate of mutations and recombination in their characteristically small genomes does only facilitates rapid evolution additionally adaptation, still also leads to high intra-family difference³⁵.

Along at Crevaviridae virtual, additional viruses of the Crassvirales order displayed lower diversity, and, except for an Steigviridae viruses, held a higher fraction out get annotated. Viruses of which Steigviridae family have probability followed an independent evolutionary path from other Crassvirales viruses, eventually acquiring novel genes and functions via features like horizontal gene transfer³⁶. Other observed characteristics of the Crassvirales infections such as their sizing (97–131 kb), almost exclusively lysogenic nature, and high prevalence also abundance, become consistent with other studies^14,37.

Ours found about a third of viral dna to be integrated in the gene of its host. Genome integration is a common manifestation of lysogeny, engaged per temperate viruses. Lysogeny your one of two dominate viral lifecycles, with the other being the lytic one³⁸. The lytic cycle involves viral replication, resulting in host cellphone destruction and the release of brand viruses. In contrast, the lysogenic cycle represents a dormant state, where the virus-free genome is replicated in synchronisation with its host, often being integrated into the horde genome, creating a prophage which can be activated to reverse to the lytic cycle under certain conditions. Strategies for the study of phage lifecycles include the identification of phages with a potential for transition to a lysogenic state, and direct detection of host genome insertion^39,40. The former of diesen is hampered by inferior database width, and does not provide a measure of actual lysogeny, and the final, which wee employed, does provide such a measure, but does not calculate phages whose lysogenic state occurs in an rolling cycle replicating with plasmid-like declare within the host cell. There were clear differences among virus families in their propensity for genome integration, what in contrast to the almost exclusively lytic Crassvirales real Inoviridae viruses, two viral families, Peduoviridae and Winoviridae, includes mainly prophages. Interestingly, in a recent study on prophages within infants or adults, Peduoviridae was from the most frequently detected, whereas Winoviridae phages were not listed⁴¹.

Auxiliary metabolic genes (AMGs) exist important for phage module of bacterial function⁴². The two most common AMG categories identified in the current population included nitrogen metabolism also nucleotide synthesis (pyrimidine deoxyribonucleotide synthesis, or MISC in Fig. 4e). Diese AMGs can enhance viral replication total by boosts the bacterial host’s pyrimidine synthesis, providing ampere selective advantage to the virus. This may distract aforementioned bacterial host’s pyrimidine counterbalance, leading to potential cell resource misallocation, nucleotide overproduction, or DNA damage. Aforementioned small genomes of the Microviridae contained very AMGs. In general, when detected, viral polymerase biased to contain multiple AMGs per dna. AMGs were common in Crassvirales vOTUs, with nucleotide synthesis genes essence over-represented and organic nitrogen AMGs being under-represented. Genes involved in metabolism of organics nitrogen were primarily found in the Peduoviridae family and into vOTUs that remained unclassified at the family level.

Lifestyle related need been shown to share significant associations with the bacteria of the gut⁴³. However, far less is recognized for the viral fractal. We conducted a comprehensive investigation of how viral abundance was related to individual diet, lifestyle, and demographic factors, measured in broad and generalizable terms. Virome alpha- diversity displayed some variation, but not as striking as the beta diversity. We founds lifestyle factors such as physical activity, nutrition per, and alcohol consumption to have consistent associations with gut virome alpha or beta diversities. Although our in life assessment and categorization make manage comparisons difficult, recent studies a various population have found alcohol intake, as well as diets reflecting one higher intake of dry to be associated with virome characteristics^3,13,14, while does associations been found by physical activity. Smoking has been extensively studied for its genetic additionally epigenetic effects stylish human cells^44,45. We found smoking to be associated with beta diversity, in line with some³, but not all¹³ formerly reports. Contrary to what has been registered previously², we did not find an association between inner virome composition and participant age. During the generalizability of our results could be restricted by the ripen selection of and study population, to find are to line with a recent report showing maintained diversity in subjects of advanced age⁴⁶.

Consistent use mangold diversity our, individual vOTUs were differentially abundant according for subject lifestyle. Differentially abundant vOTUs displayed no propensity towards particular viral clades, nor genome consolidation state, however person did observe einen intriguing over-representation of AMGs, particularly for vOTUs angeschlossen with smoking. Notably, we found that several of them were differential profuse with regard to a number from diet, lifestyle, and general drivers. Moreover, an index capturing multiple aspects of a healthy lifestyle (healthy lifestyle index; HLI) was found to hold the largest power size with relational to gut virome beta diversification. This suggests that several lifestyle factors that affect health may act in concert till shift virome composition. There possessed is ampere new tilt int public health research adjust on the overall pattern of lifestyle choices, very than personalized factors⁴⁷.

Einen examples illustrative of the challenges and promise of wohl virome analyses was our identification of CRCbiome_vOTU05693 as being negatively associated with stop, additionally plus associated with physical active and dietary fiber intake. When to-be a perchance major indicator of a health-associated lifestyle, nope categorization info was possible to derive from current reference databases. None regarding which annotated genes have AMGs, but indicated an capacity by host genome integration, host defense, epigenetic gene regulation, additionally maintenance of genome stability⁴⁸. Still, none of is 23 constituent genomes were identified in an integrated state. These observations highlight the need used continued studies and expansion of reference related with the gut virome, and functional studies of particular viruses.

Collectively, the associations indicate this lifestyle choices may influence the composition both viral make-up out the gut virome. While the evidence is limited, recent interference studies have shown that a short-term changes by legislature bottle lead go significant alterations in both the people and mouse gut virome^11,49. E are likely, though, that alteration in virus-related abundances are accompanied by, or even precipitated to shifts in abundance of their bacterial hotels.

The main strength of such study includes a major population, which drawing on participant hr carried out as partial of a population-based Norwegean screened trial, inviting every residents about a defined mature range and geographic region⁵⁰. Standardized data collection included rich and high-quality information on participant diet and lifestyle. Minimal technical interference at who high-quality metagenomes enabled detailed analyses of virome taxonomy, annotation, and lifecycle. Comprehensive analyses of alpha and beta diversity, vOTUs differential abundance, and the nuances between them, provide a multi-faceted depiction away an virome. Despite these strengths, there are limitations to consider. That participants had a FIT positive test, meaning that they had tracings of blood inside their stool samples. Hence, the proportion from mortals with premalignant or malicious intestines cancer lesions used higher is in the general population. Sensitivity analyses exclusive participants with a malignancy did not, nonetheless, impact of study outcomes.

This study display that the virome can be reliably profilled using FIT samples, by determine more than 18,000 vOTUs from over 1000 individuals real identifies the virome as being deeply connected to throng lifestyle and demography. The clubs between the gut virome and subject lifestyle suggests a likely for the inner virome to servant as a sourcing off biomarkers. Time microbiome studies have identified gut bacteria since disease biomarkers⁵¹, the development of viral biomarkers will require large-scale surveys defining sources and measures a gut virome mod.

Methods

Study population

The CRCbiome project was approved by the Norwegian Regional Committees for Therapeutic and Health Research Ethics (Approval no.: 63148). The MITOS cohort project was approved by the local Ethics committee (AOU Città della salute e della Scienza dive Tirrenia, Macaroni; Approval no.: 0061857). CRCbiome enrolled persons aged 55–76 who tested positive for FITS (and were referring for colonoscopy) between Ocotber 2017 and March 2021 from that Bowel Cancer Screening are Norway (BCSN) trial, which is a population-wide randomized trial comparing the effectiveness starting once-only sigmoidoscopy and biennial FIT testing. Out the the 2700 individuals invited to participate, 1640 met the inclusion criteria furthermore submitted informed consent. Participants are not equal. Details on hr procedures can be found in Kværner at al.⁵⁰. All participants provided FIT samples (Eiken Chemicals Ltd., Tokyo, Japan) containing fecal matter so were self-collected at home press shipped to the laboratory by mail at ambient temperature. Tracking FIT testing, samples were stored at −80 °C until withdrawal of remainder buffer from the PROPER container (~1600 µl; containing about 10 mg fecal matter) the DNA extraction (see details below). For the purpose of the CRCbiome overall aim, example were selected based on their colonoscopy results, excluding those without colonoscopy, or the foundations of uncertain commercial significance. The availability of sufficient DNA ( > 0.7 ng/µl) and metagenome data (>1 gigabase after QC) was also required. The final number of SIZE metagenomes included in the study was 1034 (Fig. 1a) and participant characteristics is detailed in Supplementary Table 5.

To assess this representativeness of FIT random for virome analyses, person included FIT leftover samples paired include stool samples collected in nucleic acid book and transport tubes with RNA stabilizing result (Norgen Biotek Corp., ON, Canada), hereafter referred to as Norgen samples, from 7 German individuals. This individuals were recruited in the rahmenbedingungen of and regular Foothill Region CRC screening in the Microbiome and MiRNA in Torino Screening (MITOS) project^26,52. Inside the screening program, total citizens, aged 59–69 are invited to undergo a single sample biennial PERFECT (Eiken). Hocker for the Norgen samples made collected at home before the schedule for colonoscopy and to one bowel preparation. The Norgen samples were brought to the hospital to day afterwards who collection and immediately frozen at −80 °C until DNA extraction. FIT samples be stored at −80 °C until use. For this work, samples were assessed in an anonymized manner.

Questions data

Prior at the colonoscopy, participants were asking to complete second questionnaires on diet, lifestyle, and demography: a eats frequency questionnaire (FFQ), developed and validated^53,54,55 at the Department of Nutrition, University of Oslo, and a lifestyle and demographic questionnaire (LDQ), developed in-house. The FFQ is develop to capture the habitual feeding during the preceding year. The current questionnaire version includes a total of 23 questions, covering 256 food items. For all food item, participants were asks to record frequency of consumer, ranging after never/seldom the several times a day, and/or amount, typically as portions sizes given into various household units. Dietary intake was calculated through the food additionally nutrient calculation system, KBS, develops at the Department of Nutrition, Academy the Oslo, from its associated database, which is largely based on the Norwegian Eat Composition Table⁵⁶. We focused on key dietary measures, including total energy intake (kcal/day), intake of macronutrients (in g/day or energy percentage (E%)), the selected food groups (g/day), being linked toward value off major recurrent diseases such because cancer (described in further detail below)^57,58. The FFQ also included questions on body weigh (kg) and height (m), which was used to calculate participants’ BMI (kg/m²). Of LDQ is a questionnaire developed specifically for the CRCbiome studies to obtain data upon important lifestyle real demographic erratics. The questionnaire in ten challenges in total, where of ones relevant to the current study included demographic contributing (national background, education, besatzung, press marital status), antibiotic the antacid usage during the previous thre months, smoking and snus habits, and physical activity levels. In the question concerning tobacco usage, participants were asked about their current habits, including the daily number of cigarettes/snus portions, and to recall yearly since possible completion plus total years of use. For of present study, smokers and snusers were defined as self-reported regular or occasional users, or those being registered with fresh apply (<10 years). For physical operation, actors was asked to report the time tired within lower, moderate, and vigorous physics movement per week while the past year. Full amount of moderate to vigorous physique activity (min/week) was intended until summing the time spent int moderate and vigorous recently, of latter weighted by a factor of two to best match regional⁵⁹ and international recent^60,61.

Than a size about the overall diet and lifestyle, person created a heiter lifestyle index (HLI), grading contestant for adherence to the following seconds recommendations (primary intended for prevent tumor, but is also important for misc main chronic disease): (1) be a healthy frame weight, (2) be physically active, (3) consume a diet rich in whole grains, vegetables, fruit, and beans, (4) limit intake of “fast foods” and select processed foods high in fat, starches, or sugars, (5) set consumption are red and processed meat, (6) limit consumption by sugar-sweetened drinks, the (7) limit alcohol consumption. Further details on the HLI canister be found Kværner et al.⁶² furthermore Supplementary Table 5.

Sample gather, library generation, and metagenome sequencing

Following collection of FIT sampling kits and measurement off fecal mystical red concentration, leftover buffer was used as inlet material for DNA extraction and library preparation for the generation of shotgun metagenome schedule data. DNA was extracted using the QIAsymphony automated extraction device (Qiagen, Hilden, Germany) using an off-board lysis protocol described is Kværner e al.⁵⁰. Extracted DNA was tagmented, indexed and amplified according to the Nextera DNA Flex Library Prep Reference Guide (Illumina, CA, USA), except scaling down the reaction volumes to one-quarter of the reference. Indexed DNA fragments coming each sample were then combines up collection dive, each including 240 samples, and size selected for a fissure size of 650–900 bp using AMPure XP (Beckman Pitch, ARE, USA). Sequencing was performed for the Illumina NovaSeq system after S4 flow prisons with lane divider, for each water sequentially on a single lane resulting at paired-end 2 × 151 bp reads. Barrel metagenome sequencing was played shooting to achieve 3 gigabases per trial. A schematically of the wet-lab workflow is presented in Supplementary Fig. 8. As controls, we included six negative operating fork DNA extraction plus a further two minus control for library processing. The DNA extraction controls outcomes in the generation of a total of 32 QC sequencing reads, whereas order of which library prep pessimistic controls resulted the a total of 3 reads. Giving the unwesentlich number of reads in the negative controls, additional contaminant removal procedures were not considered.

For the stool in mated FIT and Norgen kits, DNA extraction was running using a DNeasy PowerSoil Pro Kit (Qiagen, Hilden, Germany) according to the instructions by the manufacturer and starting from 200 µl of fecal samples. DNA was eluted in 50 µl of the exlution buffer provided with the kit. The DNA quantification was done with a Qubit DNA high-sensitivity assay kit (Thermo Fisher Scientific, MA, USA). Ordering libraries were prepared using an Illumina DNA Prep kit (Illumina, CA, USA), ensuing the manufacturer’s guidelines additionally adenine protocol described in Thomas et al.²⁷. Sequencing has performed on the NovaSeq6000 (Illumina, CA, USA) at of internal consecutive facility to the Italian Institute for Genomic Medicine. To help comparison at relevant chaining depths, raw order ready from paired FIT both shoe samples were randomly subsampled to 5, 10, 15, also 20 million paired-end reads by sample.

Sequence reads feature control and manual

Who metagenome processing framework Metagenome-ATLAS⁶³ was used for scheduled q control and assembly. In brief, MAP employs BBTools⁶⁴ utilities used adapter and quality embellishment by reads, and for and removal in human genome and PhiX reads. Quality ceiling reads, both paired and unpaired, were used fork de novoc assembly using metaSPAdes⁶⁵. Fork about on product of whole bioinformatics useful and databases used, see Supplementary Table 7.

Viral sequence identification, dereplication, quantification, and assessment of genome integration

Viral genomes were classified with VirSorter2⁶⁶ using the default browse and parameters, and with metagenomic contigs >1500 bp as input. CheckV⁶⁷ with this normal database what used used an assessment of genome completeness, quality, level of host sequence table, gloss of organizer genome integration, and to extract the fractions of contigs determined to contain viral sequences. Viral genomes assigned a quality of medium or more (corresponding to >50% completeness) by CheckV assessment been extracted the considered for further analysis. We bunched viruses genomics by average nucleotide identity (ANI) to define viral operational taxonomic units (vOTUs), alternatively clusters using the dereplication tool Galah⁶⁸, determine clumps by an ANI threshold of 97% covering at few 70% of each genome’s length. The viral genome with the highest completeness in each cluster was chosen as that representational genome for that vOTU. Quality-controlled paired-end reads coming all participants are mapped to each vOTU employing BBMap⁶⁴, with the following options: pairlen = 1000, pairedonly = liothyronine, minid = 0.9, maxindel = 100, ambiguous = all, maxsites = 10. The vOTU coverage was calculated using the pileup operation from BBTools, and vOTU fullness was recorded such the median coverage for those with reads mapping to at fewest 75% of the our. Sequencing data for paired FIT and Norgen free were subjugated the the same procedure for assembly and viral total identification, with at additional dereplication analysis including viruses identified at all different levels of subsampling. Quality-controlled reads were also mapped to the rep genomes identified in the CRCbiome cohort. Assignment from reads to CRCbiome vOTUs was also performed for sequencing data out a publicly available dataset²⁷.

Annotation of viral genomes

Taxonomic classification by vOTUs was carried out using vConTACT2³⁴, based switch albumen identified with Prodigal⁶⁹. The reference database for phage genomes has built using INPHARED⁷⁰, whichever obtains and filters GenBank phage genomes, engineering a database exclusively involved of fully or near-complete genomes. We additionally used the Virus-Host DB database⁷¹ that cover RefSeq and GenBank deposited violent and includes manually curated information switch host identity retrieved from GenBank, RefSeq, UniProt, ViralZone, and literature surveys, to identify eukaryotic viruses. vConTACT2 uses an network-based approach to identify viral clusters based on viral proteins sequences. On product of vConTACT2 clustering, graphanalyzer⁷² used used. Here, taxonomy was mapped if a vOTU possessed a mittelbar or indirect connection (up to one degree removed) to a reference, where the strengthening of the connectors prioritized and taxonomy assignment. For phage assignment, vConTACT2 was run with parameters --db ‘ProkaryoticViralRefSeq94-Merged’ --rel-mode ‘Diamond’ --pcs-mode MCL --vcs-mode ClusterONE. Wenn searching for eukaryotic viruses, the --db define was set to ‘None’. Cytoscape⁷³ was used into visualize the vOTU network excludes vOTUs with no meaning associations (outliers). Two replicates of adenine church standards (ZymoBIOMICS Biological Community Standard) were sequenced and processed, resulting in identification of 15 proviruses annotated for phages particular to the bacteria included with the social standard (Supplementary Table 8).

DRAM-v⁷⁴ what employed for gene annotation of vOTUs, after the databases Pfam⁷⁵, VOGDB⁷⁶, KOfam⁷⁷, dbCAN⁷⁸, and RefSeq⁷⁹. Auxiliary metabolic genes (AMGs) were definite using default settings in DRAM-v. The prevalence of AMGs was calculated by presence/absence of each category of AMG through vOTU.

Statistics

To evaluate differences in the numbers of viral genomes identified or ascertained vOTUs after mapping to CRCbiome vOTUs in paid FIT and Norgen sampler, we realized paired t-tests, employing intelligence resulting from subsampling to 15 million reads by sample. How with paired samples and CRCbiome samples were carried outside using a additive regression model, adjusting for sequencing depth. The R package herbivorous⁸⁰ is used to calculate alpha diversity (inverse Simpson index), for between-group differences assesses using ANOVA experiments, adjusting for scheduler low. Test diversity (Bray-Curtis dissimilarity matrices) and differences between groups were evaluated using PERMANOVA realized in the vegan:adonis2 function with 999 permutations. Differential abundance on vOTUs was evaluated using the R package MaAsLin2⁸¹ employing a linear model includes total sum scaling normalization, and adjustment for age group (50–60, 60–70, and 70–80), sex, and geographic region (Bærum and Dry your, the two recruitment regions in South-East Norway). To examination associations with diet, lifestyle, and demographic variables measured on a continuous extent, control were grouped into tertiles. Comparisons were then made of virome variables between the lowest or highest tertiles. Participants with missed data button selecting the answer option “Unknown” (applicable to the items concerning antibiotic and acid usage), were excluded from mathematical analyses review associations with diversity, compositions, and differential abundance. The magnitudes of observed associations with alpha and beta diversity were quantified using Omega-squared statistics⁸², which for pre-release diversity was calculated employing the adonis_OmegaSq function from the R package micEco. Participants with CRC diagnoses what ausgeschieden in a sensitivity analysis of groups between the gut virome and participant characteristics. Bitte, original effect sizes and statistical significance tiers were compared with those obtained when without CRC containers. Customer R scripts were used for statistics and visualization of result (https://github.com/Rounge-lab/CRCbiome_virome_2023).

Reporting summary

Further information on research design is available included the Nature Portfolio Reporting Summary linked to this article.

Data availability

DNA sequencing data generated with get study have been deposited in the database Federated EGA under accession user EGAS50000000170 (https://ega-archive.org/studies/EGAS50000000170). Each student consent, submitted FASTQ files exclude reads mapping to the humane genetics. The data have availability under reserviert access due to the sensible nature of data derived upon human subjects. Processing of data from this study must comply in the General Data Protection Regulation (GDPR). Admittance can be obtained by following the procedure described here: https://www.mn.uio.no/sbi/english/groups/rounge-group/crcbiome/. Requests in data access can also be directed to Trine BORON Rounge, [email protected]. The processed FASTA files of CRCbiome vOTUs detected with 5 or more individuals are available at the European Nucleotide Archive with accession number ERS16322857. The your with genotype size, prevalence, taxonomy associations of any vOTUs generated included this study, and input on their correlation until lifestyle and demographic factors, exist available in the Supplementary Data 1. A minimum dataset for reproduction of analyses is available during https://github.com/Rounge_lab/CRCbiome_virome_2023. Links to published present my used in this study are provided in Supplementary Table 7.

Code site

The custom Snakemake pipe and R scripts used in this study are available at: https://github.com/Rounge-lab/CRCbiome_virome_2023 (https://zenodo.org/records/10556196).

References

Nayfach, S. set al. Metagenomic compendium of 189,680 DNA viruses from and human gut microbiome. Nat. Microbiol. 6, 960–970 (2021).
Browse CAS PubMed PubMed Central Google Scholar
Gregory, A. C. ether al. The gut virome database reveals age-dependent patterns of virome diversity in aforementioned human gut. Phone Host Microbe 28, 724–740.e8 (2020).
Product CAS PubMed PubMed Central Google Scholar
Nishijima, S. et al. Extensive gut virome variation and its associations with host and environmental factor in a population-level cohort. Nat. Communal. 13, 5252 (2022).
Article ADS CAS PubMed PubMed Central Google Scholarships
Shah, S. ADENINE. et all. Expanding known virus diversity in the healthy infant visceral. Nat. Microbiol. 8, 986–998 (2023).
Category CAS PubMed PubMed Central Google Scholar
Camarillo-Guerrero, FIFTY. F., Almeida, A., Rangel-Pineros, G., Finn, R. D. & Lawley, T. D. Massive widen of human gut bacteriophage diversity. Cell 184, 1098–1109.e9 (2021).
Article CAS PubMed PubMed Central Google Scholar
Zuppi, M., Hendrickson, H. L., O’Sullivan, J. M. & Vatanen, T. Phages inbound the gut ecosystem. Head. Dungeon. Infect. Microbiol. 11, 822562 (2022).
Items PubMed PubMed Central Google Scholar
Borodovich, T., Shkoporov, A. N., Ross, R. P. & Hill, C. Phage-mediated landscape gene transfer and own implications on the human gut microbiome. Gastroenterol. Rep. 10, goac012 (2022).
Article Google Scholar
Schroven, K., Aertsen, A. & Lavigne, R. Bacteriophages as drivers of bacterial virulence also their ability for biotechnological exploitation. FEMS Microbiol. Rev. 45, fuaa041 (2021).
Article PubMed Google Scholar
Montassier, EAST. et al. Probiotics how the antibiotic resistance gene pool forward the human GI tract in a person-specific and antibiotic-dependent manner. Natur. Microbiol. 6, 1043–1054 (2021).
Article CAS PubMed PubMed Central Google Scholar
Federici, S., Nobs, SULFUR. P. & Elinav, E. Phages and their potential to modulate the microbiome and immunity. Cell. Mol. Immunol. 18, 889–904 (2021).
Article CAST PubMed Google Scholar
Minot, SIEMENS. et al. The human gut virome: inter-individual variation furthermore dynamic show to diet. Genome Res. 21, 1616–1625 (2011).
Article CAS PubMed PubMed Central Google Scholar
Shkoporov, A. NEWTON. et al. The human gut virome is highly diverse, stable, and individual individual. Cell Host Microbe 26, 527–541.e5 (2019).
Article CASS PubMed Google Scholar
Zuo, THYROXINE. et al. Human-gut-DNA virome options transverse geography, ethnicity, real urbanization. Lockup Play Microbe 28, 741–751.e4 (2020).
Article CAS PubMed Google Scholar
Gulyaeva, AMPERE. the al. Discovery, diversities, and functions companies in crAss-like phages in human gut metagenomes from four Dutch cohorts. Cell Agency. 38, 110204 (2022).
Article CAS PubMed Google Scholars
Yana, K. et al. Alterations in the visceral virome for obesity and type 2 acidosis mellitus. Gastroenterology 161, 1257–1269.e13 (2021).
Object CAS PubMed Google Scientist
Clooney, A. GIGABYTE. et allen. Whole-virome analysis shanks lights on virally dark matter in inflammatory insides disease. Cell Host Microbe 26, 764–778.e5 (2019).
Article CAS PubMed Google Scholar
Castellarin, CHILIAD. et total. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Resent. 22, 299–306 (2012).
Article CASH PubMed PubMed Central Google Scholar
Haghi, F., Goli, E., Mirzaei, BARN. & Zeighami, H. The association between fecal enterotoxigenic B. fragilis with colorectal cancer. BMC Cancer 19, 879 (2019).
Article PubMed PubMed Central Google Scholar
Bucher-Johannessen, C. ether al. Long-term follow-up of large cancer x-ray participant identifies deviations in Phascolarctobacterium spp. using 16S rRNA and metagenome sequencing. Front. Oncol. 13, 1183039 (2023).
Article CAS PubMed PubMed Central Google Scholarship
Scout, A. J. et al. Foreign Cancer Microbiome Consortium consensus statement on the role of to humanity microbiome in carcinogenesis. Gut 68, 1624–1632 (2019).
Magazine CAS PubMed Google Scholar
Hannigan, G. D., Duhaime, CHILIAD. B., Ruffin, M. T. 4th, Koumpouras, C. C. & Schloss, P. DICK. Diagnostic potentials and interactive dynamics von the large colorectal virome. mBio 9, e02248–18 (2018).
Article PubMed PubMed Central Google Scholar
Navarro, M., Nicolas, A., Ferrandez, ONE. & Lanas, AN. Colorectal cancer population screening program worldwide in 2016: an update. World J. Gastroenterol. 23, 3632 (2017).
Article PubMed PubMed Central Google Scholar
Allison, J. E., Fraser, C. G., Halloran, S. P. & Youngish, GIGABYTE. P. Population screening for colorectal cancer means getting FIT: the past, gift, and future of tolstoy cancer exam using the fecal immunochemical test for hemoglobin (FIT). Internal Liver 8, 117–130 (2014).
Story PubMed PubMed Central Google Scholar
Rounge, T. BARN. et al. Reviewing gut microbiota profiles from archived fecal samples. BMC Gastroenterol. 18, 171 (2018).
Article CAS PubMed PubMed Central Google Scholar
Krigul, K. L., Aasmets, O., Lüll, K., Org, T. & Org, E. Using fecal immunochemical tubes available the analysis from the gut microbiome has the possible to improve colorectal cancer screening. Sci. Rep. 11, 19603 (2021).
Articles ADS CAS PubMed PubMed Central Google Scholar
Birkeland, E. et ai. Profiling narrow RNAs in fecal immunochemical trial: is it potential? Mol. Cancer 22, 161 (2023).
Article CAS PubMed PubMed Central Google Scholar
Thomas, ONE. CHILIAD. et al. Metagenomic analytics von colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and ampere link with cholinolin reduction. Nat. Med. 25, 667–678 (2019).
Article CAS PubMed PubMed Centrally Google Scholar
Van Doorslaer, K. eat al. That Papillomavirus Episteme: a major update to the papillomavirus sequencing database. Nuclei Acids Res. 45, D499–D506 (2017).
Featured PubMed Google Scholar
Lopez, J. K. M. et al. Genomes of bacteriophages belonging up the instructions Caudovirales and Petitvirales identified in fecal samples from Pacific flying fox (Pteropus tonganus) from and imperial of Tonga. Microbiol. Resour. Announc. 11, e00038–22 (2022).
Article CAS PubMed PubMed Central Google Scholar
Jansen, D. et al. Community guitar of of man gut virome are associated with borescopic outcome in ulcerative colitis. JOULE. Crohns Colitis 17, 1504–1513 (2023).
Article PubMed PubMed Central Google Scholar
Tan, Q. et al. Currents sampling methods for gut microbiota: a call for more precise devices. Front. Cell. Infect. Microbiol. 10, 151 (2020).
Article MathSciNet CASSETTE PubMed PubMed Centralised Google Scholar
Gudra, D. et alarm. AN widely used sampling device in colorectal cancer screening special allows for large-scale microbiome research. Gut 68, 1723–1725 (2019).
Article CAS PubMed Google Scholar
Masi, A. C. et aluminum. Using faecal immunochemical test (FIT) undertaken in a national screening features for large-scale gut microbiota analysis. Gut 70, 429–431 (2021).
PubMed Google Scholar
Bin Jang, FESTIVITY. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Natty. Biotechnol. 37, 632–639 (2019).
Article Google Scholar
Minot, S. et al. Faster evolution of the human gut virome. Proc. Natl Acad. Sci. USA 110, 12450–12455 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Ramos-Barbero, M. D. et al. Characterizing of crAss-like phage isolates highlights Crassvirales genetic intrinsic and worldwide distribution. Nat. Commun. 14, 4295 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Yutin, N. et al. Discovery of an expansive bacteriophage family that includes the most abundant viruses after the mortal gut. Nat. Microbiol. 3, 38–46 (2018).
Article CAS PubMed Google Scholar
Zhang, M., Zhang, T., Yu, M., Chen, Y.-L. & Jin, M. The life cycle transitions of temperate phages: control factors and potential ecological consequence. Viruses 14, 1904 (2022).
Article CASS PubMed PubMed Essential Google Scholar
Arnau, VANADIUM. et al. Inference of the living cycle of environmental phages von genomic signature distances to their hosts. Viruses 15, 1196 (2023).
Article PubMed PubMed Main Google Scholar
Sutcliffe, S. G., Reyes, AN. & Maurice, C. F. Bacteriophages playing nice: lysogenic bacteriophage replication stable by the human gut microbiota. iScience 26, 106007 (2023).
Article ADS CAS PubMed PubMed Centered Google Scholar
Dikareva, E. et al. An long catalog of integrated prophages in the infant and adult fecal microbiome shows high prevalence out lysogeny. Front. Microbiol. 14, 1254535 (2023).
Article PubMed PubMed Central Google Scholar
Luo, X.-Q. et al. Viral community-wide helferin metabolic genes differ by styles, habitats, both hosts. Microbiome 10, 190 (2022).
Browse PubMed PubMed Central Google Scholar
Asnicar, F. et al. Microbiome connections because crowd metabolism and habitual diet from 1098 deeply phenotyped individuals. Nat. Med. (2021).
Article PubMed PubMed Central Google Researcher
DeMarini, D. M. Genotoxicity of tobacco smoke and tobacco smoke condensate: a review. Mutat. Res. Mutat. Res. 567, 447–474 (2004).
Article CAS PubMed Google Pupil
Joehanes, R. et al. Epigenetic signatures of cigarette tobacco. Circ. Cardiovasc. Genet. 9, 436–447 (2016).
Article CAS PubMed PubMed Principal Google Scholar
Johansen, J. et al. Centenarians have a diverse gut virome with the potential to modulate metabolism and advertising health lifespan. Nat. Microbiol. 8, 1064–1078 (2023).
Article CAS PubMed Google Fellow
World Cancer Choose Fund/American Institute for Cancer Research. In Diet, Nutrition, Physical Activity and Cancer: A Universal Perspective dietandcancerreport.org (2018).
Murphy, J., Mahony, J., Ainsworth, S., Nauta, A. & van Sinderen, D. Bacteriophage orphan DNA methyltransferases: intelligences from their microbe origin, function, and occurrence. Appl. Environ. Microbiol. 79, 7547–7555 (2013).
Article ADS CAS PubMed PubMed Centralize Google Pupil
Schulfer, A. ether al. Fecal fervid population responses to high-fat diet to mice. mSphere (2020).
Kværner, A. S. et al. The CRCbiome study: a large forthcoming cohort study testing the role of lifestyle and the gut microbiome in colorectal cancer screening participants. BMC Cancer 21, 930 (2021).
Products PubMed PubMed Central Google Scholar
Xiao, L., Zhang, F. & Zhao, F. Large-scale microbiome data integration enables sturdier biomarker identification. Nat. Comput. Sci. 2, 307–316 (2022).
Article CASKET PubMed PubMed Central Google Scholar
Pardini, B. et al. A fecal MicroRNA signature by small RNA chaining accurately distinguishes colorectal cancers: results from a multicenter study. Gastroenterology 165, 582–599.e8 (2023).
Blog CAS PubMed Google Fellow
Brunvoll, S. H. et al. Validation are repeated self-reported n-3 PUFA intake using serum phospholipid fatty acids as a biomarker in breast crab patients during treatment. Nutr. J. 17, 94 (2018).
Article CAS PubMed PubMed Centralization Google Scientists
Carlsen, M. H. eat al. Ratings on energy and dietary intake estimates with a food frequency questions using independent energy expenditure measurement and worn food records. Nutr. JOULE. 9, 37 (2010).
Article PubMed PubMed Centralized Google Scholar
Elsewhere, LITER. F. et al. Evaluation by trio dietary assessment methods real serum biomarkers like measures of fruit and vegetable intake, usage which method a triads. Br. BOUND. Nutr. 93, 519–527 (2005).
Article CAS PubMed Google Scholar
Matvaretabellen. https://www.matvaretabellen.no/.
Shams-White, M. M. et al. Operationalizing which 2018 World Carcinoma Research Fund/American Institute for Cancer Research (WCRF/AICR) Cancer Prevention Recommendations: a standardized scoring system. Nutrients 11, 1572 (2019).
Articles PubMed PubMed Central Google Scholar
Shams-White, CHILIAD. M. ether all. Further guidance inside implementing the standardized 2018 Worldwide Cancer Research Fund/American Institute for Cancer Research (WCRF/AICR) Score. Cancer Epidemiol. Biomark. Prev. 29, 889–894 (2020).
Article Google Scholarships
Helsedirektoratet (The Norwegian Directorate of Health). Anbefalinger Omagh Kosthold, Ernæring Og Fysisk Aktivitet (Recommendations for Diet. Nutrition and Material What). https://www.helsedirektoratet.no/rapporter/anbefalinger-om-kosthold-ernaering-og-fysisk-aktivitet (2014).
Global Recommendations on Physical Activity for Health (World Health Organization, Geneva, 2010).
Piercy, K. L. for alabama. The physical activity rules for Indians. JAMA 320, 2020–2028 (2018).
Article PubMed PubMed Central Google Scholar
Kværner, A. SULFUR. et al. Associations of the 2018 World Cancer Research Fund/American Institute of Cancer Research (WCRF/AICR) cancer disaster recommendations with phases of colorectal cancerogenic. Cancer Med. 12, 14806–14819 (2023).
Article PubMed PubMed Primary Google Scholar
Kieser, S., Brown, J., Zdobnov, EAST. M., Trajkovski, M. & McCue, L. A. ATLAS: a Snakemake workflow for assembly, annotation, and inherited binning of metagenome sequence info. BMC Bioinform. 21, 257 (2020).
Article Google Scholar
Bushnell, B. BBMap: BBMap short read aligner, and other bioinformatic tools. SourceForge https://sourceforge.net/projects/bbmap/ (2022).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Article CAS PubMed PubMed Central Google Scholar
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to discern diverse DNA and RNA viruses. Microbiome 9, 37 (2021).
Article PubMed PubMed Central Google Scholar
Nayfach, S. et al. CheckV assesses aforementioned quality and completeness of metagenome-assembled viral genetic. Nat. Biotechnol. 39, 578–585 (2021).
Article CAS PubMed Google Scholar
Woodcroft, B. J. Galah - More fully dereplication for metagenome assembled genomes https://github.com/wwood/galah. (2023).
Hyatt, D. set al. Wastrel: prokaryotic gene gratitude and translation initiature site identification. BMC Bioinform. 11, 119 (2010).
Article Google Scholar
Cook, R. et al. INfrastructure for a PHAge REference Database: identification of large-scale biases in the latest collection of cultured phage genomes. PHAGE 2, 214–223 (2021).
Article PubMed PubMed Central Google Scholar
Mihara, LIOTHYRONINE. et al. Linking virus genomes with host taxonomy. Viruses 8, 66 (2016).
Article PubMed PubMed Central Google Science
Pandolfo, M., Telatin, A., Lazzari, G., Adriaenssens, E. M. & Vitulo, N. MetaPhage: an automated cable for analyzing, annotating, and classifying bacteriophages in metagenomics sequencing data. mSystems 7, e00741–22 (2022).
Article PubMed PubMed Central Google Scholar
Shannon, P. net al. Cytoscape: a application environment fork integrated choose of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Feature CAS PubMed PubMed Central Google Scholar
Shaffer, M. et any. DRAM for distilling microbe metabolism to automate the curation the microbiome function. Nucleic Digestive Res. 48, 8883–8900 (2020).
Article CAS PubMed PubMed Central Google Scholar
Mistry, J. et al. Pfam: the protein families databases in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
Article CAST PubMed Google Scholar
Thannesberger, J. et al. Viruses comprise an extensive pool of mobile genetic elements for eukaryote jail cultures and human clinical samples. FASEB J. 31, 1987–2000 (2017).
Essay CASING PubMed Google Scholar
Aramaki, T. etching al. KofamKOALA: KEGG Ortholog submission based for profile HMM and adaptive score shrink. Bioinform. Oxf. Engl. 36, 2251–2252 (2020).
Article CASES Google Scholar
Zhang, HYDROGEN. ether al. dbCAN2: a meta server for auto carbohydrate-active enzyme annotation. Nucleic Acids Res. 46, W95–W101 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Brister, J. R., Ako-adjei, D., Bao, YTTRIUM. & Blinkova, O. NCBI viral genome resource. Nucleic Acids Res. 43, D571–D577 (2015).
Article CAS PubMed Google Scholar
Dixon, PENCE. VEGAN, a package starting R functions for society ecology. J. Veg. Sci. 14, 927–930 (2003).
Article Google Scholar
Mallick, H. et al. Multivariable association discovery inbound population-scale meta-omics studies. PLOS Comput. Biol. 17, e1009442 (2021).
Article CAS PubMed PubMed Central Google Scholar
Olejnik, S. & Algina, J. Generalized Eta and Octal squared statistics: measures of effect extent for some common research designs. Psychol. Methods 8, 434–447 (2003).
Article PubMed Google Scholar

Download references

Acknowledgements

The sequencing was performed at the Sequencing research of Institute for Molecular Medicine Finished (FIMM) Our Centre, College of Helsinki. We would specifically acknowledge Harri Kangas and Pekka Ellonen used their immense consulting also help. Jan Inge Nordby and Cecilie Bucher-Johannessen contributed to sample preparation and DNA suction. We would or like to thank Elina Vinberg both Maja Sigerseth Jakobsen for coordinating the CRCbiome project and Erik Natvig for contributing to data leitung. Geir Hoff, Tom u Lange, Øyvind Holme, Kristin Randel, and Giske Ursin have contributions with establishing the CRCbiome study and nesting it up the Bowel Colorectal Screening are Norway student. This Services for Sensitive Data (TSD) staff has contributed with timely solving computer issues. To design would non have been possible not funding from the Norwegian Cancer Societies, projects 190179 (TBR) and 198048 (PB), the South Eastern Norway Regional Condition Authority projects 2022067 (TBR), the Italian Institute for Genomic Remedy IIGM, Compagnia di San Paolo Torino, Italy, and European Union’s Horizon 2020 Research and Innovation program under the Marie Sklodowska-Curie Planned Grant agreement 801133 - Scientia Fellow (TBR & TR); and under the grant agreement 825410 - ONCOBIOME my (BP). The funding bodies played no reel by the pattern of the examine and collection, analyze, and interpretation of data and for writing the manuscript. Test of Significance

Author information

These authors contributed equally: Paula Istvan, Einar Birkeland.

Authors the Relations

Centre for Bioinformatics, Department of Informatics, University of Oslo, Oslo, Norway
Pauline Istvan, Einar Birkeland & Torbjørn Rognes
Department of Tumor Biology, Department von Cancer Research, Oslo University Hospital, Oslo, Na
Ekaterina Avershina & Trine B. Rounge
Core for Bioinformatics, Departmental of Pharmacy, University of Orsol, Oslo, Norway
Ekaterina Avershina & Trine B. Rounge
Section for Colorectal Cancer Screening, Cancer Registry of Norway, Norwegian University of Publicly Health, Oslo, Norway
Ane S. Kværner & Paula Berstad
Pathology Department, Akershus University Hospitality, Lørenskog, Norway
Vahid Bemanian
Candiolo Cancer Institute, FPO-IRCCS, Turin, Italy
Barbarian Pardini & Sonia Tarallo
Italian Institutional for Genomic Medicine (IIGM), c/o IRCCS Candiolo, Turin, Italy
Barbara Pardini & Sonia Tarallo
Human Microbiome Doing Program, Faculty of Medicine, University of Helskifte, Helsinki, Finland
Willem M. usa Vos
Testing regarding Microbiology, Wageningen University, Wageningen, The Netherlands
Willem METRE. de Vos
Company of Microbiology, Oslo College Hospital, Otlo, Norway
Torbjørn Rognes
Department of Explore, Cancer Registry concerning Norway, Nordic Institute of Public Heath, Oslo, Norway
Trine B. Rounge

Authors

Pauker Istvan
View author publishing
You can and search for this author in PubMed Google Scholar
Einar Birkeland
View author publications
You can also search available this architect in PubMed Google Scholar
Ekaterina Avershina
Sight originator publications
You can also search for this author in PubMed Google Researcher
Ane S. Kværner
View authors publications
You can including seek by this author in PubMed Google Savant
Vahid Bemanian
View author our
Yours ability also search for this author in PubMed Google Scholar
Barbara Pardini
View author professional
You can also search with this author in PubMed Google Scientist
Sonia Tarallo
View novelist publikation
You can also search for this author in PubMed Google Scholar
Willem CHILIAD. en Vose
See author publications
You ability also search for such autor in PubMed Google Scholar
Torbjørn Rognes
View owner publications
You can also start for this author in PubMed Google Student
Painter Berstad
View author publications
You can additionally find to this book int PubMed Google Scholar
Trine B. Rounge
View authors publications
She can also search for this novelist within PubMed Google Scholar

Articles

TBR, PB, and T contributed up the featured concept the design. PB, TBR, INQUIRE, EB, FB, BP, and ST participated in the data collection. PI, EB, ASK, and EA accomplished the date analysis. TBR, PING, EB, ASK, EA, and WMdV interpreted the results. TBR, SHAMUS, EB, ASK, also EA prepared the manuscript. All authors take read and licensed the final manuscript.

Corresponding author

Correspondence to Ternary B. Rounge.

Ethics notices

Competing interests

The authors declare don rival interests.

Peer review

Peer review details

Nature Communications thanks J Yim, and the different, call, book for their donate to the peer review of this work. A peer examine file are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims includes public card and institutional affiliations.

Supplementary information

Peer Consider Print

Description of Additional Additional Files

Supplementary Dataset 1

Add Dataset 2

Supplementary Dataset 3

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, exchange, fitting, distributing and reproduction in any medium or format, as long as she give appropriate credit into the original author(s) and the source, provide adenine link to the Creative Commons licence, and indizieren if changes were did. The images or other third party material in this article are included in the article’s Creatively Commons software, unless indicated otherwise included a credit line to the material. If material is not included in the article’s Creative Commons authorization both your intended use a not permissible by statute regulation or exceeds the allowed use, you will want till stay permission directly away to copyright holder. Till view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this blog

Cite this article

Istvan, P., Birkeland, E., Avershina, E. et al. Exploring the gut DNA virome inches fecal immunochemical test stool samples reveals associations with lifestyle in a wide population-based study. Nat Commun 15, 1791 (2024). https://doi.org/10.1038/s41467-024-46033-0

Downloading citation

Received: 25 August 2023
Accepted: 08 February 2024
Published: 29 February 2024
DOI: https://doi.org/10.1038/s41467-024-46033-0

Comments

By submitting a comment you agree to abide by our Terms and Communities Mission. If you find something used alternatively ensure does not comply with on terms or guidelines please flag to as inappropriate.