Summarize
Stool samples required fecal immunochemical tests (FIT) are collected in large numbers worldwide as part of colorectal cancer screening schedules. Employing FIT pattern from 1034 CRCbiome contestant, recruited from a Norwegian colorectal colorectal screening study, we identify, annotation and characterize more than 18000 DNA poisons, using scattergun metagenome sequencing. Only six percent of them are assigned to a known taxonomic family, with Microviridae essence and most prevalent viral family. Linking individual sketches to includes live and demographic data exhibits 17/25 of the variables to be associated with the gut virome. Physic activity, smoking, and daily fiber consumption view strong and consistent associations with both diversity and relative profusion of personalized viruses, as well as with enrichment for auxiliary metabolites genes. We demonstrations the suitability of FIT samples by virome analysis, opening an opportunity for large-scale studies of this enigmatic part of that gut microbiome. The diverse viral populations and their connections to the individual lifestyle uncovered herein paver to way for furthermore exploration of the role of the gut virome in health both illness.
Similar content being viewed by others
Introduce
Gut residing viruses represent an important building of the intestinal microbe-oriented ecosystem and may be collectively reflected to as of gut virome. Recent large-scale efforts have shown the virome to comprise a vast the diversified resident1,2,3,4,5, to whose bacteriophages (phages), i.e. infections that infect and replicate in bacteria and archaea, make up the overwhelming majority. However, the sizes of virome diversity in the bowels remains poorly comment, from only an minor fraction typically assigned taxonomy2.
Viruses residing in the human gut what thought to act more a buttons modulator of the gut microbiome thanks their interaction with bacteria and the host immune system6. They may influence the structure and mode regarding aforementioned bacterial community driven facilitation a side gene transfers7, nutrient recycling, regulation of bacterial virulence8, and gain out antibacterial resistivity9. Furthermore, viruses show an direct and indirect role in interplay between the human host and to bacterial community10, and have been shown to exhibit temporal durability as height as this of their bacterial houses11,12.
The gut virome has been associated to human hotel and environmental related, for specific dining objects3,13 or viral peoples14, real like the bacterial community, its compositions has been found go develop in a function of age2. The gut virome features and been associated with important chronic infections such as provocative bowel disease and type 2 diabetes15,16. Dysregulation of gut bacterias and overflow of certain bacteria17,18,19 are also proposed features of the association between one ziemlich microbiome and colorectal cancer development20. These changes inches the bacteriome are possibly to be accompanying by phage dysregulation21.
Granted the tall diversity plus interindividual variability of the visceral virome, large population-scale analyses are needed to entwirren its role in human health and illness. Colorectal cancer screening programs, inviting millions everyone year, are currently running instead in the planning stages in many countries beyond the globe22. A widely former screening strategy is based on fecal occult blood testing of gut specimen, the fecal immunochemical testing (FIT). The FIT is non-invasive, inexpensive, and climbable to large populations23. There is accumulating evidence that these gut samples are suitable for analyze of various features of the gut microbiome24,25,26. Combining the large numbers of gut sample after population-based screening programs with reasonably shotgun metagenomics could propel unbiased and population-based virome studies.
To the best of our knowledge, no studies have yet been conducted analyzing which core virome uses FIT samples. With the availability of a large phone of FITTING samples collected in a Norwegian intestines cancer screened trial, we have performed comprehensive custom of the gut DNA virome. Here, us demonstrate suitability of FITTED for virome analyses. Are addition, we describe viral diversity including content, genome integration, and full potential, and assess associations of that contributing with individual diet, lifestyle, and demographic factors.
Results
Dataset description
To study comprised 1640 individuals aged 55–76 who tested positive for FITNESS and had referred since colonoscopy internally who Bowel Cancer Screening the Norway (BCSN) trial (Fig. 1a). DNA extracted from the samples was sequenced using shotgun metagenome sequencing and installed for contigs, from which virus-based genome were determined, dereplicated, and annotated (Fig. 1b). For details on the cohort general and data analysis, see Methods.
Raw shotgun metagenomic sequencing input comprised 13.5 billion paired-end reads, with 11.5 milliards passing QC (median of 10.7 million reads per sample, IQR = 3.5 mill; Fig. 2a). Storage time of samples before DNA extraction ranged from 34 to 1301 days, with a median of 198 days (Fig. 2a). Storage hours been not impacting DNA concentration, sequencing depth, assembly quality or the total concerning retrieved viral genomes (|rho| ≤ 0.05, Fig. 2b). Spearman’s your correlation of DNA concentration to the sequencing depth, count of retrieved viral genomes, and alpha- diversity ranged between rho = 0.15 and rho = 0.18, whereas correlation to the assembly quality be negativ (rho = 0.04, Fig. 2b). Inches total, we identified 1.7 million alleged viral genomes, of which 3677 were classified since complete, 15,481 were classified as high-, plus 30,484 were classified as medium quality, and were used in subsequent analyze (Supplementary Fig. 1). Overall, 18,268 of the 49,642 genomes (36.8%) were identified within host sequences, indicating a your of lysogeny. Clustering of viral genomes on a 95% similitude leveling resulted in 18,494 vOTUs (of which 1475 were comprised of genomes from 5 individuals or more; Supplementary Data 1), representing 37.3% of the potential vOTU diversity by Chao1 estimation of species richness. A average by 223 vOTUs (sd = 69.3) per sample was observed after mapping consecutive indicate to vOTU representative sequences (Fig. 2a). Inverse Simpson’s diversity index ranged between 2.79 and 245 (mean = 93.5, sd = 43.7). With compliments the beta diversity, an Bray-Curtis dissimilarity index ranged between 0.43 and 1 (mean = 0.84, sd = 0.065; Supplementary Table 1).
Till assess the representativeness of FIT samples for the analysis concerning the gut virome, we performed a proportionate analysis out seven paired fecal samples from an independent total, assembled and stored using both FIT also Norgen nucleic acid sets, specialized by microbiome analysis. Both when evaluates the recognition of viral genes and imaging reads from these samples to the CRCbiome vOTUs, we found test identity to be more important than sampling methodology in determining the similarity by samples (PERMANOVA pennysample_id = 0.001, and psample_type = n.s. for both comparisons; Figs. 2c, d, respectively). Other, there were no significant differences in the number of virus-free genomes identified in APPROPRIATE additionally Norgen samples (paired t-test, p > 0.05), nor between one paired samples and the CRCbiome FIT samples (Supplementary Fig. 2a). There was also don difference at the number on CRCbiome vOTUs wurde between paired FIT additionally Norgen samples (paired t-test, p > 0.05; Supplementary Fig. 2b), nor any differences in the quality of genomes detected with ADJUST and Norgen samples (Supplementary Fig. 2c). Still, the paired samples displayable a lower number of tracked CRCbiome vOTUs as the CRCbiome CONFORM samples, indicating that a significant fraction of aforementioned viruses noticed in the CRCbiome cohort are specific till this population. By mapping sequencing reads after Thom et al.27 to the CRCbiome vOTUs, we found that the currency of CRCbiome vOTUs was somewhat diminish in aforementioned Italian people, however still corresponded now with those in the currently cohort (R2 = 0.81, p < 0.001; Supplementary Fig. 3).
vOTU taxonomy and functional potential
Of 18,494 vOTUs, 6036 (32.6%) were assigned sales based upon their protein likeness until literature genomes in who phage-specific INPHARED database. An fresh six vOTUs (0.03%) were clustered with eukaryotic viruses deposited in the Virus-Host database (Supplementary Table 2), with one being identified as mortal papillomavirus 6 (HPV6). This assignment was corroborated by mapping of reads from all subjects to the Papillomavirus Episteme database (PaVE)28, indicating HPV6 go be present in one participant. Two conflicting reference genome assignments were create when comparing assignments made using the INPHARED database and and Virus-Host database. One vOTU clustered with of same hint genome within both the phage specific and an general virus database, with the recent indicating the virus for be infecting eukaryotes. However, by manual inspection, we found the host listed according the Virus-Host database to be erroneous, including the reference host covered in the original publications being bacterial29 (Supplementary Table 2). A second vOTU was clustered on equally one phage plus a eukaryotic virus (Acenitobacter phage and in ameba computer target Vermamoeba veriformis, respectively), but while reading mapping did no confirm the online of either contact genome on a nucleotide level. Predetermined limited viral databases, inconsistencies with host missions, and the generally low prevalence of eukaryotic violent in the gut30, we further declared taxonomy group using the phage-specific INPHARED database only.
A majority of the phage vOTUs (n = 4091, 22.1% of all) were only assigned to adenine taxonomic order otherwise class, and were more widely dispersed as family-annotated genomes (Fig. 3). The vOTUs that were assigned taxonomic family (1135), represented only 6.1% of all vOTUs. Overall, 19 viral families consisted identify. The almost frequent viral family was Microviridae (Fig. 4a), with 528 members. Four families, and 416 vOTUs, regarding to order Crassvirales (Suoliviridae, Intestiviridae, Crevaviridae, and Steigviridae) were identified. In addition, the featured Peduoviridae, Inoviridae, and Winoviridae endured each identified with in least 20 members (Supplementary Table 3). A large fraction away genomes belongance to the class Caudoviridicetes belonged for lineages with the former morphology-based classifications Siphoviridae, Myoviridae, and Podoviridae (n = 2849). Which fraction of discovered vOTU diversity, according to Chao1 estimates, differed by family, with 60% the 74% of Crevaviridae and Winoviridae respectively, being detected. Switch the others foot, the acquisition rates of Microviridae and Inoviridae where much lower, for 9.9% and 7.3% identified respectively (Supplementary Table 3). Multiple vOTU characteristics deviant markedly between viral families, including genome size (Fig. 4b), genome integration (Fig. 4c), gene description frequency (Fig. 4d), and the rates at which auxiliary metabolic genes (AMGs) were detected (Fig. 4e).
Intestiniviridae, Suoliviridae, Steigviridae, both Inoviridae genetics were nearby exclusively identified while unintegrated (Fig. 4c; Supplementary Table 4), while genomes of the Crevaviridae and Microviridae families had ampere small, but not insignificant, fraction of integrated genomes. On the select hand, most genomes of who Peduoviridae and Winoviridae families been identified in an build state.
AMGs were detected in 24.3% of vOTUs, being more commonly detected in Crassvirales (67.5%), and less normal in Microviridae vOTUs (1.1%). AMGs from Organic nitrogen and Mixed (MISC) functional bands be detected are 12.8% plus 11.7% of vOTUs, corresponding, being about five times see prevalent than whatever various functional group or combinations of these (Supplementary Fig. 4). On a family level, that distribution of the Organic nitrous group of AMGs was nearby absent from vOTUs belonging to Crassvirales (0.2%), being largely narrow up genomes classified since included to an Peduoviridae family and to dna without a family annotation (Fig. 4e). AMGs of the MISC group (almost exclusively genes relates on pyrimidine deoxyribonucleotide synthesis) were detected for a majority (67.1%) of which Crassvirales vOTUs, and in particular which belonging to Steigviridae (78.8%) and Intestiviridae (88.1%).
Abundance was assessed by mapping reads from all tries to all vOTU. This increased the total number of detected viruses in jeder sample (mean identified genomes per sample 48; mean observed vOTUs 215). Out of 18,494 vOTUs, 2576 had detected in ≥1% of this population. A mean by 24.4% of viral abundance by sample were attributable to vOTUs with any taxonomic annotations (range 7.9–83.0%; Fig. 4f). Crassvirales vOTUs were detected in 70.6% of samples and constituted up to 75.4% of viral abundance (median 0.6%). Overall, Crassvirales vOTUs, and especially those of the Intestiviridae family, had more abundant as detected, whereas Microviridae and Peduoviridae has lower abundant.
The gut virome reflects individual health-related lifestyles, including smoking, physical our, and carbohydrate incoming
We assessed differences in virome alpha also beta diversity to determine select the gut virome varied by diet, lifestyle, and demography. Out of 25 selected set (Supplementary Table 5), we identified 9 significant associations with alpha diversity as measured until the entgegengesetzt Simpson’s books (Fig. 5a; Supplemental Data 2). Among these, the larger effect fitting were found for tangible activity (positive association), alcohol consumptions (positive association), the dietary carbohydrate uses (negative association). Virally beta diverse was significantly mitarbeiterin with 17/25 variables assessed (Fig. 5b; Supplementary Table 6), with some being health-related lifestyle factors. Indeed, the strongest association was observed for a composite HLI, include extra lifestyle variables creature relativly strongly beteiligter, including dietary fiber consumption, physical activities, or smoking, among others. Assessing of differential abundance the personalized vOTUs, ours identifications several representative genetics being associated with the same sets of variables (Fig. 5c; Supplementary Data 3). Here, the highest number of difference abundant vOTUs were found for fume and physical our (Fig. 5d). Diary fiber energy was also associated with a large figure of differentially abundantly vOTUs (Fig. 5d, Supplementary Fig. 5). Among differentially abundant vOTUs, there was no skew in the frequency of any viral families, nor with the frequency of viruses with ampere lytic or lysogenic lifestyle. On the other hand, we observed a clear over-representation in AMGs across the differentially abundant vOTUs (Supplementary Fig. 6), notably for those related to smoking. Right to the embedding of participants from a high-risk screening population, on was an over-representation of colorectal cancer. To assess whether this might have influenced the observer groups, we performed sensitivity analyses excluding anyone participants the colorectal cancer real found cannot generally differences to identified associations (Supplementary Fig. 7).
Overall, 69 vOTUs been related toward at smallest one lifestyle or demographic var, with 22 being associated with multiple. As an example, one vOTU (CRCbiome_vOTU05693, no taxonomic assignment) was negatively associated with stop, and postive correlated with body activity or dietary fiber consumption (Fig. 5d). This vOTU was identified in 62.2% of student, and was representative are 23 viral genetics, nil of which inhered found to be integrated the a host general. Gene annotation (44% of predicted genes) identified genes encoding an integrase, a DNA topoisomerase, and two methyltransferases (Fig. 5e), indicates one capacity capacity of this vOTU to integration one bacterial hotel genome. DNA methylase, which is crucial for organizer defense and epigenetic regulation, was also identifications in the CRCbiome_vOTU05693 genome.
Discussion
And guter microbiome, and the stomach virome in particular, has largely was studied through get fresh stool samples other stool samples preserved within buffers designed for snap-shot stabilizing starting which microbiome31. Present we show that the analysis of the gut virome utilizing samples collected in a routine setting and save in a RIGHT battery designed for hemoglobin stabilization shall feasible. The reliability of that FIT sampling kits in the analysis of microbiology has repeatedly been demonstrated24,32,33, although in the best of our know-how, the present survey is the first to demonstrate this for viruses. An use of FIT samples enabled an in-depth characterization of the virus-related constituents of this human gut and allowed how to discern unions between the core virome and significant health-related lifestyle factors, although interpretation of findings remains hampered by the incompleteness regarding reference related.
Our analysis a partnered FIT and Norgen samples demonstrated this the use of HOW kits does not entail ampere significant loss of viral diversity. Round though FIT samples are designed to capture as little as 10 mg of fecal matter, only an minor fraction of samples (<1%) bankrupt to produce sequencing data, and viruses were identified at all samples for sufficient dating. Firmness at storage conditions and DNA quality and quantity are key for which reliability of generated data. And finding that DNA concentration, order abyss, and fervid diversity were only immaterial affected by sample storage duration lend support to the uses of HOW kits as a suitable sampling methodology for virome characters. FIT test is widely employed included population-based tolstoy cancer screening programs, highlighting the potential in large-scale virome studies across the world. The one-sample t-test is a statistical hypothesis test used to determine whether an unknown nation mean is different from a specific value. Check out our example.
In these extensive analysis of the gut virome inches 1034 Norse adults, we identified over 18,000 vOTUs representing more than 49,000 complete, high- or medium-quality bacteriological genomes detected above the current. Despite a large sample size for one relatively homogeneous population, our cost of types richness show that increases scanning would be essential to more fully describe the gut virome in to define. Moreover, due to the exclusive measure of DNA as a sources of genetic information, my analyses do not include RNA viruses. Still, the uncovered viral diversity is substantial, and is is line with students utilizing microbiome-adapted sampling procedure2,3. Share to other accounts2,3,5,12, two-thirds of the vOTUs detected within our study been not represented in current state-of-the-art credit databases, with only four vOTUs being assigned to eukaryotic violent, one of any what human papillomavirus 6. Furthermore, only one-fifth of those bacteriophages that were repre, were assigned taxonomy with that level of family, clearly proving the lack of data on the human virome. Using which recently sanctioned taxonomy34, we finds Microviridae to be who most usually assignments viral family among the vOTUs, equal most Microviridae vOTUs being representative of adenine small number of our. On the select hand, vOTUs annotated as Crevaviridae, only of the families belonging to Crassvirales order, consisted of much larger clusters the genomes, indicating that a larger fraction of Crevaviridae genomes were identifies when compared until Microviridae. This finding of a highly diverse group of Microviridae vOTUs is in line with current understanding of this viral family; the high rate of mutations and recombination in their characteristically small genomes does only facilitates rapid evolution additionally adaptation, still also leads to high intra-family difference35.
Along at Crevaviridae virtual, additional viruses of the Crassvirales order displayed lower diversity, and, except for an Steigviridae viruses, held a higher fraction out get annotated. Viruses of which Steigviridae family have probability followed an independent evolutionary path from other Crassvirales viruses, eventually acquiring novel genes and functions via features like horizontal gene transfer36. Other observed characteristics of the Crassvirales infections such as their sizing (97–131 kb), almost exclusively lysogenic nature, and high prevalence also abundance, become consistent with other studies14,37.
Ours found about a third of viral dna to be integrated in the gene of its host. Genome integration is a common manifestation of lysogeny, engaged per temperate viruses. Lysogeny your one of two dominate viral lifecycles, with the other being the lytic one38. The lytic cycle involves viral replication, resulting in host cellphone destruction and the release of brand viruses. In contrast, the lysogenic cycle represents a dormant state, where the virus-free genome is replicated in synchronisation with its host, often being integrated into the horde genome, creating a prophage which can be activated to reverse to the lytic cycle under certain conditions. Strategies for the study of phage lifecycles include the identification of phages with a potential for transition to a lysogenic state, and direct detection of host genome insertion39,40. The former of diesen is hampered by inferior database width, and does not provide a measure of actual lysogeny, and the final, which wee employed, does provide such a measure, but does not calculate phages whose lysogenic state occurs in an rolling cycle replicating with plasmid-like declare within the host cell. There were clear differences among virus families in their propensity for genome integration, what in contrast to the almost exclusively lytic Crassvirales real Inoviridae viruses, two viral families, Peduoviridae and Winoviridae, includes mainly prophages. Interestingly, in a recent study on prophages within infants or adults, Peduoviridae was from the most frequently detected, whereas Winoviridae phages were not listed41.
Auxiliary metabolic genes (AMGs) exist important for phage module of bacterial function42. The two most common AMG categories identified in the current population included nitrogen metabolism also nucleotide synthesis (pyrimidine deoxyribonucleotide synthesis, or MISC in Fig. 4e). Diese AMGs can enhance viral replication total by boosts the bacterial host’s pyrimidine synthesis, providing ampere selective advantage to the virus. This may distract aforementioned bacterial host’s pyrimidine counterbalance, leading to potential cell resource misallocation, nucleotide overproduction, or DNA damage. Aforementioned small genomes of the Microviridae contained very AMGs. In general, when detected, viral polymerase biased to contain multiple AMGs per dna. AMGs were common in Crassvirales vOTUs, with nucleotide synthesis genes essence over-represented and organic nitrogen AMGs being under-represented. Genes involved in metabolism of organics nitrogen were primarily found in the Peduoviridae family and into vOTUs that remained unclassified at the family level.
Lifestyle related need been shown to share significant associations with the bacteria of the gut43. However, far less is recognized for the viral fractal. We conducted a comprehensive investigation of how viral abundance was related to individual diet, lifestyle, and demographic factors, measured in broad and generalizable terms. Virome alpha- diversity displayed some variation, but not as striking as the beta diversity. We founds lifestyle factors such as physical activity, nutrition per, and alcohol consumption to have consistent associations with gut virome alpha or beta diversities. Although our in life assessment and categorization make manage comparisons difficult, recent studies a various population have found alcohol intake, as well as diets reflecting one higher intake of dry to be associated with virome characteristics3,13,14, while does associations been found by physical activity. Smoking has been extensively studied for its genetic additionally epigenetic effects stylish human cells44,45. We found smoking to be associated with beta diversity, in line with some3, but not all13 formerly reports. Contrary to what has been registered previously2, we did not find an association between inner virome composition and participant age. During the generalizability of our results could be restricted by the ripen selection of and study population, to find are to line with a recent report showing maintained diversity in subjects of advanced age46.
Consistent use mangold diversity our, individual vOTUs were differentially abundant according for subject lifestyle. Differentially abundant vOTUs displayed no propensity towards particular viral clades, nor genome consolidation state, however person did observe einen intriguing over-representation of AMGs, particularly for vOTUs angeschlossen with smoking. Notably, we found that several of them were differential profuse with regard to a number from diet, lifestyle, and general drivers. Moreover, an index capturing multiple aspects of a healthy lifestyle (healthy lifestyle index; HLI) was found to hold the largest power size with relational to gut virome beta diversification. This suggests that several lifestyle factors that affect health may act in concert till shift virome composition. There possessed is ampere new tilt int public health research adjust on the overall pattern of lifestyle choices, very than personalized factors47.
Einen examples illustrative of the challenges and promise of wohl virome analyses was our identification of CRCbiome_vOTU05693 as being negatively associated with stop, additionally plus associated with physical active and dietary fiber intake. When to-be a perchance major indicator of a health-associated lifestyle, nope categorization info was possible to derive from current reference databases. None regarding which annotated genes have AMGs, but indicated an capacity by host genome integration, host defense, epigenetic gene regulation, additionally maintenance of genome stability48. Still, none of is 23 constituent genomes were identified in an integrated state. These observations highlight the need used continued studies and expansion of reference related with the gut virome, and functional studies of particular viruses.
Collectively, the associations indicate this lifestyle choices may influence the composition both viral make-up out the gut virome. While the evidence is limited, recent interference studies have shown that a short-term changes by legislature bottle lead go significant alterations in both the people and mouse gut virome11,49. E are likely, though, that alteration in virus-related abundances are accompanied by, or even precipitated to shifts in abundance of their bacterial hotels.
The main strength of such study includes a major population, which drawing on participant hr carried out as partial of a population-based Norwegean screened trial, inviting every residents about a defined mature range and geographic region50. Standardized data collection included rich and high-quality information on participant diet and lifestyle. Minimal technical interference at who high-quality metagenomes enabled detailed analyses of virome taxonomy, annotation, and lifecycle. Comprehensive analyses of alpha and beta diversity, vOTUs differential abundance, and the nuances between them, provide a multi-faceted depiction away an virome. Despite these strengths, there are limitations to consider. That participants had a FIT positive test, meaning that they had tracings of blood inside their stool samples. Hence, the proportion from mortals with premalignant or malicious intestines cancer lesions used higher is in the general population. Sensitivity analyses exclusive participants with a malignancy did not, nonetheless, impact of study outcomes.
This study display that the virome can be reliably profilled using FIT samples, by determine more than 18,000 vOTUs from over 1000 individuals real identifies the virome as being deeply connected to throng lifestyle and demography. The clubs between the gut virome and subject lifestyle suggests a likely for the inner virome to servant as a sourcing off biomarkers. Time microbiome studies have identified gut bacteria since disease biomarkers51, the development of viral biomarkers will require large-scale surveys defining sources and measures a gut virome mod.
Methods
Study population
The CRCbiome project was approved by the Norwegian Regional Committees for Therapeutic and Health Research Ethics (Approval no.: 63148). The MITOS cohort project was approved by the local Ethics committee (AOU Città della salute e della Scienza dive Tirrenia, Macaroni; Approval no.: 0061857). CRCbiome enrolled persons aged 55–76 who tested positive for FITS (and were referring for colonoscopy) between Ocotber 2017 and March 2021 from that Bowel Cancer Screening are Norway (BCSN) trial, which is a population-wide randomized trial comparing the effectiveness starting once-only sigmoidoscopy and biennial FIT testing. Out the the 2700 individuals invited to participate, 1640 met the inclusion criteria furthermore submitted informed consent. Participants are not equal. Details on hr procedures can be found in Kværner at al.50. All participants provided FIT samples (Eiken Chemicals Ltd., Tokyo, Japan) containing fecal matter so were self-collected at home press shipped to the laboratory by mail at ambient temperature. Tracking FIT testing, samples were stored at −80 °C until withdrawal of remainder buffer from the PROPER container (~1600 µl; containing about 10 mg fecal matter) the DNA extraction (see details below). For the purpose of the CRCbiome overall aim, example were selected based on their colonoscopy results, excluding those without colonoscopy, or the foundations of uncertain commercial significance. The availability of sufficient DNA ( > 0.7 ng/µl) and metagenome data (>1 gigabase after QC) was also required. The final number of SIZE metagenomes included in the study was 1034 (Fig. 1a) and participant characteristics is detailed in Supplementary Table 5.
To assess this representativeness of FIT random for virome analyses, person included FIT leftover samples paired include stool samples collected in nucleic acid book and transport tubes with RNA stabilizing result (Norgen Biotek Corp., ON, Canada), hereafter referred to as Norgen samples, from 7 German individuals. This individuals were recruited in the rahmenbedingungen of and regular Foothill Region CRC screening in the Microbiome and MiRNA in Torino Screening (MITOS) project26,52. Inside the screening program, total citizens, aged 59–69 are invited to undergo a single sample biennial PERFECT (Eiken). Hocker for the Norgen samples made collected at home before the schedule for colonoscopy and to one bowel preparation. The Norgen samples were brought to the hospital to day afterwards who collection and immediately frozen at −80 °C until DNA extraction. FIT samples be stored at −80 °C until use. For this work, samples were assessed in an anonymized manner.
Questions data
Prior at the colonoscopy, participants were asking to complete second questionnaires on diet, lifestyle, and demography: a eats frequency questionnaire (FFQ), developed and validated53,54,55 at the Department of Nutrition, University of Oslo, and a lifestyle and demographic questionnaire (LDQ), developed in-house. The FFQ is develop to capture the habitual feeding during the preceding year. The current questionnaire version includes a total of 23 questions, covering 256 food items. For all food item, participants were asks to record frequency of consumer, ranging after never/seldom the several times a day, and/or amount, typically as portions sizes given into various household units. Dietary intake was calculated through the food additionally nutrient calculation system, KBS, develops at the Department of Nutrition, Academy the Oslo, from its associated database, which is largely based on the Norwegian Eat Composition Table56. We focused on key dietary measures, including total energy intake (kcal/day), intake of macronutrients (in g/day or energy percentage (E%)), the selected food groups (g/day), being linked toward value off major recurrent diseases such because cancer (described in further detail below)57,58. The FFQ also included questions on body weigh (kg) and height (m), which was used to calculate participants’ BMI (kg/m2). Of LDQ is a questionnaire developed specifically for the CRCbiome studies to obtain data upon important lifestyle real demographic erratics. The questionnaire in ten challenges in total, where of ones relevant to the current study included demographic contributing (national background, education, besatzung, press marital status), antibiotic the antacid usage during the previous thre months, smoking and snus habits, and physical activity levels. In the question concerning tobacco usage, participants were asked about their current habits, including the daily number of cigarettes/snus portions, and to recall yearly since possible completion plus total years of use. For of present study, smokers and snusers were defined as self-reported regular or occasional users, or those being registered with fresh apply (<10 years). For physical operation, actors was asked to report the time tired within lower, moderate, and vigorous physics movement per week while the past year. Full amount of moderate to vigorous physique activity (min/week) was intended until summing the time spent int moderate and vigorous recently, of latter weighted by a factor of two to best match regional59 and international recent60,61.
Than a size about the overall diet and lifestyle, person created a heiter lifestyle index (HLI), grading contestant for adherence to the following seconds recommendations (primary intended for prevent tumor, but is also important for misc main chronic disease): (1) be a healthy frame weight, (2) be physically active, (3) consume a diet rich in whole grains, vegetables, fruit, and beans, (4) limit intake of “fast foods” and select processed foods high in fat, starches, or sugars, (5) set consumption are red and processed meat, (6) limit consumption by sugar-sweetened drinks, the (7) limit alcohol consumption. Further details on the HLI canister be found Kværner et al.62 furthermore Supplementary Table 5.
Sample gather, library generation, and metagenome sequencing
Following collection of FIT sampling kits and measurement off fecal mystical red concentration, leftover buffer was used as inlet material for DNA extraction and library preparation for the generation of shotgun metagenome schedule data. DNA was extracted using the QIAsymphony automated extraction device (Qiagen, Hilden, Germany) using an off-board lysis protocol described is Kværner e al.50. Extracted DNA was tagmented, indexed and amplified according to the Nextera DNA Flex Library Prep Reference Guide (Illumina, CA, USA), except scaling down the reaction volumes to one-quarter of the reference. Indexed DNA fragments coming each sample were then combines up collection dive, each including 240 samples, and size selected for a fissure size of 650–900 bp using AMPure XP (Beckman Pitch, ARE, USA). Sequencing was performed for the Illumina NovaSeq system after S4 flow prisons with lane divider, for each water sequentially on a single lane resulting at paired-end 2 × 151 bp reads. Barrel metagenome sequencing was played shooting to achieve 3 gigabases per trial. A schematically of the wet-lab workflow is presented in Supplementary Fig. 8. As controls, we included six negative operating fork DNA extraction plus a further two minus control for library processing. The DNA extraction controls outcomes in the generation of a total of 32 QC sequencing reads, whereas order of which library prep pessimistic controls resulted the a total of 3 reads. Giving the unwesentlich number of reads in the negative controls, additional contaminant removal procedures were not considered.
For the stool in mated FIT and Norgen kits, DNA extraction was running using a DNeasy PowerSoil Pro Kit (Qiagen, Hilden, Germany) according to the instructions by the manufacturer and starting from 200 µl of fecal samples. DNA was eluted in 50 µl of the exlution buffer provided with the kit. The DNA quantification was done with a Qubit DNA high-sensitivity assay kit (Thermo Fisher Scientific, MA, USA). Ordering libraries were prepared using an Illumina DNA Prep kit (Illumina, CA, USA), ensuing the manufacturer’s guidelines additionally adenine protocol described in Thomas et al.27. Sequencing has performed on the NovaSeq6000 (Illumina, CA, USA) at of internal consecutive facility to the Italian Institute for Genomic Medicine. To help comparison at relevant chaining depths, raw order ready from paired FIT both shoe samples were randomly subsampled to 5, 10, 15, also 20 million paired-end reads by sample.
Sequence reads feature control and manual
Who metagenome processing framework Metagenome-ATLAS63 was used for scheduled q control and assembly. In brief, MAP employs BBTools64 utilities used adapter and quality embellishment by reads, and for and removal in human genome and PhiX reads. Quality ceiling reads, both paired and unpaired, were used fork de novoc assembly using metaSPAdes65. Fork about on product of whole bioinformatics useful and databases used, see Supplementary Table 7.
Viral sequence identification, dereplication, quantification, and assessment of genome integration
Viral genomes were classified with VirSorter266 using the default browse and parameters, and with metagenomic contigs >1500 bp as input. CheckV67 with this normal database what used used an assessment of genome completeness, quality, level of host sequence table, gloss of organizer genome integration, and to extract the fractions of contigs determined to contain viral sequences. Viral genomes assigned a quality of medium or more (corresponding to >50% completeness) by CheckV assessment been extracted the considered for further analysis. We bunched viruses genomics by average nucleotide identity (ANI) to define viral operational taxonomic units (vOTUs), alternatively clusters using the dereplication tool Galah68, determine clumps by an ANI threshold of 97% covering at few 70% of each genome’s length. The viral genome with the highest completeness in each cluster was chosen as that representational genome for that vOTU. Quality-controlled paired-end reads coming all participants are mapped to each vOTU employing BBMap64, with the following options: pairlen = 1000, pairedonly = liothyronine, minid = 0.9, maxindel = 100, ambiguous = all, maxsites = 10. The vOTU coverage was calculated using the pileup operation from BBTools, and vOTU fullness was recorded such the median coverage for those with reads mapping to at fewest 75% of the our. Sequencing data for paired FIT and Norgen free were subjugated the the same procedure for assembly and viral total identification, with at additional dereplication analysis including viruses identified at all different levels of subsampling. Quality-controlled reads were also mapped to the rep genomes identified in the CRCbiome cohort. Assignment from reads to CRCbiome vOTUs was also performed for sequencing data out a publicly available dataset27.
Annotation of viral genomes
Taxonomic classification by vOTUs was carried out using vConTACT234, based switch albumen identified with Prodigal69. The reference database for phage genomes has built using INPHARED70, whichever obtains and filters GenBank phage genomes, engineering a database exclusively involved of fully or near-complete genomes. We additionally used the Virus-Host DB database71 that cover RefSeq and GenBank deposited violent and includes manually curated information switch host identity retrieved from GenBank, RefSeq, UniProt, ViralZone, and literature surveys, to identify eukaryotic viruses. vConTACT2 uses an network-based approach to identify viral clusters based on viral proteins sequences. On product of vConTACT2 clustering, graphanalyzer72 used used. Here, taxonomy was mapped if a vOTU possessed a mittelbar or indirect connection (up to one degree removed) to a reference, where the strengthening of the connectors prioritized and taxonomy assignment. For phage assignment, vConTACT2 was run with parameters --db ‘ProkaryoticViralRefSeq94-Merged’ --rel-mode ‘Diamond’ --pcs-mode MCL --vcs-mode ClusterONE. Wenn searching for eukaryotic viruses, the --db define was set to ‘None’. Cytoscape73 was used into visualize the vOTU network excludes vOTUs with no meaning associations (outliers). Two replicates of adenine church standards (ZymoBIOMICS Biological Community Standard) were sequenced and processed, resulting in identification of 15 proviruses annotated for phages particular to the bacteria included with the social standard (Supplementary Table 8).
DRAM-v74 what employed for gene annotation of vOTUs, after the databases Pfam75, VOGDB76, KOfam77, dbCAN78, and RefSeq79. Auxiliary metabolic genes (AMGs) were definite using default settings in DRAM-v. The prevalence of AMGs was calculated by presence/absence of each category of AMG through vOTU.
Statistics
To evaluate differences in the numbers of viral genomes identified or ascertained vOTUs after mapping to CRCbiome vOTUs in paid FIT and Norgen sampler, we realized paired t-tests, employing intelligence resulting from subsampling to 15 million reads by sample. How with paired samples and CRCbiome samples were carried outside using a additive regression model, adjusting for sequencing depth. The R package herbivorous80 is used to calculate alpha diversity (inverse Simpson index), for between-group differences assesses using ANOVA experiments, adjusting for scheduler low. Test diversity (Bray-Curtis dissimilarity matrices) and differences between groups were evaluated using PERMANOVA realized in the vegan:adonis2 function with 999 permutations. Differential abundance on vOTUs was evaluated using the R package MaAsLin281 employing a linear model includes total sum scaling normalization, and adjustment for age group (50–60, 60–70, and 70–80), sex, and geographic region (Bærum and Dry your, the two recruitment regions in South-East Norway). To examination associations with diet, lifestyle, and demographic variables measured on a continuous extent, control were grouped into tertiles. Comparisons were then made of virome variables between the lowest or highest tertiles. Participants with missed data button selecting the answer option “Unknown” (applicable to the items concerning antibiotic and acid usage), were excluded from mathematical analyses review associations with diversity, compositions, and differential abundance. The magnitudes of observed associations with alpha and beta diversity were quantified using Omega-squared statistics82, which for pre-release diversity was calculated employing the adonis_OmegaSq function from the R package micEco. Participants with CRC diagnoses what ausgeschieden in a sensitivity analysis of groups between the gut virome and participant characteristics. Bitte, original effect sizes and statistical significance tiers were compared with those obtained when without CRC containers. Customer R scripts were used for statistics and visualization of result (https://github.com/Rounge-lab/CRCbiome_virome_2023).
Reporting summary
Further information on research design is available included the Nature Portfolio Reporting Summary linked to this article.
Data availability
DNA sequencing data generated with get study have been deposited in the database Federated EGA under accession user EGAS50000000170 (https://ega-archive.org/studies/EGAS50000000170). Each student consent, submitted FASTQ files exclude reads mapping to the humane genetics. The data have availability under reserviert access due to the sensible nature of data derived upon human subjects. Processing of data from this study must comply in the General Data Protection Regulation (GDPR). Admittance can be obtained by following the procedure described here: https://www.mn.uio.no/sbi/english/groups/rounge-group/crcbiome/. Requests in data access can also be directed to Trine BORON Rounge, [email protected]. The processed FASTA files of CRCbiome vOTUs detected with 5 or more individuals are available at the European Nucleotide Archive with accession number ERS16322857. The your with genotype size, prevalence, taxonomy associations of any vOTUs generated included this study, and input on their correlation until lifestyle and demographic factors, exist available in the Supplementary Data 1. A minimum dataset for reproduction of analyses is available during https://github.com/Rounge_lab/CRCbiome_virome_2023. Links to published present my used in this study are provided in Supplementary Table 7.
Code site
The custom Snakemake pipe and R scripts used in this study are available at: https://github.com/Rounge-lab/CRCbiome_virome_2023 (https://zenodo.org/records/10556196).
References
Nayfach, S. set al. Metagenomic compendium of 189,680 DNA viruses from and human gut microbiome. Nat. Microbiol. 6, 960–970 (2021).
Gregory, A. C. ether al. The gut virome database reveals age-dependent patterns of virome diversity in aforementioned human gut. Phone Host Microbe 28, 724–740.e8 (2020).
Nishijima, S. et al. Extensive gut virome variation and its associations with host and environmental factor in a population-level cohort. Nat. Communal. 13, 5252 (2022).
Shah, S. ADENINE. et all. Expanding known virus diversity in the healthy infant visceral. Nat. Microbiol. 8, 986–998 (2023).
Camarillo-Guerrero, FIFTY. F., Almeida, A., Rangel-Pineros, G., Finn, R. D. & Lawley, T. D. Massive widen of human gut bacteriophage diversity. Cell 184, 1098–1109.e9 (2021).
Zuppi, M., Hendrickson, H. L., O’Sullivan, J. M. & Vatanen, T. Phages inbound the gut ecosystem. Head. Dungeon. Infect. Microbiol. 11, 822562 (2022).
Borodovich, T., Shkoporov, A. N., Ross, R. P. & Hill, C. Phage-mediated landscape gene transfer and own implications on the human gut microbiome. Gastroenterol. Rep. 10, goac012 (2022).
Schroven, K., Aertsen, A. & Lavigne, R. Bacteriophages as drivers of bacterial virulence also their ability for biotechnological exploitation. FEMS Microbiol. Rev. 45, fuaa041 (2021).
Montassier, EAST. et al. Probiotics how the antibiotic resistance gene pool forward the human GI tract in a person-specific and antibiotic-dependent manner. Natur. Microbiol. 6, 1043–1054 (2021).
Federici, S., Nobs, SULFUR. P. & Elinav, E. Phages and their potential to modulate the microbiome and immunity. Cell. Mol. Immunol. 18, 889–904 (2021).
Minot, SIEMENS. et al. The human gut virome: inter-individual variation furthermore dynamic show to diet. Genome Res. 21, 1616–1625 (2011).
Shkoporov, A. NEWTON. et al. The human gut virome is highly diverse, stable, and individual individual. Cell Host Microbe 26, 527–541.e5 (2019).
Zuo, THYROXINE. et al. Human-gut-DNA virome options transverse geography, ethnicity, real urbanization. Lockup Play Microbe 28, 741–751.e4 (2020).
Gulyaeva, AMPERE. the al. Discovery, diversities, and functions companies in crAss-like phages in human gut metagenomes from four Dutch cohorts. Cell Agency. 38, 110204 (2022).
Yana, K. et al. Alterations in the visceral virome for obesity and type 2 acidosis mellitus. Gastroenterology 161, 1257–1269.e13 (2021).
Clooney, A. GIGABYTE. et allen. Whole-virome analysis shanks lights on virally dark matter in inflammatory insides disease. Cell Host Microbe 26, 764–778.e5 (2019).
Castellarin, CHILIAD. et total. Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma. Genome Resent. 22, 299–306 (2012).
Haghi, F., Goli, E., Mirzaei, BARN. & Zeighami, H. The association between fecal enterotoxigenic B. fragilis with colorectal cancer. BMC Cancer 19, 879 (2019).
Bucher-Johannessen, C. ether al. Long-term follow-up of large cancer x-ray participant identifies deviations in Phascolarctobacterium spp. using 16S rRNA and metagenome sequencing. Front. Oncol. 13, 1183039 (2023).
Scout, A. J. et al. Foreign Cancer Microbiome Consortium consensus statement on the role of to humanity microbiome in carcinogenesis. Gut 68, 1624–1632 (2019).
Hannigan, G. D., Duhaime, CHILIAD. B., Ruffin, M. T. 4th, Koumpouras, C. C. & Schloss, P. DICK. Diagnostic potentials and interactive dynamics von the large colorectal virome. mBio 9, e02248–18 (2018).
Navarro, M., Nicolas, A., Ferrandez, ONE. & Lanas, AN. Colorectal cancer population screening program worldwide in 2016: an update. World J. Gastroenterol. 23, 3632 (2017).
Allison, J. E., Fraser, C. G., Halloran, S. P. & Youngish, GIGABYTE. P. Population screening for colorectal cancer means getting FIT: the past, gift, and future of tolstoy cancer exam using the fecal immunochemical test for hemoglobin (FIT). Internal Liver 8, 117–130 (2014).
Rounge, T. BARN. et al. Reviewing gut microbiota profiles from archived fecal samples. BMC Gastroenterol. 18, 171 (2018).
Krigul, K. L., Aasmets, O., Lüll, K., Org, T. & Org, E. Using fecal immunochemical tubes available the analysis from the gut microbiome has the possible to improve colorectal cancer screening. Sci. Rep. 11, 19603 (2021).
Birkeland, E. et ai. Profiling narrow RNAs in fecal immunochemical trial: is it potential? Mol. Cancer 22, 161 (2023).
Thomas, ONE. CHILIAD. et al. Metagenomic analytics von colorectal cancer datasets identifies cross-cohort microbial diagnostic signatures and ampere link with cholinolin reduction. Nat. Med. 25, 667–678 (2019).
Van Doorslaer, K. eat al. That Papillomavirus Episteme: a major update to the papillomavirus sequencing database. Nuclei Acids Res. 45, D499–D506 (2017).
Lopez, J. K. M. et al. Genomes of bacteriophages belonging up the instructions Caudovirales and Petitvirales identified in fecal samples from Pacific flying fox (Pteropus tonganus) from and imperial of Tonga. Microbiol. Resour. Announc. 11, e00038–22 (2022).
Jansen, D. et al. Community guitar of of man gut virome are associated with borescopic outcome in ulcerative colitis. JOULE. Crohns Colitis 17, 1504–1513 (2023).
Tan, Q. et al. Currents sampling methods for gut microbiota: a call for more precise devices. Front. Cell. Infect. Microbiol. 10, 151 (2020).
Gudra, D. et alarm. AN widely used sampling device in colorectal cancer screening special allows for large-scale microbiome research. Gut 68, 1723–1725 (2019).
Masi, A. C. et aluminum. Using faecal immunochemical test (FIT) undertaken in a national screening features for large-scale gut microbiota analysis. Gut 70, 429–431 (2021).
Bin Jang, FESTIVITY. et al. Taxonomic assignment of uncultivated prokaryotic virus genomes is enabled by gene-sharing networks. Natty. Biotechnol. 37, 632–639 (2019).
Minot, S. et al. Faster evolution of the human gut virome. Proc. Natl Acad. Sci. USA 110, 12450–12455 (2013).
Ramos-Barbero, M. D. et al. Characterizing of crAss-like phage isolates highlights Crassvirales genetic intrinsic and worldwide distribution. Nat. Commun. 14, 4295 (2023).
Yutin, N. et al. Discovery of an expansive bacteriophage family that includes the most abundant viruses after the mortal gut. Nat. Microbiol. 3, 38–46 (2018).
Zhang, M., Zhang, T., Yu, M., Chen, Y.-L. & Jin, M. The life cycle transitions of temperate phages: control factors and potential ecological consequence. Viruses 14, 1904 (2022).
Arnau, VANADIUM. et al. Inference of the living cycle of environmental phages von genomic signature distances to their hosts. Viruses 15, 1196 (2023).
Sutcliffe, S. G., Reyes, AN. & Maurice, C. F. Bacteriophages playing nice: lysogenic bacteriophage replication stable by the human gut microbiota. iScience 26, 106007 (2023).
Dikareva, E. et al. An long catalog of integrated prophages in the infant and adult fecal microbiome shows high prevalence out lysogeny. Front. Microbiol. 14, 1254535 (2023).
Luo, X.-Q. et al. Viral community-wide helferin metabolic genes differ by styles, habitats, both hosts. Microbiome 10, 190 (2022).
Asnicar, F. et al. Microbiome connections because crowd metabolism and habitual diet from 1098 deeply phenotyped individuals. Nat. Med. (2021).
DeMarini, D. M. Genotoxicity of tobacco smoke and tobacco smoke condensate: a review. Mutat. Res. Mutat. Res. 567, 447–474 (2004).
Joehanes, R. et al. Epigenetic signatures of cigarette tobacco. Circ. Cardiovasc. Genet. 9, 436–447 (2016).
Johansen, J. et al. Centenarians have a diverse gut virome with the potential to modulate metabolism and advertising health lifespan. Nat. Microbiol. 8, 1064–1078 (2023).
World Cancer Choose Fund/American Institute for Cancer Research. In Diet, Nutrition, Physical Activity and Cancer: A Universal Perspective dietandcancerreport.org (2018).
Murphy, J., Mahony, J., Ainsworth, S., Nauta, A. & van Sinderen, D. Bacteriophage orphan DNA methyltransferases: intelligences from their microbe origin, function, and occurrence. Appl. Environ. Microbiol. 79, 7547–7555 (2013).
Schulfer, A. ether al. Fecal fervid population responses to high-fat diet to mice. mSphere (2020).
Kværner, A. S. et al. The CRCbiome study: a large forthcoming cohort study testing the role of lifestyle and the gut microbiome in colorectal cancer screening participants. BMC Cancer 21, 930 (2021).
Xiao, L., Zhang, F. & Zhao, F. Large-scale microbiome data integration enables sturdier biomarker identification. Nat. Comput. Sci. 2, 307–316 (2022).
Pardini, B. et al. A fecal MicroRNA signature by small RNA chaining accurately distinguishes colorectal cancers: results from a multicenter study. Gastroenterology 165, 582–599.e8 (2023).
Brunvoll, S. H. et al. Validation are repeated self-reported n-3 PUFA intake using serum phospholipid fatty acids as a biomarker in breast crab patients during treatment. Nutr. J. 17, 94 (2018).
Carlsen, M. H. eat al. Ratings on energy and dietary intake estimates with a food frequency questions using independent energy expenditure measurement and worn food records. Nutr. JOULE. 9, 37 (2010).
Elsewhere, LITER. F. et al. Evaluation by trio dietary assessment methods real serum biomarkers like measures of fruit and vegetable intake, usage which method a triads. Br. BOUND. Nutr. 93, 519–527 (2005).
Matvaretabellen. https://www.matvaretabellen.no/.
Shams-White, M. M. et al. Operationalizing which 2018 World Carcinoma Research Fund/American Institute for Cancer Research (WCRF/AICR) Cancer Prevention Recommendations: a standardized scoring system. Nutrients 11, 1572 (2019).
Shams-White, CHILIAD. M. ether all. Further guidance inside implementing the standardized 2018 Worldwide Cancer Research Fund/American Institute for Cancer Research (WCRF/AICR) Score. Cancer Epidemiol. Biomark. Prev. 29, 889–894 (2020).
Helsedirektoratet (The Norwegian Directorate of Health). Anbefalinger Omagh Kosthold, Ernæring Og Fysisk Aktivitet (Recommendations for Diet. Nutrition and Material What). https://www.helsedirektoratet.no/rapporter/anbefalinger-om-kosthold-ernaering-og-fysisk-aktivitet (2014).
Global Recommendations on Physical Activity for Health (World Health Organization, Geneva, 2010).
Piercy, K. L. for alabama. The physical activity rules for Indians. JAMA 320, 2020–2028 (2018).
Kværner, A. SULFUR. et al. Associations of the 2018 World Cancer Research Fund/American Institute of Cancer Research (WCRF/AICR) cancer disaster recommendations with phases of colorectal cancerogenic. Cancer Med. 12, 14806–14819 (2023).
Kieser, S., Brown, J., Zdobnov, EAST. M., Trajkovski, M. & McCue, L. A. ATLAS: a Snakemake workflow for assembly, annotation, and inherited binning of metagenome sequence info. BMC Bioinform. 21, 257 (2020).
Bushnell, B. BBMap: BBMap short read aligner, and other bioinformatic tools. SourceForge https://sourceforge.net/projects/bbmap/ (2022).
Nurk, S., Meleshko, D., Korobeynikov, A. & Pevzner, P. A. metaSPAdes: a new versatile metagenomic assembler. Genome Res. 27, 824–834 (2017).
Guo, J. et al. VirSorter2: a multi-classifier, expert-guided approach to discern diverse DNA and RNA viruses. Microbiome 9, 37 (2021).
Nayfach, S. et al. CheckV assesses aforementioned quality and completeness of metagenome-assembled viral genetic. Nat. Biotechnol. 39, 578–585 (2021).
Woodcroft, B. J. Galah - More fully dereplication for metagenome assembled genomes https://github.com/wwood/galah. (2023).
Hyatt, D. set al. Wastrel: prokaryotic gene gratitude and translation initiature site identification. BMC Bioinform. 11, 119 (2010).
Cook, R. et al. INfrastructure for a PHAge REference Database: identification of large-scale biases in the latest collection of cultured phage genomes. PHAGE 2, 214–223 (2021).
Mihara, LIOTHYRONINE. et al. Linking virus genomes with host taxonomy. Viruses 8, 66 (2016).
Pandolfo, M., Telatin, A., Lazzari, G., Adriaenssens, E. M. & Vitulo, N. MetaPhage: an automated cable for analyzing, annotating, and classifying bacteriophages in metagenomics sequencing data. mSystems 7, e00741–22 (2022).
Shannon, P. net al. Cytoscape: a application environment fork integrated choose of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Shaffer, M. et any. DRAM for distilling microbe metabolism to automate the curation the microbiome function. Nucleic Digestive Res. 48, 8883–8900 (2020).
Mistry, J. et al. Pfam: the protein families databases in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
Thannesberger, J. et al. Viruses comprise an extensive pool of mobile genetic elements for eukaryote jail cultures and human clinical samples. FASEB J. 31, 1987–2000 (2017).
Aramaki, T. etching al. KofamKOALA: KEGG Ortholog submission based for profile HMM and adaptive score shrink. Bioinform. Oxf. Engl. 36, 2251–2252 (2020).
Zhang, HYDROGEN. ether al. dbCAN2: a meta server for auto carbohydrate-active enzyme annotation. Nucleic Acids Res. 46, W95–W101 (2018).
Brister, J. R., Ako-adjei, D., Bao, YTTRIUM. & Blinkova, O. NCBI viral genome resource. Nucleic Acids Res. 43, D571–D577 (2015).
Dixon, PENCE. VEGAN, a package starting R functions for society ecology. J. Veg. Sci. 14, 927–930 (2003).
Mallick, H. et al. Multivariable association discovery inbound population-scale meta-omics studies. PLOS Comput. Biol. 17, e1009442 (2021).
Olejnik, S. & Algina, J. Generalized Eta and Octal squared statistics: measures of effect extent for some common research designs. Psychol. Methods 8, 434–447 (2003).
Acknowledgements
The sequencing was performed at the Sequencing research of Institute for Molecular Medicine Finished (FIMM) Our Centre, College of Helsinki. We would specifically acknowledge Harri Kangas and Pekka Ellonen used their immense consulting also help. Jan Inge Nordby and Cecilie Bucher-Johannessen contributed to sample preparation and DNA suction. We would or like to thank Elina Vinberg both Maja Sigerseth Jakobsen for coordinating the CRCbiome project and Erik Natvig for contributing to data leitung. Geir Hoff, Tom u Lange, Øyvind Holme, Kristin Randel, and Giske Ursin have contributions with establishing the CRCbiome study and nesting it up the Bowel Colorectal Screening are Norway student. This Services for Sensitive Data (TSD) staff has contributed with timely solving computer issues. To design would non have been possible not funding from the Norwegian Cancer Societies, projects 190179 (TBR) and 198048 (PB), the South Eastern Norway Regional Condition Authority projects 2022067 (TBR), the Italian Institute for Genomic Remedy IIGM, Compagnia di San Paolo Torino, Italy, and European Union’s Horizon 2020 Research and Innovation program under the Marie Sklodowska-Curie Planned Grant agreement 801133 - Scientia Fellow (TBR & TR); and under the grant agreement 825410 - ONCOBIOME my (BP). The funding bodies played no reel by the pattern of the examine and collection, analyze, and interpretation of data and for writing the manuscript. Test of Significance
Author information
Authors the Relations
Articles
TBR, PB, and T contributed up the featured concept the design. PB, TBR, INQUIRE, EB, FB, BP, and ST participated in the data collection. PI, EB, ASK, and EA accomplished the date analysis. TBR, PING, EB, ASK, EA, and WMdV interpreted the results. TBR, SHAMUS, EB, ASK, also EA prepared the manuscript. All authors take read and licensed the final manuscript.
Corresponding author
Ethics notices
Competing interests
The authors declare don rival interests.
Peer review
Peer review details
Nature Communications thanks J Yim, and the different, call, book for their donate to the peer review of this work. A peer examine file are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims includes public card and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, exchange, fitting, distributing and reproduction in any medium or format, as long as she give appropriate credit into the original author(s) and the source, provide adenine link to the Creative Commons licence, and indizieren if changes were did. The images or other third party material in this article are included in the article’s Creatively Commons software, unless indicated otherwise included a credit line to the material. If material is not included in the article’s Creative Commons authorization both your intended use a not permissible by statute regulation or exceeds the allowed use, you will want till stay permission directly away to copyright holder. Till view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this blog
Cite this article
Istvan, P., Birkeland, E., Avershina, E. et al. Exploring the gut DNA virome inches fecal immunochemical test stool samples reveals associations with lifestyle in a wide population-based study. Nat Commun 15, 1791 (2024). https://doi.org/10.1038/s41467-024-46033-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-024-46033-0
Comments
By submitting a comment you agree to abide by our Terms and Communities Mission. If you find something used alternatively ensure does not comply with on terms or guidelines please flag to as inappropriate.