Session 2: Metagenomics
In this use case we will use the metagenomics tools included in OmicsBox to analyze the microbial communities of two different soda lakes from Brazil. The original study was carried out by Ana P. D. Andreote, et al., 2018 (doi: 10.3389/fmicb.2018.00244).
Soda lakes are special ecosystems found across Africa, Europe, Asia, etc. These lakes show high levels of sodium carbonate and elevated salinity and pH. Given their special nature, it is interesting to examine their taxonomic composition and functional patterns.
Salina Preta and Salina Verde (Mato Grosso do Sul, Brazil) are two soda lakes that mainly differ by permanent cyanobacterial blooms in Verde, while Preta shows no record of cyanobacterial blooms. Therefore, it can be expected that highly different microbial communities and functional compositions are present between lakes.
The objectives of the study are:
To describe the bacterial diversity between Salina Preta and Salina Verde, and to identify the microorganisms responsible for blooms in Salina Verde.
To identify the functional genetic potential of these microbial communities.
Experimental design and available sequencing data
12 single-end metagenomic samples were considered for the analysis: three replicates for each lake, taken at two different times, morning and afternoon. Samples were collected at two different depths: 0.25 m in Salina Verde, and 0.35 m in Salina Preta.
Lake | Time of Sampling | Sample Names |
Preta | Morning (10 AM) | PMB1, PMB2, PMB3 |
Preta | Afternoon (3 PM) | PAB1, PAB2, PAB3 |
Verde | Morning (10 AM) | VMB1, VMB2, VMB3 |
Verde | Afternoon (3 PM) | VAB1, VAB2, VAB3 |
Use case: Taxonomic Classification with OmicsBox.
To study the taxonomic community compositions of both lakes, we use the taxonomic classification functionality in OmicsBox (Metagenomics > Taxonomic Classification), based on the Kraken, with the preprocessed reads as input.
To explore the classification results, we obtain the plain result table, a PDF report, various charts, and plots.
Exercise 1: Exploring the dataset
Data
Preprocessed Reads (not necessary to complete this exercise)
Questions
Download the experimental design and the taxonomic classification results to respond to these questions:
Was the sequencing depth enough to capture all different OTUs?
Does the number of samples give us a good overview of taxonomic diversity?
Are there clear differences in taxonomic compositions between samples? How do the samples separate?
Are these differences in abundances statistically significant?
Solutions