8 QC & preprocessing
As a first step after importing the data into TreeSE
, one should explore the data and perform quality control (QC). This is important because data quality affects the final results, and failing to assess it accurately can lead to erroneous interpretations. QC and exploration are discussed in Chapter 9.
Based on the QC results, researchers usually apply sample and feature filtering to improve the robustness of the analysis. To focus on a specific taxonomic rank, data agglomeration is commonly performed. Filtering and agglomeration are discussed in detail in Chapter 10 and Chapter 11.
Data transformations, covered in Chapter 12, are applied after filtering. For more information on preprocessing, you can refer to (Zhou et al. 2023), for instance.