LOESS normalization
modelling_norm.Rd
Normalization data by machine learning modelling, eg. locally estimated scatterplot smoothing (LOESS) on QC samples in each batch. For each metabolite, the values (eg. raw peak area data) were divided by the median value of QC samples in that batch. QC samples and metabolite batches should be specified (see parameters below).
Usage
modelling_norm(
object,
method = c("LOESS", "KNN", "XGBoost"),
feature_platform = "PLATFORM",
QC_ID_pattern = "MTRX",
span = 0.75,
degree = 2,
k = 3,
test = FALSE,
verbose = TRUE
)
Arguments
- object
A Metabolite object. In the feature annotation slot `feature`, a platform column should be provided for metabolite measurement platform (eg. `PLATFORM`). The values in the `PLATFORM` column (eg. `Neg`, `Polar`, `Pos Early`, and `Pos Late`) are column names in the sample annotation `sample` to determine the batches of samples.
- method
Modelling method for the normalization, currently support LOESS and KNN.
- feature_platform
The column name of feature platform for metabolite measurements (eg. `PLATFORM`).
- QC_ID_pattern
A character pattern to determine QC samples. Default value: "MTRX".
- span
default value 0.4
- degree
default value 2
- k
Number of neighbors in KNN modelling (default value 3)
- test
test the function for the first 20 columns.
- verbose
print log information.