Compare samples in pairs using diversity and number of ASV including shared ASV.

For the moment refseq slot need to be not Null.

Usage

compare_pairs_pq(
  physeq = NULL,
  bifactor = NULL,
  modality = NULL,
  merge_sample_by = NULL,
  nb_min_seq = 0,
  veg_index = "shannon",
  na_remove = TRUE
)

Arguments

physeq: (required): a phyloseq-class object obtained using the phyloseq package.
bifactor: (required) a factor (present in the sam_data slot of the physeq object) presenting the pair names
modality: the name of the column in the sam_data slot of the physeq object to split samples by pairs
merge_sample_by: a vector to determine which samples to merge using the merge_samples2() function. Need to be in physeq@sam_data
nb_min_seq: minimum number of sequences per sample to count the ASV/OTU
veg_index: (default: "shannon") index for the vegan::diversity function
na_remove: (logical, default TRUE) If set to TRUE, remove samples with NA in the variables set in bifactor, modality and merge_sample_by. NA in variables are well managed even if na_remove = FALSE, so na_remove may be useless.

Value

A tibble with information about the number of shared ASV, shared number of sequences and diversity

Examples

data_fungi_low_high <- subset_samples(data_fungi, Height %in% c("Low", "High"))
compare_pairs_pq(data_fungi_low_high, bifactor = "Height", merge_sample_by = "Height")
#> Cleaning suppress 256 taxa and 0 samples.
#> # A tibble: 1 × 13
#>   modality nb_ASV_High nb_ASV_Low nb_shared_ASV div_High div_Low nb_shared_seq
#>   <chr>          <dbl>      <dbl>         <dbl>    <dbl>   <dbl>         <dbl>
#> 1 Height           919        963           718     4.53    4.46        671081
#> # ℹ 6 more variables: percent_shared_seq_High <dbl>,
#> #   percent_shared_seq_Low <dbl>, percent_shared_ASV_High <dbl>,
#> #   percent_shared_ASV_Low <dbl>, ratio_nb_High_Low <dbl>,
#> #   ratio_div_High_Low <dbl>
compare_pairs_pq(data_fungi_low_high,
  bifactor = "Height",
  merge_sample_by = "Height", modality = "Time"
)
#> 11 samples were discarded due to NA in variable modality.
#> Cleaning suppress 293 taxa and 0 samples.
#> # A tibble: 4 × 13
#>   modality nb_ASV_High nb_ASV_Low nb_shared_ASV div_High div_Low nb_shared_seq
#>   <chr>          <dbl>      <dbl>         <dbl>    <dbl>   <dbl>         <dbl>
#> 1 0                324        383           188     3.44    3.11         92766
#> 2 5                488        444           224     3.86    3.73        138450
#> 3 10               239        359           138     2.85    3.35         67475
#> 4 15               365        490           217     3.03    3.46        195794
#> # ℹ 6 more variables: percent_shared_seq_High <dbl>,
#> #   percent_shared_seq_Low <dbl>, percent_shared_ASV_High <dbl>,
#> #   percent_shared_ASV_Low <dbl>, ratio_nb_High_Low <dbl>,
#> #   ratio_div_High_Low <dbl>