This function computes intra-taxnames distances for each taxonomic name (e.g. Genus species) in a phyloseq object containing ASV/OTU sequences and taxonomy.

The distances are computed using the DECIPHER package, which aligns the sequences (`DECIPHER::AlignSeqs()`) and calculates a distance matrix (`DECIPHER::DistanceMatrix()`).
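The align-then-measure step for a single taxonomic name can be sketched as follows. This is only an illustration under stated assumptions, not the function's actual source: the three sequences are made up, and summarising the upper triangle of the distance matrix is an assumption about how the per-name statistics are derived. The block is guarded so it only runs when DECIPHER and Biostrings are installed.

```r
# Minimal sketch (assumption: not the package's internal code).
# Runs only if the Bioconductor packages DECIPHER and Biostrings are available.
if (requireNamespace("DECIPHER", quietly = TRUE) &&
    requireNamespace("Biostrings", quietly = TRUE)) {
  # Made-up sequences standing in for ASVs assigned to one taxonomic name
  seqs <- Biostrings::DNAStringSet(c(
    ASV1 = "ACGTACGTTAGCTAGCTAGGCTAACGATCGATCGTAGCTA",
    ASV2 = "ACGTACGTTAGCTAGCTAGGCTAACGATCGATCGTAGCTT",
    ASV3 = "ACGTACGATAGCTAGCTAGGCTAACGTTCGATCGTAGCTA"
  ))

  # Step 1: multiple sequence alignment
  aligned <- DECIPHER::AlignSeqs(seqs, verbose = FALSE)

  # Step 2: symmetric matrix of pairwise distances between aligned sequences
  dist_mat <- DECIPHER::DistanceMatrix(aligned, verbose = FALSE)

  # Assumed summary step: keep each pair once (upper triangle, no diagonal)
  pairwise <- dist_mat[upper.tri(dist_mat)]
  print(c(mean = mean(pairwise), min = min(pairwise), max = max(pairwise)))
}
```

Arguments passed through `...` would reach the `DECIPHER::AlignSeqs()` call in the real function; the sketch uses the defaults.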

Usage

intra_taxnames_dist(
  physeq,
  taxonomic_rank = c("Genus", "Species"),
  verbose = TRUE,
  verbose_DECIPHER = FALSE,
  discard_NA = TRUE,
  ...
)

Arguments

physeq

A phyloseq object with ASV/OTU sequences stored in its `refseq` slot and a taxonomy table

taxonomic_rank

Character. Name of the taxonomy column(s) containing taxonomic assignments to compute intra-taxa distances. Can be a vector of two columns (e.g. c("Genus", "Species"), the default).

verbose

Logical. Print progress messages (default: TRUE)

verbose_DECIPHER

Logical. If TRUE, print messages from DECIPHER functions (default: FALSE)

...

Additional arguments to pass to `DECIPHER::AlignSeqs()`

Value

A data.frame with columns:

- taxnames: taxonomic names
- n_taxa: number of taxa assigned to this taxonomic name
- mean_dist: mean intra-taxnames distance
- min_dist: minimum intra-taxnames distance
- max_dist: maximum intra-taxnames distance
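To make these columns concrete, here is a minimal base-R sketch of how one row of such a table could be derived from a pairwise distance matrix. The helper name `summarise_pairwise()` and the toy matrix values are hypothetical, and using the upper triangle so that each pair is counted once is an assumption rather than a description of the package's code.

```r
# Hypothetical helper: summarise the unique pairwise distances
# of the taxa assigned to one taxonomic name.
summarise_pairwise <- function(dist_mat, taxname) {
  # Upper triangle (diagonal excluded) keeps each pair exactly once
  d <- dist_mat[upper.tri(dist_mat)]
  data.frame(
    taxnames = taxname,
    n_taxa = nrow(dist_mat),
    mean_dist = mean(d),
    min_dist = min(d),
    max_dist = max(d)
  )
}

# Toy symmetric distance matrix for three ASVs (made-up values)
toy <- matrix(
  c(0,    0.01, 0.03,
    0.01, 0,    0.02,
    0.03, 0.02, 0),
  nrow = 3, byrow = TRUE,
  dimnames = list(paste0("ASV", 1:3), paste0("ASV", 1:3))
)

res <- summarise_pairwise(toy, "Genus species")
print(res)
```

With these toy values, the three unique pairs are 0.01, 0.03 and 0.02, so the mean is 0.02, the minimum 0.01 and the maximum 0.03.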

See also

`DECIPHER::AlignSeqs()`, `DECIPHER::DistanceMatrix()`

Author

Adrien Taudiere

Examples

intra_taxn_dist <- intra_taxnames_dist(data_fungi_mini)
#>  Processing Stereum ostrea - 3 taxa
#>  Processing Xylodon raduloides - 3 taxa
#>  Processing Ossicaulis lachnopus - 4 taxa
#>   Stereum hirsutum is represented by only one taxa
#>   Antrodiella brasiliensis is represented by only one taxa
#>   Basidiodendron eyrei is represented by only one taxa
#>   Sistotrema oblongisporum is represented by only one taxa
#>  Processing Fomes fomentarius - 4 taxa
#>   Mycena renati is represented by only one taxa
#>   Helicogloea pellucida is represented by only one taxa
#>   Radulomyces molaris is represented by only one taxa
#>   Elmerina caryae is represented by only one taxa
#>   Phanerochaete livescens is represented by only one taxa
#>   Gloeohypochnicium analogum is represented by only one taxa
#>   Hyphoderma roseocremeum is represented by only one taxa
#>   Hyphoderma setigerum is represented by only one taxa
#>   Trametes versicolor is represented by only one taxa
#>   Peniophora versiformis is represented by only one taxa
#>  Processing Exidia glandulosa - 2 taxa
#>   Peniophorella pubera is represented by only one taxa
#>   Auricularia mesenterica is represented by only one taxa
#>   Marchandiomyces buckii is represented by only one taxa
#>   Hericium coralloides is represented by only one taxa
#>   Xylodon flaviporus is represented by only one taxa
#>  Intra-taxnames distance computation complete
#>  Total taxnames: 24
#>  Taxnames with only one taxa (no distance computation):  19
#>  Taxnames with multiple taxa: 5
#>  Mean intra-taxnames mean distance: 0.006
#>  Mean intra-taxnames maximum distance: 0.0087
#>  Mean intra-taxnames minimum distance: 0.0035
plot(intra_taxn_dist$mean_dist, intra_taxn_dist$n_taxa)

plot(intra_taxn_dist$min_dist, intra_taxn_dist$n_taxa)

plot(intra_taxn_dist$max_dist, intra_taxn_dist$n_taxa)