For a more detailed explanation for this package, this document
will keep update for better understanding the source code. You can also
try the playground I
build to get familier with the algorithm
I’ve written some articles
on Medium, which you can find here to get familiar with
topological data analysis. I’ll be continuously updating my work, and I
welcome any feedback!
This package is based on the
TDAmapper
package by Paul Pearson. You can view the original package here. Since the original package hasn’t been updated in over seven years, this version is focused on optimization. By incorporating vector computation into the Mapper algorithm, this package aims to significantly improve its performance.
Step visualize from
Skaf et al.
Mapper is basically a three-step process:
1. Cover: This step splits the data into overlapping intervals and creates a cover for the data.
2. Cluster: This step clusters the data points in each interval the cover creates.
3. Simplicial Complex: This step combines the two steps above, which connects the data points in the cover to create a simplicial complex.
you can know more about the basic here: Chazal, F., & Michel, B. (2021). An introduction to topological data analysis: fundamental and practical aspects for data scientists. Frontiers in artificial intelligence, 4, 667963.
Besides to the steps above, you can find the following code in the package:
Main Goals 1. Computational
Optimization: The current version speeds up computations by 100
times compare to the original code, and could be faster by using
num_cores
.
<- MapperAlgo(
Mapper filter_values = circle_data[,2:3],
intervals = 4,
percent_overlap = 30,
methods = "dbscan",
method_params = list(eps = 0.3, minPts = 5),
cover_type = 'extension',
num_cores = 12
)MapperPlotter(Mapper, circle_data$circle, circle_data, type = "forceNetwork")
![]() Figure 1 |
![]() Figure 2 |
![]() Figure 3 |
![]() Figure 4 |