dbscan - Density-Based Spatial Clustering of Applications with Noise (DBSCAN) and Related Algorithms
A fast reimplementation of several density-based algorithms of the DBSCAN family. Includes the clustering algorithms DBSCAN (density-based spatial clustering of applications with noise) and HDBSCAN (hierarchical DBSCAN), the ordering algorithm OPTICS (ordering points to identify the clustering structure), shared nearest neighbor clustering, and the outlier detection algorithms LOF (local outlier factor) and GLOSH (global-local outlier score from hierarchies). The implementations use the kd-tree data structure (from library ANN) for faster k-nearest neighbor search. An R interface to fast kNN and fixed-radius NN search is also provided. Hahsler, Piekenbrock and Doran (2019) <doi:10.18637/jss.v091.i01>.
Last updated 30 days ago
clustering, dbscan, density-based-clustering, hdbscan, lof, optics
294 stars 8.21 score 1 dependency 79 dependents

TSP - Traveling Salesperson Problem (TSP)
Basic infrastructure and some algorithms for the traveling salesperson problem (also traveling salesman problem; TSP). The package provides some simple algorithms and an interface to the Concorde TSP solver and its implementation of the Chained-Lin-Kernighan heuristic. The code for Concorde itself is not included in the package and has to be obtained separately. Hahsler and Hornik (2007) <doi:10.18637/jss.v023.i02>.
Last updated 3 months ago
concorde-tsp-solver, tsp
61 stars 7.25 score 3 dependencies 93 dependents

arules - Mining Association Rules and Frequent Itemsets
Provides the infrastructure for representing, manipulating and analyzing transaction data and patterns (frequent itemsets and association rules). Also provides C implementations of the association mining algorithms Apriori and Eclat. Hahsler, Gruen and Hornik (2005) <doi:10.18637/jss.v014.i15>.
Last updated 1 month ago
arules, association-rules, frequent-itemsets
193 stars 6.95 score 3 dependencies 27 dependents

seriation - Infrastructure for Ordering Objects Using Seriation
Infrastructure for ordering objects with an implementation of several seriation/sequencing/ordination techniques to reorder matrices, dissimilarity matrices, and dendrograms. Also provides (optimally) reordered heatmaps, color images and clustering visualizations like dissimilarity plots, and visual assessment of cluster tendency plots (VAT and iVAT). Hahsler et al (2008) <doi:10.18637/jss.v025.i03>.
Last updated 2 months ago
combinatorial-optimization, ordination, seriation
73 stars 6.63 score 17 dependencies 72 dependents

qap - Heuristics for the Quadratic Assignment Problem (QAP)
Implements heuristics for the Quadratic Assignment Problem (QAP). Although the QAP was introduced as a combinatorial optimization problem for the facility location problem in operations research, it also has many applications in data analysis. The problem is NP-hard and the package implements a simulated annealing heuristic.
Last updated 3 months ago
combinatorial-optimization, heuristic, qap, quadratic-assignment-problem
4 stars 5.77 score 0 dependencies 74 dependents

recommenderlab - Lab for Developing and Testing Recommender Algorithms
Provides a research infrastructure to develop and evaluate collaborative filtering recommender algorithms. This includes a sparse representation for user-item matrices, many popular algorithms, top-N recommendations, and cross-validation. Hahsler (2022) <doi:10.48550/arXiv.2205.12371>.
Last updated 4 months ago
collaborative-filtering, recommender-system
211 stars 5.65 score 12 dependencies 2 dependents

rBLAST - R Interface for the Basic Local Alignment Search Tool
Seamlessly interfaces the Basic Local Alignment Search Tool (BLAST) to search genetic sequence databases. This work was partially supported by grant no. R21HG005912 from the National Human Genome Research Institute.
Last updated 3 months ago
bioconductor-package, bioconductor, bioinformatics, blast-search
102 stars 4.31 score 50 dependencies

arulesViz - Visualizing Association Rules and Frequent Itemsets
Extends package 'arules' with various visualization techniques for association rules and itemsets. The package also includes several interactive visualizations for rule exploration. Michael Hahsler (2017) <doi:10.32614/RJ-2017-047>.
Last updated 3 months ago
arules, association-rules, frequent-itemsets, interactive-visualizations, visualization
53 stars 4.21 score 103 dependencies 2 dependents

stream - Infrastructure for Data Stream Mining
A framework for data stream modeling and associated data mining tasks such as clustering and classification. The development of this package was supported in part by NSF IIS-0948893, NSF CMMI 1728612, and NIH R21HG005912. Hahsler et al (2017) <doi:10.18637/jss.v076.i14>.
Last updated 1 month ago
data-stream-clustering, datastream, stream-mining
37 stars 3.55 score 24 dependencies 3 dependents

pomdp - Infrastructure for Partially Observable Markov Decision Processes (POMDP)
Provides the infrastructure to define and analyze the solutions of Partially Observable Markov Decision Process (POMDP) models. Interfaces for various exact and approximate solution algorithms are available including value iteration, point-based value iteration and SARSOP. Smallwood and Sondik (1973) <doi:10.1287/opre.21.5.1071>.
Last updated 2 months ago
control-theory, markov-decision-processes, optimization
14 stars 2.52 score 19 dependencies

streamMOA - Interface for MOA Stream Clustering Algorithms
Interface for data stream clustering algorithms implemented in the MOA (Massive Online Analysis) framework (Albert Bifet, Geoff Holmes, Richard Kirkby, Bernhard Pfahringer (2010). MOA: Massive Online Analysis, Journal of Machine Learning Research 11: 1601-1604).
Last updated 3 months ago
clustering, datamining, datastream
12 stars 1.78 score 26 dependencies

arulesCBA - Classification Based on Association Rules
Provides the infrastructure for association rule-based classification including the algorithms CBA, CMAR, CPAR, C4.5, FOIL, PART, PRM, RCAR, and RIPPER to build associative classifiers. Hahsler et al (2019) <doi:10.32614/RJ-2019-048>.
Last updated 2 months ago
association-rules, classification
2 stars 1.68 score 13 dependencies 1 dependent

rRDP - Interface to the RDP Classifier
This package installs and interfaces the naive Bayesian classifier for 16S rRNA sequences developed by the Ribosomal Database Project (RDP). The package can use the classifier trained with the standard training set or train a custom classifier.
Last updated 3 months ago
bioconductor-package, bioconductor, bioinformatics, classification
1 star 1.64 score 19 dependencies

rRDP - Interface to the RDP Classifier
This package installs and interfaces the naive Bayesian classifier for 16S rRNA sequences developed by the Ribosomal Database Project (RDP). The package can use the classifier trained with the standard training set or train a custom classifier.
Last updated 3 months ago
bioconductor-package
1.45 score 19 dependencies

pomdpSolve - Interface to 'pomdp-solve' for Partially Observable Markov Decision Processes
Installs an updated version of 'pomdp-solve' and provides a low-level interface. Pomdp-solve is a program to solve Partially Observable Markov Decision Processes (POMDPs) using a variety of exact and approximate value iteration algorithms. A convenient R infrastructure is provided in the separate package pomdp. Kaelbling, Littman and Cassandra (1998) <doi:10.1016/S0004-3702(98)00023-X>.
Last updated 6 months ago
control-theory, markov-decision-processes, optimization
1 star 1.21 score 0 dependencies 1 dependent

streamConnect - Connecting Stream Mining Components Using Sockets and Web Services
Adds functionality to connect stream mining components from package stream using sockets and Web services. The package can be used to create distributed workflows and plumber-based Web services that can be deployed on most common cloud services.
Last updated 1 month ago
2 stars 1.19 score 70 dependencies

arulesNBMiner - Mining NB-Frequent Itemsets and NB-Precise Rules
NBMiner is an implementation of the model-based mining algorithm for mining NB-frequent itemsets and NB-precise rules. Michael Hahsler (2006) <doi:10.1007/s10618-005-0026-2>.
Last updated 2 years ago
association-rules
6 stars 1.19 score 5 dependencies

rMSA - Interface for Popular Multiple Sequence Alignment Tools
Seamlessly interfaces the Multiple Sequence Alignment software packages ClustalW, MAFFT, MUSCLE and Kalign (downloaded separately) and provides support to calculate distances between sequences. This work was partially supported by grant no. R21HG005912 from the National Human Genome Research Institute.
Last updated 2 months ago
bioinformatics, sequence-alignment
9 stars 1.16 score 26 dependencies

rBLAST - R Interface for the Basic Local Alignment Search Tool
Seamlessly interfaces the Basic Local Alignment Search Tool (BLAST) to search genetic sequence databases. This work was partially supported by grant no. R21HG005912 from the National Human Genome Research Institute.
Last updated 3 months ago
bioconductor-package
1.16 score 50 dependencies

rEMM - Extensible Markov Model for Modelling Temporal Relationships Between Clusters
Implements TRACDS (Temporal Relationships between Clusters for Data Streams), a generalization of Extensible Markov Model (EMM). TRACDS adds a temporal or order model to data stream clustering by superimposing a dynamically adapting Markov Chain. Also provides an implementation of EMM (TRACDS on top of tNN data stream clustering). Development of this package was supported in part by NSF IIS-0948893 and R21HG005912 from the National Human Genome Research Institute. Hahsler and Dunham (2010) <doi:10.18637/jss.v035.i05>.
Last updated 3 months ago
clustering, data-stream, sequence-analysis
1 star 1.14 score 34 dependencies

arulesSequences - Mining Frequent Sequences
Add-on for arules to handle and mine frequent sequences. Provides interfaces to the C++ implementation of cSPADE by Mohammed J. Zaki.
Last updated 1 year ago
11 stars 1.08 score 4 dependencies

cba - Clustering for Business Analytics
Implements clustering techniques such as Proximus and Rock, as well as utility functions for efficient computation of cross distances and data manipulation.
Last updated 2 months ago
1.00 score 1 dependency 3 dependents

markovDP - Infrastructure for Discrete-Time Markov Decision Processes (MDP)
The package provides the infrastructure to work with MDPs in R. The focus is on convenience in formulating MDPs, the support of sparse representations (using sparse matrices, lists and data.frames) and visualization of results. Some key components are implemented in C++ to speed up computation. It also implements several popular solvers.
Last updated 2 months ago
control-theory, markov-decision-process, optimization
5 stars 0.82 score 16 dependencies

recommenderlabJester - Jester Dataset for 'recommenderlab'
Provides the Jester Dataset for package recommenderlab.
Last updated 2 years ago
recommender-systems
0.62 score 13 dependencies

recommenderlabBX - Book-Crossing Dataset (BX) for 'recommenderlab'
Provides the Book-Crossing Dataset for the package recommenderlab.
Last updated 2 years ago
recommender-systems
0.62 score 13 dependencies