Machine Learning


Feature Selection

Feature Selection in Data Mining

In machine learning and statistics, feature selection, also known as variable selection, is the process of choosing a subset of relevant features for use in model construction. The central premise behind using a feature selection technique is that the data contain a number of attributes, not all of which are relevant. A feature selection algorithm can be seen as the combination of a search procedure for proposing new attribute subsets, along with...
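The search idea in the excerpt can be sketched as a greedy forward search. This is only an illustration under stated assumptions: the excerpt is cut off before it names how subsets are evaluated, so the `score` callback, the `toy_score` function, and the `usefulness` values below are all hypothetical stand-ins rather than the post's actual method.

```python
def forward_selection(features, score, max_features=None):
    """Greedy forward search over feature subsets.

    `score` is a caller-supplied function (an assumption of this sketch)
    that maps a list of feature names to a number; higher is better.
    """
    selected = []
    remaining = list(features)
    best_score = score(selected)
    limit = len(remaining) if max_features is None else max_features

    while remaining and len(selected) < limit:
        # Propose candidate subsets: the current subset plus one more feature.
        candidates = [(score(selected + [f]), f) for f in remaining]
        top_score, top_feature = max(candidates, key=lambda pair: pair[0])

        # Stop as soon as no candidate improves on the current subset.
        if top_score <= best_score:
            break

        selected.append(top_feature)
        remaining.remove(top_feature)
        best_score = top_score

    return selected, best_score


# Hypothetical per-feature usefulness, purely for illustration.
usefulness = {"age": 0.30, "income": 0.25, "zip_code": 0.02, "row_id": 0.0}


def toy_score(subset):
    # Reward useful features and charge a small cost per feature kept.
    return sum(usefulness[f] for f in subset) - 0.05 * len(subset)


chosen, chosen_score = forward_selection(list(usefulness), toy_score)
```

In practice the score would usually come from evaluating a model on held-out data; the toy function only keeps the example self-contained.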


Semi Supervised Learning Algorithms

Semi-Supervised Learning Models

Semi-supervised learning is a category of machine learning approaches designed to make use of both labeled and unlabeled data for training, typically a small amount of labeled data together with a large amount of unlabeled data. Semi-supervised learning falls between unsupervised and supervised learning. This approach can be used for traffic identification or classification, which makes it a candidate for traffic classification methods. It relies on some prior information to order...


Bagging and Boosting

Ensemble Learning approach in Data Mining

In day-to-day life, when a crucial decision has to be made in a meeting and the members' opinions conflict, a vote is held among the members present. This principle of “voting” can also be applied to data mining. In the voting scheme, when classifiers are combined, the class assigned to a test instance will be the one...
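A minimal sketch of the voting idea follows. The excerpt is cut off before it states the exact rule, so this assumes plain majority voting; the function name `majority_vote` and the sample labels are illustrative only.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the class label predicted by the most classifiers.

    `predictions` holds one label per classifier for a single test
    instance. Ties go to the label that appears first in the list.
    """
    if not predictions:
        raise ValueError("need at least one prediction")
    counts = Counter(predictions)
    return counts.most_common(1)[0][0]


# Three hypothetical classifiers vote on one test instance.
votes = ["spam", "ham", "spam"]
winner = majority_vote(votes)
```

Bagging and boosting differ in how the individual classifiers are trained and weighted, but both end with a combination step of this general kind.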


KMeans Clustering with Example

Clustering is the process of grouping abstract objects into classes of similar objects, so that similarity within a cluster is high and similarity between clusters is low. A cluster of data objects can be treated as one group. In cluster analysis, we first partition the data set into groups based on data similarity and then assign labels to the...
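The partition-then-label idea can be sketched with a small k-means loop (Lloyd's algorithm) in plain Python. This is an illustration, not the post's own example: the function names, the squared Euclidean distance, the random initialisation, and the sample points are all assumptions made for this sketch.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def nearest_index(point, centroids):
    """Index of the centroid closest to `point`."""
    return min(range(len(centroids)), key=lambda i: sq_dist(point, centroids[i]))


def kmeans(points, k, iterations=100, seed=0):
    """Partition `points` into `k` clusters; return (centroids, labels)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)  # start from k distinct data points

    for _ in range(iterations):
        # Assignment step: put each point in the cluster of its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            clusters[nearest_index(p, centroids)].append(p)

        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        new_centroids = [
            mean_point(c) if c else centroids[i] for i, c in enumerate(clusters)
        ]
        if new_centroids == centroids:
            break  # assignments can no longer change
        centroids = new_centroids

    labels = [nearest_index(p, centroids) for p in points]
    return centroids, labels


# Two visibly separate groups of 2-D points (made-up data).
data = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
        (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
centroids, labels = kmeans(data, k=2)
```

The labels are arbitrary cluster indices, so which group is called 0 and which is called 1 depends on the random initialisation.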


PPDM Techniques

Privacy Preserving Data Mining (PPDM) Techniques

Based on the five dimensions explained in the previous post, PPDM techniques can be grouped into the following categories. PPDM is divided into two parts, centralized and distributed, which are further categorized into five techniques. 1. Anonymization-based: Anonymization is a technique in which the record owner's identity or sensitive data remains hidden. In a table, the most basic form of data consists of four types...


Privacy Preserving Data Mining

Data mining is one of the fastest-growing fields in the computer industry and deals with extracting patterns from large data sets. It is used to extract human-understandable information. Moreover, data mining plays an important role in many business, financial, educational, and health organizations, where revealing sensitive information can cause serious harm. From the organization's point of view, mining is helpful...


Topics in Machine Learning Research

Data mining is an interdisciplinary subfield of computer science. It is the computational process of discovering patterns in large data sets, involving methods at the intersection of artificial intelligence, machine learning, statistics, and database systems. The overall goal of the data mining process is to extract information from a data set and transform it into an understandable structure for further use. IEEE 2016-2017 DATA...
