Privacy Preserving Data Mining
Data mining is one of the fastest-growing fields in the computer industry. It deals with extracting patterns from large data sets and turning them into human-understandable information. Data mining also plays an important role in many business, financial, educational, and health organizations, where revealing sensitive information can cause serious harm. From the point of view of the organization, mining is helpful...
Latest PhD topics in computer science
The latest Ph.D. thesis topics in computer science are all about the practical knowledge you have gained in your B.Tech and M.Tech. Selecting a good dissertation topic is important because it offers a strong foundation on which to build the rest of the work. A weak topic will inevitably result in a weak dissertation, something that you would like to avoid happening in the...
Association Rule Mining
Association rules are one of the major techniques of data mining. Association rule mining finds frequent patterns, associations, correlations, or informal structures among sets of items or objects in transactional databases and other information repositories. It is one of the most important data mining tasks and aims at finding interesting associations and correlation relationships among large sets of data items. A typical example of association rule mining is...
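As a rough illustration of what "frequent patterns" means here, the sketch below computes two measures commonly used in association rule mining, support and confidence, over a tiny set of transactions. The transactions, item names, and function names are hypothetical and chosen only for this example; they are not taken from the article.

```python
from itertools import combinations

# Hypothetical toy transactions: each set is one basket of items.
transactions = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"bread", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter", "eggs"},
]


def count(itemset, transactions):
    """Number of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)


def support(itemset, transactions):
    """Fraction of transactions that contain the itemset."""
    return count(itemset, transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Of the transactions containing the antecedent, the fraction
    that also contain the consequent."""
    both = count(set(antecedent) | set(consequent), transactions)
    return both / count(antecedent, transactions)


def frequent_pairs(transactions, min_support):
    """All item pairs whose support is at least min_support."""
    items = sorted(set().union(*transactions))
    result = {}
    for pair in combinations(items, 2):
        s = support(pair, transactions)
        if s >= min_support:
            result[pair] = s
    return result
```

With this data, `support({"bread", "milk"}, transactions)` is 3 out of 5 baskets, and `confidence({"bread"}, {"milk"}, transactions)` is 3 out of the 4 baskets that contain bread. Real miners such as Apriori avoid enumerating every candidate itemset; this sketch does not attempt that.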
Sentiment Analysis
Sentiment analysis is also known as opinion mining. It uses Natural Language Processing (NLP), computational fundamentals, and text analysis to recognize and extract subjective information in source materials. It is also called review mining and appraisal extraction. Synonyms of Opinion The basic task of sentiment analysis is to classify the given text on the basis of polarity at the document level, sentence level...
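To make polarity classification concrete, here is a minimal sketch of a lexicon-based approach: count positive and negative words and label the text by the sign of the difference. The word lists and function names are hypothetical and far too small for real use; the article may describe a different method.

```python
import re

# Hypothetical, deliberately tiny opinion lexicons.
POSITIVE = {"good", "great", "excellent", "love", "happy"}
NEGATIVE = {"bad", "poor", "terrible", "hate", "sad"}


def polarity(text):
    """Label text as positive, negative, or neutral by counting lexicon hits."""
    tokens = re.findall(r"[a-z']+", text.lower())
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def sentence_polarities(document):
    """Sentence-level classification: split on ., ! and ? and label each part."""
    sentences = [s.strip() for s in re.split(r"[.!?]+", document) if s.strip()]
    return [(s, polarity(s)) for s in sentences]
```

Calling `polarity` on a whole review gives document-level polarity, while `sentence_polarities` applies the same rule sentence by sentence. The sketch ignores negation ("not good") and sarcasm.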
Data Classification Parameters
Parameters for Data Classification Evaluation The parameters for the evaluation of sentiment analysis include various terms: true positives, true negatives, false negatives, and false positives. These terms are used to compare the class labels that a classifier assigns to documents with the classes the documents actually belong to. True positive terms are truly classified as positive terms. False positive are not labeled...
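The four counts can be tallied directly from a list of actual labels and a list of predicted labels. The sketch below assumes a binary task; the label strings and sample data are hypothetical.

```python
def confusion_counts(actual, predicted, positive="pos"):
    """Tally true/false positives/negatives for a binary classifier.

    actual and predicted are equal-length sequences of labels;
    any label other than `positive` is treated as negative.
    """
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    counts = {"tp": 0, "tn": 0, "fp": 0, "fn": 0}
    for a, p in zip(actual, predicted):
        if p == positive and a == positive:
            counts["tp"] += 1  # predicted positive, actually positive
        elif p == positive:
            counts["fp"] += 1  # predicted positive, actually negative
        elif a == positive:
            counts["fn"] += 1  # predicted negative, actually positive
        else:
            counts["tn"] += 1  # predicted negative, actually negative
    return counts


# Hypothetical labels for six documents.
actual = ["pos", "pos", "neg", "neg", "pos", "neg"]
predicted = ["pos", "neg", "neg", "pos", "pos", "neg"]
counts = confusion_counts(actual, predicted)
```

For this sample the tally is two true positives, two true negatives, one false positive, and one false negative. Measures such as precision (tp / (tp + fp)) and recall (tp / (tp + fn)) are commonly derived from these counts.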
How to prepare dataset in arff and csv format
Machine learning algorithms are primarily designed to work with arrays of numbers. This is called tabular or structured data because it is how data looks in a spreadsheet, composed of rows and columns. Weka has a specific computer-science-centric vocabulary when describing data: Instance: A row of data is called an instance, as in an instance or observation from the problem domain. Attribute: A...
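As a small sketch of the two formats named in the title, the code below writes the same instances once as CSV (header row plus data rows) and once as ARFF (a `@relation` line, one `@attribute` line per column, then `@data`). The relation name, attribute names, and values are hypothetical, and the writer skips details such as quoting values that contain spaces or commas.

```python
import csv
import io

# Hypothetical dataset: (attribute name, type); a list means a nominal attribute.
attributes = [
    ("temperature", "numeric"),
    ("humidity", "numeric"),
    ("play", ["yes", "no"]),
]
# Each inner list is one instance (row).
instances = [
    [85, 85, "no"],
    [80, 90, "no"],
    [83, 86, "yes"],
]


def to_csv(attributes, instances):
    """Return CSV text: attribute names as the header, one instance per line."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow([name for name, _ in attributes])
    writer.writerows(instances)
    return buf.getvalue()


def to_arff(relation, attributes, instances):
    """Return ARFF text with a header section followed by the @data section."""
    lines = [f"@relation {relation}", ""]
    for name, kind in attributes:
        if isinstance(kind, list):
            # Nominal attributes list their allowed values in braces.
            kind = "{" + ",".join(kind) + "}"
        lines.append(f"@attribute {name} {kind}")
    lines += ["", "@data"]
    lines += [",".join(str(v) for v in row) for row in instances]
    return "\n".join(lines) + "\n"
```

Writing `to_arff("weather", attributes, instances)` to a file with an `.arff` extension, or the `to_csv` output to a `.csv` file, should give something Weka can open, assuming the values need no quoting.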
Text Mining
What is Text Mining? The use of computational techniques to extract high-quality information from text, and to extract and discover knowledge hidden in text automatically. KDD definition: “discovery by computer of new previously unknown information, by automatically extracting information from a usually large amount of different unstructured textual resources”. Text mining categories: document categorization (supervised learning); document clustering/organization (unsupervised learning); summarization (keywords, indices, etc.); visualization (word cloud, maps)...
Data Mining Tools
Data mining has a wide range of applications, from marketing and advertising of goods, services, or products to artificial intelligence research, biological sciences, crime investigation, and high-level government intelligence. Because of its widespread use and the complexity involved in building data mining applications, a large number of data mining tools have been developed over the decades. Every tool has its own advantages and disadvantages. Within data mining,...