Introduction: Evolution and Importance of Data Mining-Types of Data and Patterns Mined-Technologies-Applications-Major Issues in Data Mining. Knowing about Data-Data Preprocessing: Cleaning– Integration–Reduction–PCA, Data Transformation and Discretization. Mining Frequent Patterns: Basic Concept – Frequent Item Set Mining Methods – Mining Association Rules – Association to Correlation Analysis.
Classification and Prediction: Issues – Decision Tree Induction – Bayesian Classification – Rule Based Classification – k-Nearest-Neighbor Classification – Linear SVM – Regression – Linear, Logistic – Accuracy and Error measures –Introduction to Ensemble methods
Clustering: Overview of Clustering – Types of Data in Cluster Analysis – Major Clustering Methods-Partitioning Methods- k-Means, k-Medoids. Hierarchical Methods-Agglomerative and Divisive hierarchical clustering. Density-Based Methods-DBSCAN, Graph-based clustering (CHAMELEON), Evaluation in Clustering
Mining Data Streams- Mining Time-Series Data- Mining Sequence Patterns in Biological Data- Graph Mining – Social network Analysis – Text Mining – Mining the World Wide Web, Applications and Trends in Data Mining Tools :Implementation of Data mining algorithms using Latest Open Source Data mining Tools.Tensorflow, python, R
Network security: At application layer – Email, PGP, S/MIME. At transport layer – SSL architecture, handshake protocol, changecipherspec protocol, Alert protocol, Record protocol, SSL message format, Transport layer security. At network layer – modes, security protocols, security associations, security policy, Internet key exchange, ISAKMP.