Phases of Project : (Modules)
Finding Clusters of transactions using parallel k-means(MapReduce)
Finding Frequent item set of size k using Apriori algorithm(Breadth first approach).
Finding transaction ID list
Find out prefix groups. Vertical Database
Finding frequent itemsets using Eclat algorithm by mining subtree of prefix tree.
Hi,
Myself Ph.D. in advanced analytics having 10+ years of experience in developing and delivering analytical projects using open source R & Hadoop (including in-memory computing), and can deliver your requirement with R or Scala, accompanied by a step-by-step word document, what each line or code means.
I have expertise in Natural language processing, Data mining, Machine learning, R programming, Python, Hadoop, Mapreduce, Hbase, Hive,Image processing, Computer vision etc.