Experience in Statistical Machine Learning, Data Mining solutions to various business problems and generating data visualizations using R, Python and Tableau.
Expertise in transforming business requirements into analytical models, designing algorithms, building models, developing data mining and reporting solutions that scales across massive volume of structured and unstructured data.
Equipped with experience in utilizing statistical techniques which include Correlation, Hypotheses modelling, Inferential Statistics as well as data mining and modelling techniques using Linear and Logistic regression, clustering, decision trees, and k-mean clustering
Expertise in implementing scalable Statistical & Predictive Decision Science Modelsusing Machine Learning platforms like R & Python Data Science Packages (Scikit-Learn, Pandas, NumPy, SparkR & Spark MLib).
Expertise in building Supervised and Unsupervised Machine Learning experiments using cloud , utilizing multiple algorithms to perform detailed predictive analytics and building Web Services models for all types of data: continuous, nominal, and ordinal.
Knowledge in Google Cloud Platform- Preferred or any other cloud platform like Azure or AWS
In depth knowledge in GCP DataFlow, GCP Dataproc, Data ingestion with GCP Pub/Sub
Knowledge in Visualization tools that have Big Query connectors – Tableau, Google Data studio
Not all of them is mandatory from skillset
Programming & Scripting:
R Programming & R Studio
Azure or Amazon or Google Machine Learning
HDFS, Hive, Spark, Kafka
Microsoft Visual Studio.Net 2010, SQL Server 2008
Techniques & Algorithms:
Data Mining & Cleaning