Harsh Malviya

Graduate Data Scientist

Skilled in Data Analysis and Digital Transformation

About Me

Analytical and detail-oriented data science graduate student with 1.5+ years of experience in data analysis, visualization, and machine learning. Skilled in Python, Power BI, and SQL, with a strong track record of developing data-driven solutions, automating workflows, and driving operational efficiency.

Passionate about leveraging advanced technologies like Generative AI and digital twins to transform manufacturing processes and optimize decision-making.

20% efficiency improvement through dashboard creation
15% accuracy improvement in data validation
Recognition for actionable trade data insights

Technical Skills

Programming

Python C/C++ R MySQL MATLAB SAS SQL

Data Tools

Power BI Tableau Qlik Excel Hadoop

AI/ML

Generative AI Predictive Modeling Statistical Analysis Model Evaluation Machine Learning

Testing & Automation

Selenium IDE Workflow Automation Debugging System Administration Test Case Development

Process Improvement

Agile Methods Workflow Optimization Data Cleaning ETL Processes

Soft Skills

Analytical Thinking Problem-Solving Collaboration Communication Time Management

Featured Projects

World Trade Data Report

Applied small-world network theories and machine learning to analyze 30 years of trade data, revealing actionable insights.

Python Machine Learning Network Analysis Data Visualization

Supercomputer Based Spatial Analytics

Evaluated spatial datasets using parallel computing techniques for insights in urban development on the Carnie supercomputer.

Python Parallel Computing Spatial Analysis HPC

Predictive Analytics Dashboard

Used R and Python to model trade data spanning 30 years, uncovering actionable global trends.

R Python Predictive Modeling Dashboard Development

Earthquake Dataset Analysis

Evaluated machine learning models for earthquake prediction, providing insights into seismic hotspots and risk assessment strategies.

Python Machine Learning Geospatial Analysis Risk Assessment

Drawing Web Application

Built a web-based drawing application and automated test cases using Selenium IDE to ensure functionality and performance.

JavaScript Selenium IDE Web Development Automated Testing

Cyber Attack Analysis

Analyzed recent cyber-attacks using data mining algorithms, uncovering business impacts and risk mitigation strategies.

Python Data Mining Cybersecurity Risk Analysis

Research & Analysis

My research focuses on leveraging advanced data science techniques to solve real-world problems. Below is an interactive visualization from my earthquake dataset analysis project, demonstrating my expertise in geospatial data visualization and machine learning applications.

Link to the Research Paper: Earthquake Data Analysis

Earthquake Data Heatmap Analysis

Interactive visualization of global earthquake patterns using machine learning models for seismic hotspot identification and risk assessment.

Key Insights:

  • Identified high-risk seismic zones using clustering algorithms
  • Applied machine learning models for earthquake prediction
  • Developed risk assessment strategies for vulnerable regions
  • Utilized Python and geospatial analysis libraries

Work Experience

AI Engineer Intern

Gogentic AI Texas, USA June 2025 – Present
  • Designed Retrieval-Augmented Generation (RAG) pipelines (Python, LLM's, SQL Server, PostgreSQL) for enhanced analytics of Neurovault datasets
  • Developed digital companion tools for AI-powered meeting summaries with real-time voice-to-text and privacy-aware analytics
  • Integrated Oracle data and automated model workflows to boost intelligence and decision-making

Web Application Architect

Studium Span Madhya Pradesh, India Jul 2023 – Jul 2024
  • Designed and developed SQL-based reports and interactive dashboards using Power BI and Tableau, enhancing operational efficiency by 20%
  • Automated data scrubbing and validation processes, improving system accuracy by 15%
  • Collaborated with cross-functional teams to troubleshoot technical problems and ensure seamless workflows

Web Application Architect Intern

Studium Span Madhya Pradesh, India Jan 2023 – Jun 2023
  • Implemented data integration solutions and built predictive analytics tools using Python and SQL
  • Utilized Power BI to visualize complex data, driving actionable insights for business decisions
  • Authored detailed process documentation to align with data governance standards

Education

Master of Science, Data Science

University of Massachusetts, Dartmouth 2024 – Present GPA: 3.05

Relevant Coursework:

High-Performance Scientific Computing Advanced Mathematical Statistics Small World Networks Advanced Data Mining Software Testing and Automation Business Intelligence and Data Mining

Bachelor of Technology, Computer Science Engineering

Rajiv Gandhi Proudyogiki Vishwavidyalaya, Bhopal, M.P., India 2019 – 2023 GPA: 7.91

Relevant Coursework:

Cloud Computing Database Management Systems Data Mining and Warehousing Machine Learning Computer Networks Analysis Design of Algorithm Object Oriented Programming Internet of Things

Get In Touch

Location

285 Old Westport Rd
Dartmouth, MA, 02747