Anirudh Joshi

Software Engineer-1 at HPE

Hewlett Packard Enterprise

About Me

I am working at HPE as a software engineer, I predominently work with cloud technologies such as Kafka, Docker, Kubernetes and various databases like Arango DB, Clickhouse, etc. I am passionate about Deep Learning and NLP.

Interests

LLMs
Deep Learning
Programming
Software Architecture Design

Education

BTECH in CS with specaliztion in Machine Intelligence & Data Science

PES University
12th Grade

AECS Magnolia

Skills

Python

95%

75%

Deep Learning

95%

Data Analysis

95%

Linux

80%

Experience

Software Engineer-1

Hewlett Packard Enterprise

Aug 2023 – Present Bangalore, India

Tasks:

Working on AI-Ops for Next Generation Aruba Central.
Primarily work on writing new features and debugging existing issues in the pipeline

Skills:

Kafka, Python, Java, Maven, Docker, Kubernetes, Linux, Arango, Cassandra, Clickhouse, Argo CD, Jenkins

Software Engineering Intern

Hewlett Packard Enterprise

Jan 2023– July 2023 Bangalore, India

Tasks:

Delivered a project to detect top talking customer devices in clusters, by monitoring network usage in real time over time window of 5 minutes and identify devices that are increasing usage by anomaly detection and statiscal methods to identify increasing usage over past time windows

Skills:

Kafka, Python, Pandas, Statistics, Scikit-learn

Machine Learning Intern

PESU Venture Lab

June 2022– Jan 2023 Bangalore, India

Tasks:

Built a Journal Recommender System for a start-up
Delivered a Resume Parser to parse User resumes and auto-fill
Built a script that finds the most in-demand job skills required for a given job designation by scraping Glassdoor

Skills:

BERT, Ensemble, Xgboost, NLP, Pytorch, Deep Learning, Tensorflow, Python, Pandas, Scikit-learn

Accomplishments

Calculus for Machine Learning & Data Science

Coursera Oct 2023

See certificate

Introduction to Statistics

Coursera Oct 2023

See certificate

Data Science Math Skills

Coursera Oct 2023

See certificate

Deep Learning Specialization [Bundle of 5 courses]

Coursera Jul 2021

See certificate

Machine Learning

Coursera Jun 2020

See certificate

Using Python To Access Web Data

Coursera Aug 2020

See certificate

Projects

Financial Insights Using LLM

Developed a streamlit application that takes a publicly traded company as input, scrapes financial statements from yahoo finance and latest news on that stock from Google and then passed to GPT 3.5 using the OPEN AI API keys.

Link To Code

Chat With Personal Docs using Mistral 7-B

Use Mistral 7-B locally to chat with personal documents in a totally secure manner for free!

Link To Code

QA Dataset Creation using LLM: Question Bank Generator

Create a QA dataset using GPT-3.5 from any pdf by just uploading to my streamlit application. Caution: Project is still under development, exploring Vector DB's to capture larger docs, right now only possible for small docs.

Link To Code

AutoML Website: Athena AI

Developed a webiste for AutoML, where users can upload their dataset and our solution takes care of the rest! The framework automates Data Cleaning, Model Training and provides users with performance metrics for each model so they can choose a model and download it. Site under development but kept private for adding more features, demo will be made available shortly but wont be launching site for a while

Image Captioning

Image Captioning using Inception V3 for feature extraction from images and then used Beam Search - a heuristic based search algorithm to predict the most likely captions.

Link To Code

Resume Parser & Multi-label Classifier

Built a multi-label classifier capable of classifying input resumes into 10 IT roles using BERT and Distilled BERT, achieved a mean accuracy of 96% on each label.

Skills:

Python, Tensorflow, BERT, Distill BERT, NER(Named Entity Recognition), NLP, Deep Learning

Link To Code

Toxic Comments Detection

Built a LSTM+CNN model to detect toxic comments on the Toxic Comments Detection Challenge on Kaggle made use of Wiki-News pretrained word-embeddings and Talos for hyperparameter optimization of the model.

Skills:

Python, Stemming, Lemmatization, Spacy, NLP, LSTM, CNN, Tensorflow, Keras

Link To Code

Stock Price Prediction

Predicting stock prices using Time-Series Forecasting methods and ML models for Reliance stock. The data was taken from NSE(National Stock Exchange) API Library.

Skills:

Python, Data Visualization
Time series forecasting techniques like: SMA (Simple Moving Average), EMA (Exponential Moving Average), MOM (Momentum), RSI (Relative Strength Index), MACD (Moving Average Convergence Divergence)
ML Algorithms like: Linear Regression, Decision Trees, RandomForest, XGBoost

Link To Code