Skip to content
View Nikelroid's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report Nikelroid

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Nikelroid/README.md

Hi, I'm Nima Kelidari 👋

AI / ML Engineer · MS Computer Science (AI) @ USC

Reinforcement Learning · Computer Vision · Large-Scale MLOps

🌐 Portfolio📄 CV💼 LinkedIn✉️ Email


🚀 About Me

I'm a Master's student in Computer Science (AI) at USC, with a BS from Sharif University of Technology. I build agents that learn under uncertainty and ship them on infrastructure that scales.

  • 🔭 Researching: adversarial co-evolution of RL and VLM/LLM agents
  • 🛠️ Recently shipped: PPO agents for imperfect-information games, MoE steering at inference time, probing frameworks for speech transformers
  • 🌱 Learning: ROS, control theory, advanced MLOps
  • 🤝 Open to collaborate on: robotics simulation, medical imaging
  • 💬 Ask me about: PPO and offline RL, computer vision, MLOps pipelines on GCP/AWS

🛠️ Tech Stack

Languages

ML & Deep Learning

RL & Simulation   Stable-Baselines3 · PettingZoo · Gymnasium · Ollama · vLLM

Data

MLOps & Cloud

Storage & Systems


📂 Featured Projects

Project What it does Stack
Risk-Scaled Steering in MoE Token-aware steering for MoE LLMs — 3D delta tensors that dynamically scale expert activations to improve safety at inference time. vLLM PyTorch HF
Linguistic-Agnostic SER Probing framework that measures how speech-emotion transformers encode paralinguistic vs. acoustic information across hidden layers. PyTorch HF
Adversarial Co-Evolution Trains PPO agents against LLM opponents in imperfect-information card games via curriculum learning and knowledge distillation. PPO Ollama
Multi-Modal Sentiment Classification Sentiment analysis over image-text conversations with time-dynamics exploration of multimodal cues. PyTorch Pandas

Replace the last row's link with the real repo URL — the original pointed to a Google search.


📊 GitHub

Pinned Loading

  1. adversarial-coevolution adversarial-coevolution Public

    Adversarial Co-Evolution of RL and LLM Agents: A framework for training high-performance PPO agents against Large Language Models in Gin Rummy, utilizing curriculum learning and knowledge distillat…

    Python 2

  2. multimodal-sentiment-classification multimodal-sentiment-classification Public

    A multimodal deep learning framework that fuses visual features from EfficientNet and textual features from BERT to classify sentiment in image-text conversations using the MSCTD dataset.

    Jupyter Notebook 9 2

  3. anime-recommender-application anime-recommender-application Public

    An end-to-end MLOps project implementing a deep learning-based anime recommendation system with automated CI/CD deployment to Google Kubernetes Engine.

    Python

  4. image-compression-svd-fft image-compression-svd-fft Public

    A Python implementation of lossy image compression and reconstruction utilizing Singular Value Decomposition (SVD) and Fast Fourier Transform (FFT) techniques for optimal storage efficiency.

    Jupyter Notebook 3

  5. artist-explorer-app artist-explorer-app Public

    A comprehensive full-stack web and Android application for exploring artists and artworks using the Artsy API, featuring JWT authentication, MongoDB favorites, and an experimental AI assistant.

    HTML

  6. mysql-metadata-manager mysql-metadata-manager Public

    A Python-based GUI application using Tkinter and MySQL to reverse-engineer, manage, and export database schema metadata.

    Jupyter Notebook 3