Reading & Tools
A curated set of papers, tools, and learning resources that inform my work across reinforcement learning, ML systems, automation, and security-aware engineering. This list will evolve as my work deepens.
Selected Papers & Notes
References that shape how I reason about systems, safety, and reliability.
Reinforcement Learning & Multi-Agent Systems
-
Schulman et al. (2017) — Proximal Policy Optimization Algorithmsarxiv.org/abs/1707.06347
-
Yu et al. (2022) — The Surprising Effectiveness of MAPPO in Cooperative Multi-Agent Gamesarxiv.org/abs/2103.01955
Machine Learning Systems
-
Sculley et al. (2015) — Hidden Technical Debt in Machine Learning Systemspapers.nips.cc/.../86df7dcf
Security Foundations
-
Kocher et al. (1999) — Differential Power Analysisiacr.org/archive/crypto99/1666/16660036.pdf
Tools & Frameworks
Practical stacks I lean on for experiments, automation, and security-aware builds.
ML / RL
- PyTorch — pytorch.org
- Stable-Baselines3 — stable-baselines3.readthedocs.io
- Weights & Biases (experiment tracking) — wandb.ai
Simulation & Robotics
- PyBullet — pybullet.org
- OpenAI Gym / Gymnasium — gymnasium.farama.org
Automation & Engineering
- Python (argparse, logging, pathlib) — docs.python.org/3
- Makefiles & simple shell scripting — gnu.org/software/make/manual/
- Git — git-scm.com/doc
Security & Analysis
- ChipWhisperer (learning reference) — chipwhisperer.readthedocs.io
- OpenSSL documentation — openssl.org/docs/
Courses & Guides
Hands-on material I use to tighten fundamentals and ship responsibly.
- Deep Reinforcement Learning (Berkeley CS285) — rail.eecs.berkeley.edu/deeprlcourse/
- Spinning Up in Deep RL (OpenAI) — spinningup.openai.com
- Learn2Cloud (cloud fundamentals) — learn2cloud.com
- OWASP Top 10 — owasp.org/www-project-top-ten/