Prazwal Pandey
Python Developer | Data Engineer
Data Engineer focused on building reliable large-scale data pipelines and real-time ingestion systems. I work with distributed architectures, OLAP infrastructure, and backend tooling to turn raw data into production-grade systems.
About
I'm a Python developer and data engineer focused on building scalable data systems, ingestion pipelines, and distributed architectures. My work involves real-time blockchain data ingestion, OLAP data infrastructure, and building robust backend tooling. I care deeply about system reliability, clean code, and making data accessible at scale.
Tech Stack & Tools
Languages
Infrastructure & Tools
Data Engineering
Data Platforms
Experience
Kavaya.ai
May 2025 — PresentData Engineer
Solana Data Ingestion Pipeline
- Built real-time blockchain ingestion pipelines using Go (RPC and gRPC).
- Used Bento for stateless transformation and WarpStream for streaming.
- Achieved robust fault-tolerant architecture with ~1.4 second average ingestion delay.
Newsletter Scraping System
- Built large-scale scraping pipelines using Python and Playwright.
Backtest Engine CLI
- Developed a command-line tool for a backtesting platform using Go and Cobra.
Vendor Based Base (Ethereum L2) Data Ingestion
- Built ingestion pipelines using Goldsky APIs.
Lakehouse Infrastructure
- Implemented Iceberg-based lakehouse architecture.
- Polaris as catalog, Cloudflare R2 for S3 compatible storage.
- Trino as distributed query engine.
Spark System
- Experience building data processing systems using PySpark.
Education
Bachelor of Computer Engineering
Pulchowk Campus
Institute of Engineering, Nepal
Licensed Computer Engineer
Nepal Engineering Council
General Category
Get in Touch
I'm always open to discussing data engineering, distributed systems, or interesting projects. Feel free to reach out.