This collection of open-source projects was created for educational purposes, to help students and researchers learn about:
- System Design and Distributed Platforms
- AI Agents and Large Language Models
- Data Engineering and Analytics
- Research-to-Implementation Workflows
- Cloud Orchestration and Scalability
- Modern Software Development Practices

- Declarative LLM Programming with DSPy • Progressive labs for learning
- LLM Fine-Tuning Practice • PEFT, LoRA, and advanced techniques
- DuckDB Analytics Practice • SQL optimization and analytics
- Apache Spark Practice • Batch and stream processing
- Apache Iceberg Lakehouse Practice • Time travel and schema evolution
- Apache Beam Practice • Unified batch and stream processing
- Scala Data Analysis Practice • Functional programming and big data
- Automated Research Collection • Multi-source paper aggregation and scoring
- Research to Implementation • Agentic framework for converting papers to code
- Quantitative Finance Research • Framework for implementing trading strategies
- Cloud Orchestration Gateway • Scaling research-to-repo pipelines
- AGI/ASI Educational Resources • Curated collection for learning
- AI-Powered Research Assistant • Academic paper discovery and analysis
- Table Formats Education • Comparison of Iceberg, Delta Lake, Hudi
- Technical Learning Notes • AI/ML, System Design, Data Engineering

- Start with practice repositories for hands-on learning
- Progress to framework projects for understanding architecture
- Explore research resources for advanced concepts
- Use research tools for literature review automation
- Study implementation frameworks for research-to-code workflows
- Leverage cloud orchestration for scaling research
- Practice with code repositories for skill development
- Study framework architecture for system design
- Explore data engineering tools for infrastructure learning
⭐ Star repositories you find helpful for your learning journey!
All projects were created for educational purposes.
