Purpose: Comprehensive reference documentation for building framework wrappers around the Dataiku Python API.
Audience:
- Developers building frameworks/wrappers
- Claude Code sessions doing deep API work
- Anyone needing complete API reference
Sequential guides for understanding the entire API:
-
- Project structure recommendations
- API key setup and security
- File naming conventions
- Configuration patterns
-
- API architecture and design
- Main modules and their purposes
- Package organization
- Entry points and client types
-
- Authentication methods
- Client initialization patterns
- Connection management
- Multi-instance scenarios
-
- Complete project API reference
- All project-level operations
- Project settings and metadata
- Project contents access
-
- Dataset types and creation
- Schema management
- Data access patterns
- Build and partition operations
-
- Recipe types reference
- Recipe creation patterns
- Execution and monitoring
- Schema propagation
-
- Scenario structure
- Steps and triggers
- Execution patterns
- Monitoring and logging
-
- Job lifecycle
- Status monitoring
- Log access
- Async patterns with futures
-
- ML task reference
- Model training and evaluation
- Saved model management
- Deployment patterns
-
- User and group management
- Connection management
- Code environments
- Instance settings
Deep technical reference:
- classes_and_methods.md - Complete class/method inventory (1,162 lines)
- class_index.md - Quick reference organized by category (500 lines)
- common_patterns.md - Workflow patterns and examples
- coverage_analysis.md - Gap analysis and recommendations
- README.md - How to use the inventory
Build a framework wrapper:
- Start with
documentation/01-prerequisites-and-setup.md - Read
documentation/02-api-overview.mdfor architecture - Use
api_inventory/classes_and_methods.mdas complete reference - Check
api_inventory/common_patterns.mdfor implementation patterns
Find a specific class/method:
- Use
api_inventory/class_index.mdfor quick lookup - Check
api_inventory/classes_and_methods.mdfor details
Understand workflows:
- Read relevant
documentation/0X-*.mdguide - Check
api_inventory/common_patterns.mdfor code examples
Assess what's possible:
- Skim
api_inventory/class_index.mdfor categories - Review
api_inventory/coverage_analysis.mdfor completeness
claude-guides/ |
dataiku_framework_reference/ |
|---|---|
| Usage-focused workflow guides | Complete API reference |
| How to accomplish tasks | What exists in the API |
| Best practices and gotchas | Technical specifications |
| Quick project building | Framework development |
| ~6,600 lines | ~10,000+ lines |
Use both together:
claude-guides/- For building Dataiku projects quicklydataiku_framework_reference/- For building wrappers/frameworks
API Inventory:
- 150+ classes documented
- 1,000+ methods cataloged
- 7 client types covered
- 15 functional categories
- 50+ workflow examples
Reference Guides:
- 10 comprehensive sections
- Complete method signatures
- Parameter documentation
- Return type specifications
- Example code throughout
When building with this reference:
your_project/
├── config/
│ ├── APIKEY.txt # Your API key (never commit!)
│ └── config.json # Environment config
├── dataiku_framework_reference/ # This documentation (read-only)
│ ├── documentation/
│ ├── api_inventory/
│ └── README.md
├── claude-guides/ # Usage guides (read-only)
├── your_framework/ # Your wrapper code
│ ├── __init__.py
│ ├── client.py
│ ├── projects.py
│ └── ...
├── tests/ # Your tests
├── examples/ # Usage examples
└── {PROJECT_KEY}_script.py # Project-specific scripts
-
Read prerequisites:
cat documentation/01-prerequisites-and-setup.md
-
Understand API architecture:
cat documentation/02-api-overview.md
-
Explore what's available:
cat api_inventory/class_index.md
-
Dive into specific areas:
cat documentation/04-project-operations.md # For project wrappers cat documentation/05-dataset-operations.md # For dataset wrappers # etc.
-
Find a class:
grep "class DSSDataset" api_inventory/classes_and_methods.md -A 20 -
Find a method:
grep "def build" api_inventory/classes_and_methods.md -
Find patterns:
cat api_inventory/common_patterns.md | grep -A 10 "Dataset Processing"
When updating:
- Keep
api_inventory/as pure technical reference - Put workflow guidance in
documentation/ - Include method signatures and return types
- Add examples for complex patterns
- Update coverage analysis when finding gaps
- API Version: 14.1.3+
- Last Updated: 2025-11-21
- Python: 3.7+
- Total Lines: ~10,000+
- Usage Guides:
../claude-guides/(for building projects) - Official Docs: https://doc.dataiku.com/
- API Reference: https://developer.dataiku.com/latest/api-reference/python/
- GitHub: https://github.com/dataiku/dataiku-api-client-python
Ready to build? Start with documentation/01-prerequisites-and-setup.md