Skip to content

bhavink/databricks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Hi there πŸ‘‹ I'm Bhavin Kukadia

πŸ” Cloud Data Platform Architect & Security Specialist

I design and implement secure, production-grade Data and AI platforms across Azure, AWS, and GCP. Specializing in Databricks architecture, zero-trust security, and infrastructure automation.

🎯 What I Do

  • πŸ—οΈ Build secure data lakehouses with Private Link, Unity Catalog, and data exfiltration protection
  • ☁️ Multi-cloud Databricks architecture for regulated industries (finance, healthcare, government)
  • βš™οΈ Infrastructure as Code with modular Terraform templates and automation frameworks
  • πŸ“ Share knowledge through technical articles and open source contributions

πŸ“š Recent Work

Latest Articles (13+ published on Databricks Blog):

πŸ’‘ Core Expertise

Security           Infrastructure        Multi-Cloud
━━━━━━━━━━━━━━━━━  ━━━━━━━━━━━━━━━━━━━━  ━━━━━━━━━━━━━
β€’ DEP Frameworks   β€’ Terraform Modules   β€’ Azure (ADB)
β€’ Unity Catalog    β€’ CI/CD Pipelines     β€’ AWS (DB)
β€’ Private Link     β€’ Config Management   β€’ GCP (DB)
β€’ CMK/Encryption   β€’ Custom Agents       β€’ VNet/VPC/VPC-SC
β€’ Network Security β€’ ADR Framework       β€’ Cross-Cloud

πŸ“« Connect


"Building secure, scalable data platforms that enable innovation while protecting what matters most."


Repository Contents: All Things Databricks βœ…

This repository contains production-ready infrastructure templates, ready-to-use code samples, how-to guides, and deployment architectures to help you learn and operate the Databricks Lakehouse on Azure, AWS, and GCP.


Quick Links πŸ”—

Cloud Description Path
πŸ”· Azure Production-ready security & modular Terraform deployment patterns adb4u
☁️ AWS Private Link workspace templates with DEP controls awsdb4u
🟒 GCP VPC-SC, Private Service Connect, CMEK implementations gcpdb4u
πŸ”Œ REST API Postman collections for Databricks REST APIs databricks-rest-api-collection
πŸš€ Jump Start Curated code samples and tutorials databricks-jump-start
πŸ› οΈ Utils Utilities and helper scripts databricks-utils

🌩️ Databricks Deployment Guides by Cloud

πŸ”· Azure (adb4u)

Production-Ready Modular Terraform Templates

  • βœ… Focus: Security, governance, and production-ready deployment patterns
  • πŸ—οΈ Architecture: Non-PL, Full Private (air-gapped), Hub-Spoke with firewall
  • πŸ” Security: Unity Catalog, Private Link, NPIP/SCC, CMK, Service Endpoints
  • πŸ“š Documentation: 2,300+ lines with UML diagrams, traffic flows, troubleshooting guides
  • πŸ“ Path: adb4u/

Key Features:

  • Modular Terraform structure (Networking, Workspace, Unity Catalog, Key Vault)
  • BYOV (Bring Your Own VNet/Subnet/NSG) support
  • Automated NSG rule management for SCC workspaces
  • Customer-Managed Keys with auto-rotation
  • Comprehensive deployment checklists and troubleshooting

Quick Start: See adb4u/docs/01-QUICKSTART.md


☁️ AWS (awsdb4u)

Private Link Workspace Templates with DEP Controls

  • 🎯 Focus: Deploying and operating Databricks on AWS with best practices
  • πŸ” Security: VPC design, Private Link, PrivateLink endpoints, data exfiltration protection
  • πŸ“Š Topics: S3 data access patterns, IAM roles and policies, cross-account setups
  • πŸ› οΈ Automation: Infrastructure templates and configuration management
  • πŸ“ Path: awsdb4u/

Key Features:

  • Private Link workspace deployments
  • Data Exfiltration Protection (DEP) controls
  • VPC and subnet design patterns
  • IAM role and policy automation
  • Cross-account setup guidance

🟒 GCP (gcpdb4u)

VPC-SC, Private Service Connect, CMEK Implementations

  • 🎯 Focus: GCP-specific guidance with emphasis on data plane security
  • πŸ” Security: VPC-SC perimeters, Private Service Connect, KMS integration
  • 🌐 Networking: VPC and subnet design, private connectivity patterns
  • πŸ”‘ Identity: IAM & service accounts, Workload Identity Federation
  • πŸ“ Path: gcpdb4u/

Key Features:

  • VPC Service Controls (VPC-SC) integration
  • Private Service Connect (PSC) for workspace connectivity
  • Google KMS integration for encryption
  • GCS connectors and data access patterns
  • Data exfiltration prevention patterns

πŸ”§ How to Use This Repository

1. Choose Your Cloud Platform

Pick the folder that matches your target environment:

2. Select Deployment Pattern

Each cloud folder contains multiple deployment patterns:

  • Non-Private Link: Public control plane + private data plane (NPIP)
  • Full Private: Private Link for both control and data planes
  • Hub-Spoke: Centralized networking with egress control

3. Follow Deployment Guides

  • Read the README in your chosen folder
  • Review architecture diagrams and documentation
  • Follow step-by-step deployment instructions
  • Use provided Terraform modules and templates

4. Explore Additional Resources


🌟 Highlighted Features

Production-Ready Templates

  • βœ… Modular Terraform code with conditional logic
  • βœ… Support for BYOV (Bring Your Own VNet/VPC)
  • βœ… Automated network security group rules
  • βœ… Unity Catalog with regional metastore management

Comprehensive Documentation

  • πŸ“š 2,300+ lines of detailed guides
  • πŸ“Š UML architecture and sequence diagrams
  • πŸ” Traffic flow analysis with cost breakdowns
  • ⚠️ Troubleshooting guides and deployment checklists

Security Best Practices

  • πŸ” Data Exfiltration Protection (DEP) frameworks
  • πŸ”‘ Customer-Managed Keys (CMK) with auto-rotation
  • 🌐 Private Link, VPC-SC, and network isolation
  • πŸ›‘οΈ Zero-trust architectures for regulated industries

✨ Contributing

Contributions are welcome! Please:

  1. Open issues for bugs, questions, or feature requests
  2. Submit pull requests for:
    • Documentation improvements
    • Additional cloud scenarios
    • New deployment templates
    • Bug fixes or enhancements

πŸ“„ License

This repository follows the licensing described in the project. Please see the LICENSE file (if present) or reach out for clarification.


πŸ”— Additional Resources

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •