Automated synchronization of VPC firewall rules between dev and prod environments for compliance

barbara_pro · October 10, 2025, 3:36pm

We’ve successfully implemented an automated firewall rule synchronization solution for our IBM Cloud VPC environments using Schematics and Terraform. The challenge was maintaining consistent security policies across dev, staging, and production while preventing configuration drift and ensuring compliance.

Our setup involves multiple VPCs with complex firewall rules that need to be synchronized while respecting environment-specific variations. The automation handles rule creation, updates, and drift detection through CI/CD pipeline integration. We also needed compliance reporting to track rule changes and ensure audit requirements are met.

The solution leverages Terraform workspaces in Schematics to manage environment-specific configurations while maintaining a single source of truth. Key components include automated rule validation, drift detection mechanisms, and integration with our GitLab CI/CD pipeline for continuous synchronization.

Happy to share implementation details and lessons learned from this project.

scott_dev · October 22, 2025, 8:24am

Great questions! Let me address both comprehensively.

Firewall Rule Automation & Drift Detection: We run scheduled terraform plan operations every 2 hours via Schematics to detect drift. When drift is detected, the system creates a GitLab issue with details and sends Slack notifications to the ops team. For emergency manual changes, we have a grace period of 4 hours before auto-remediation kicks in, giving teams time to document and submit proper change requests. Critical production rules have stricter 1-hour detection with immediate alerts.

Our drift detection logic:


# Scheduled Schematics job checks state
terraform plan -detailed-exitcode
if [ $? -eq 2 ]; then
  log_drift_event && notify_team
fi

CI/CD Pipeline Integration: Our GitLab pipeline has three stages for firewall changes:

Validation Stage: Runs on every commit - terraform validate, tflint for best practices, and custom Python scripts that check rules against security baselines. This includes verifying no overly permissive rules (0.0.0.0/0) exist except for documented exceptions.
Plan Stage: Generates terraform plan for all affected environments. For dev/staging, this runs automatically. For production, it requires manual trigger by team lead.
Apply Stage: Dev/staging auto-apply after successful plan. Production requires two-stage approval: security team member + infrastructure manager. We use GitLab’s approval rules feature with required approvers from specific groups.

The pipeline integrates with Schematics API to trigger workspace operations and retrieve logs. We maintain separate Schematics workspaces per environment but share the same Git repository with branch protection rules.

Compliance Reporting Implementation: Our compliance system tracks three key metrics:

Change Attribution: Every firewall modification is linked to Git commit, author, approvers, and JIRA ticket through commit message parsing.
Rule Lifecycle: We maintain a JSON database tracking when each rule was created, modified, and by whom. Monthly reports show rule age, modification frequency, and compliance status.
Policy Violations: Automated scans check for rules violating security policies (overly broad access, deprecated protocols, missing descriptions). Violations trigger immediate alerts and block deployments.

We export compliance data to IBM Cloud Object Storage for long-term retention and generate monthly PDF reports using Python with the ReportLab library. The reports include rule change summaries, drift incidents, policy violations, and remediation actions taken.

Key Lessons Learned:

Start with read-only drift detection before enabling auto-remediation
Document all environment-specific exceptions in code comments
Implement gradual rollout: dev → staging → canary prod → full prod
Use Terraform modules for reusable firewall rule patterns
Maintain a rule naming convention that includes purpose and owner

The entire solution reduced our firewall management overhead by 70% and eliminated configuration drift incidents. Compliance audit preparation time dropped from 2 weeks to 2 hours with automated reporting.

Happy to share specific code snippets or discuss integration with other IBM Cloud services like Security and Compliance Center if needed.

markcoder · October 17, 2025, 2:27pm

How do you handle the compliance reporting aspect? We need to generate audit reports showing who changed what rules and when. Does Schematics provide built-in audit logging, or did you build custom reporting on top of it?

garymaster · October 16, 2025, 12:19am

We use Terraform workspaces with a single codebase and separate state files per environment. The key is using variable files for environment-specific overrides. Our base configuration defines common rules, and each environment has a tfvars file for exceptions. For example, dev might allow broader SSH access while prod restricts it to bastion hosts only. We also tag all resources with environment labels for tracking.

thomasarchitect · October 12, 2025, 4:55pm

This sounds exactly like what we need! We’re struggling with manual firewall rule updates across environments. How did you structure your Terraform code to handle environment-specific variations while keeping the core rules synchronized? Did you use separate state files for each environment?

raymonddev · October 22, 2025, 6:21am

Interested in your CI/CD integration approach. Are you running terraform plan on every commit to detect changes? How do you handle the approval workflow for production firewall changes? We need multiple approvers for prod security groups.

brenda_func · October 20, 2025, 2:49pm

What about drift detection? If someone manually changes a rule in the console, how quickly does your system detect and remediate it? We’ve had issues with emergency changes bypassing automation.

sharon_wizard · October 18, 2025, 4:29am

Schematics provides activity tracking through IBM Cloud Activity Tracker, which logs all workspace operations. We enhanced this with custom reporting using the Schematics API to extract change history and correlate it with Git commits. Our pipeline generates weekly compliance reports in JSON format that map firewall rule changes to specific pull requests and approvers. We also implemented pre-commit hooks that validate rules against our security policies before they reach the pipeline. This caught several misconfigurations before deployment.

Topic		Views
Automated compliance policy enforcement for GL module compute infrastructure Oracle Cloud use-case , compute , terraform , oci-2019 , python , compliance-governance , infrastructure-as-code , oci-compute , policy-automation	7	September 18, 2025
Automated network policy enforcement using OCI security APIs reduces compliance violations in regulated environments Oracle Cloud use-case , ci-cd , audit , automation , compliance , rest-api , oci-2019 , python , apis	4	September 30, 2025
Automated IAM policy enforcement for resource tagging improved audit compliance IBM Cloud use-case , security , compliance , iam , terraform , ic-2019 , schematics , activity-tracker , resource-tagging	3	June 30, 2025
Network segmentation for compliance: How granular should firewall rules be? Google Cloud Platform (GCP) discussion , networking , vpc , gcp-2021 , policy-enforcement , firewall-rules , compliance-governance , segmentation-complexity	5	July 19, 2025
ERP firewall management strategies for hybrid cloud networks with Azure Microsoft Azure discussion , networking , security , hybrid-cloud , az-2020 , nsg , azure-firewall , firewall-management , change-automation	4	March 3, 2025
Automated IAM policy enforcement for API Connect gateway reducing security incidents IBM Cloud use-case , security , automation , rest-api , iam , ic-2019 , python , policy-enforcement , api-connect	5	April 20, 2025
Automated security policy alerts for suspicious device behavior reduced incident response by 70% Microsoft Azure IoT use-case , automation , security-policy , json , alerting , azure-monitor , incident-response , aziot-24 , security-alerts	7	December 4, 2025
Automated database schema synchronization across environments using OCI Resource Manager and Terraform Oracle Cloud use-case , ci-cd , automation , database , devops , terraform , oci-2021 , schema-drift , resource-manager	7	August 17, 2025
Automated compliance audit reporting reduced manual review time by 75% for financial controls Tableau use-case , data-quality , reporting , tab-2023-3 , tableau-desktop , compliance-automation , audit-tracking , validation-rules , financial-reporting	6	August 21, 2025

Automated synchronization of VPC firewall rules between dev and prod environments for compliance

Related topics