
Overview
The VIP Topics section displays a health overview graph showing topics identified as important to your infrastructure. VIP topics are determined by two key metrics:
- Consumer group count - Topics with many subscribing consumer groups (shown by bar height)
- Message volume - Topics with high throughput and data volume (shown by color intensity)
VIP topics are automatically identified based on cluster-wide analysis of consumer patterns and message throughput. You can use this data-driven approach to prioritize operational focus and monitoring resources.
What the graph shows
VIP topics health overview
The bar graph visualizes your most important topics using two dimensions. Bar height represents the number of consumer groups subscribed to each topic:
- Taller bars indicate more consumer groups depend on the topic
- Many consumers suggest the topic provides data critical to multiple applications
- Wide usage indicates potential for widespread impact if issues occur
Color intensity represents message volume:
- Darker blue indicates higher message volume and throughput
- Lighter blue indicates lower message volume
- Message volume combined with consumer count identifies truly critical topics
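As a rough illustration of how the two dimensions combine, the sketch below sorts topics into four groups by consumer group count and message volume. It is not Console's identification logic, which this page does not describe; the `TopicStats` type, the `classify` function and both cutoff values are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class TopicStats:
    name: str
    consumer_groups: int  # fan-out, drawn as bar height
    messages: int         # volume, drawn as color intensity


def classify(topic: TopicStats, fanout_cutoff: int, volume_cutoff: int) -> str:
    """Place a topic in one of four groups; both cutoffs are hypothetical."""
    tall = topic.consumer_groups >= fanout_cutoff
    dark = topic.messages >= volume_cutoff
    if tall and dark:
        return "many consumers, high volume"
    if tall:
        return "many consumers, lower volume"
    if dark:
        return "few consumers, high volume"
    return "few consumers, lower volume"


# Made-up sample data for illustration only.
topics = [
    TopicStats("orders", consumer_groups=12, messages=9_000_000),
    TopicStats("feature-flags", consumer_groups=10, messages=4_000),
    TopicStats("clickstream", consumer_groups=2, messages=50_000_000),
]
labels = {t.name: classify(t, fanout_cutoff=5, volume_cutoff=1_000_000) for t in topics}
```

Suitable cutoffs depend on the cluster; the values here exist only to make the sample data fall into different groups.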
VIP topic health overview table
Below the graph, an expandable VIP topic health overview table provides detailed metrics for topics with recommendations. The table header shows the percentage of VIP topics that have recommendations requiring attention. Each row shows:
- Topic: topic name with topic type label (internal, streams, or user) and any custom labels
- Fan-out: number of consumer groups subscribed to the topic
- Messages: total message count showing volume
- Replication factor: number of replicas for fault tolerance
- Skew: partition imbalance percentage
Clicking a topic type or label in the table applies it as a global filter across all Insights sections.
Interpreting the visualization
- Tall, dark bars - Topics with many consumers and high message volume are your most business-critical data pipelines requiring highest priority for monitoring and careful change management.
- Tall, light bars - Topics with many consumers but lower message volume may represent configuration or control topics that still require careful management despite lower throughput.
- Short, dark bars - Topics with high message volume but fewer consumers represent specialized high-throughput pipelines requiring performance optimization and capacity planning.
Why VIP topics matter
VIP topics represent the backbone of your data infrastructure. Multiple applications and consumer groups depend on these topics for critical business processes. Issues with VIP topics have cascading impact across downstream applications and services. High traffic makes VIP topics susceptible to performance problems, while configuration issues create data loss risk. Changes to VIP topics require coordination across teams and careful testing. Poor replication factors, insufficient partitions or inadequate retention can cause widespread application failures. VIP topics drive infrastructure capacity planning and justify investment in comprehensive monitoring, stricter alerting thresholds and dedicated operational procedures.
Actions to take in Console
For each VIP topic identified in the dashboard, verify and optimize configurations across multiple areas. Expand each section to review specific actions and guidance.
Review topic configuration
Validate that VIP topics have production-grade settings:
Navigate to the topic
Go to Topics from the main menu and select a VIP topic identified from the Insights dashboard.
Verify critical settings
Check the following configurations:
Replication factor:
- Must be at least 3 for production VIP topics
- Provides fault tolerance for broker failures
- Ensures data durability and availability
- Check the data loss risk graph in risk analysis for topics with insufficient replication
Retention:
- retention.ms - Time-based retention appropriate for business needs
- retention.bytes - Size-based retention per partition if applicable
- Consider longer retention for VIP topics to support late-arriving consumers
Partition count:
- Sufficient partitions for current and projected throughput
- Ideally a multiple of broker count for even distribution
- Adequate parallelism for all consumer groups
Cleanup policy:
- delete - For time-series or event data
- compact - For state or changelog topics
- Appropriate for the data model and consumption patterns
VIP topics should never have a replication factor of 1 or 2 in production environments. If the Risk Analysis section identifies data loss risk for VIP topics, address it immediately.
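The checks above can be expressed as a small validation function, which may help when reviewing many topics at once. This is a minimal sketch, not a Console feature: the `check_vip_config` function and the `replication_factor` and `partitions` dictionary keys are hypothetical, while `cleanup.policy` is the standard Kafka topic setting.

```python
def check_vip_config(config: dict, broker_count: int) -> list[str]:
    """Return findings for one topic's settings; an empty list means no findings."""
    findings = []
    if config["replication_factor"] < 3:
        findings.append("replication factor below 3")
    if config["partitions"] % broker_count != 0:
        findings.append("partition count is not a multiple of broker count")
    # cleanup.policy may be "delete", "compact", or both separated by a comma.
    policies = {p.strip() for p in (config.get("cleanup.policy") or "").split(",")}
    if not policies <= {"delete", "compact"}:
        findings.append("cleanup.policy is unset or unrecognized")
    return findings


# Made-up settings for a topic on a three-broker cluster.
config = {
    "replication_factor": 2,
    "partitions": 10,
    "cleanup.policy": "delete",
    "retention.ms": 604_800_000,  # 7 days
}
findings = check_vip_config(config, broker_count=3)
```

The multiple-of-broker-count check mirrors the "ideally" guidance above, so a finding there is a prompt to review rather than an error.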
Set up monitoring and alerts
Implement proactive monitoring to detect issues before they impact consuming applications:
Set up alerts for topic monitoring
Navigate to the VIP topic
Go to Topics and select the VIP topic you want to monitor. Click the Alerts tab on the topic detail page.
Configure consumer lag alerts
Set up alerts for consumer group lag thresholds:
- Define acceptable lag limits based on business requirements
- Use stricter thresholds for VIP topics than standard topics
- Alert on both absolute lag (message count) and time-based lag
For VIP topics, consider alerting when any consumer group exceeds 1000 messages of lag or 5 minutes of time-based lag, rather than the default thresholds used for standard topics.
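To make the two thresholds concrete, the sketch below evaluates sample lag readings against the 1000-message and 5-minute limits suggested above. Alerts themselves are configured in Console; the `GroupLag` type, the `breached_thresholds` function and the sample readings are hypothetical, and time-based lag is assumed here to mean the age of the oldest unconsumed message.

```python
from dataclasses import dataclass

MAX_LAG_MESSAGES = 1000   # example VIP threshold from the text above
MAX_LAG_SECONDS = 5 * 60  # 5 minutes of time-based lag


@dataclass
class GroupLag:
    group: str
    lag_messages: int   # absolute lag, summed over partitions
    lag_seconds: float  # assumed: age of the oldest unconsumed message


def breached_thresholds(lag: GroupLag) -> list[str]:
    """Return which of the two lag limits a consumer group exceeds."""
    reasons = []
    if lag.lag_messages > MAX_LAG_MESSAGES:
        reasons.append("messages")
    if lag.lag_seconds > MAX_LAG_SECONDS:
        reasons.append("time")
    return reasons


# Made-up readings for three consumer groups.
samples = [
    GroupLag("billing", lag_messages=40, lag_seconds=2.0),
    GroupLag("analytics", lag_messages=2500, lag_seconds=90.0),
    GroupLag("audit", lag_messages=300, lag_seconds=900.0),
]
alerts = {s.group: breached_thresholds(s) for s in samples if breached_thresholds(s)}
```

The `audit` group shows why both limits matter: its message lag looks small, yet the oldest message has been waiting for 15 minutes.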
Create under-replicated partition alerts
Configure alerts for replication issues:
- Alert immediately if any partitions become under-replicated
- Under-replicated partitions indicate broker issues or failures
- Critical for VIP topics where data loss risk is unacceptable
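A partition is under-replicated when its in-sync replica (ISR) set is smaller than its assigned replica set. The sketch below applies that rule to partition metadata held in plain dictionaries; the `under_replicated` function and the dictionary layout are hypothetical and stand in for whatever your tooling returns.

```python
def under_replicated(partitions: list[dict]) -> list[int]:
    """Return ids of partitions whose ISR is smaller than the replica set."""
    return [p["partition"] for p in partitions if len(p["isr"]) < len(p["replicas"])]


# Made-up metadata: broker 3 has dropped out of the ISR for partition 1.
partitions = [
    {"partition": 0, "replicas": [1, 2, 3], "isr": [1, 2, 3]},
    {"partition": 1, "replicas": [2, 3, 1], "isr": [2, 1]},
    {"partition": 2, "replicas": [3, 1, 2], "isr": [3, 1, 2]},
]
affected = under_replicated(partitions)
```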
Set disk usage alerts
Monitor storage consumption for VIP topics:
- Alert on rapid growth that could cause disk space issues
- Track retention effectiveness
- Plan capacity expansions before reaching limits
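One simple way to plan ahead of a disk limit is a linear projection from recent size samples. The sketch below assumes one sample per day and steady growth, which real topics may not follow; the `days_until_full` function and the sample figures are hypothetical.

```python
from typing import Optional


def days_until_full(daily_sizes: list[int], capacity: int) -> Optional[float]:
    """Project days until capacity is reached from daily size samples (oldest first).

    Returns None when there are fewer than two samples or usage is not growing.
    """
    if len(daily_sizes) < 2:
        return None
    growth_per_day = (daily_sizes[-1] - daily_sizes[0]) / (len(daily_sizes) - 1)
    if growth_per_day <= 0:
        return None
    return (capacity - daily_sizes[-1]) / growth_per_day


# Made-up sizes in GB over four days, against 400 GB of available disk.
remaining_days = days_until_full([100, 120, 140, 160], capacity=400)
```

With these figures the topic grows 20 GB per day and has 240 GB of headroom, giving a projection of 12 days.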
Establish ownership and governance
Ensure VIP topics have clear ownership and governance from the start through self-service workflows rather than manual tracking.
Use self-service for automatic ownership
Self-service is available with Conduktor Scale Plus only.
Conduktor’s self-service framework provides a GitOps approach to topic lifecycle management where Applications dictate ownership of Kafka resources. When VIP topics are managed through self-service:
- Automatic ownership tracking - applications define owners and business context at topic creation
- Clear accountability - Console automatically assigns ownership to application teams
- Governance enforcement - policies ensure VIP topics meet configuration standards (RF=3, appropriate retention)
- Business context preserved - application definitions maintain documentation about purpose, dependencies and SLAs
- When issues occur, the right teams are contacted immediately through defined ownership
- Configuration changes follow approval workflows specific to business-critical topics
- Governance policies prevent VIP topics from being created with suboptimal settings
- Topic purpose and dependencies are documented in application definitions
Review access controls and security
Ensure VIP topics have appropriate security configurations:
Verify RBAC permissions
Navigate to Settings > RBAC and review permissions for the VIP topic:
- Limit producer permissions to authorized applications only
- Restrict consumer access to approved teams and services
- Require elevated permissions for configuration changes
- Audit permissions regularly for VIP topics
Monitor consumer health
Track all consumer groups subscribed to VIP topics:
Navigate to the VIP topic
Go to Topics and select the VIP topic you want to analyze. Click the Consumer Groups tab on the topic detail page.
Review consumer group metrics
For each consumer group, examine:
- Lag - Current lag per partition and total lag
- State - Active consumers or empty groups
- Members - Number of active consumer instances
- Commit frequency - How often consumers commit offsets
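Per-partition lag is the difference between a partition's end offset and the offset the group has committed, and total lag is the sum over partitions. The sketch below computes both from plain dictionaries; the `partition_lag` function and the sample offsets are hypothetical, and partitions without a committed offset are skipped for simplicity.

```python
def partition_lag(end_offsets: dict[int, int], committed: dict[int, int]) -> dict[int, int]:
    """Lag per partition: end offset minus committed offset, never negative."""
    return {p: max(end_offsets[p] - offset, 0) for p, offset in committed.items()}


# Made-up offsets for a three-partition topic and one consumer group.
end_offsets = {0: 1500, 1: 1200, 2: 900}
committed = {0: 1400, 1: 1200, 2: 100}

lag = partition_lag(end_offsets, committed)
total_lag = sum(lag.values())
```

Here most of the total lag sits on partition 2, which points to a slow or stuck consumer instance rather than a group that is uniformly behind.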
Review partition distribution
Ensure optimal partition allocation across brokers:
Navigate to the VIP topic
Go to Topics and select the VIP topic you want to review. Click the Partitions tab on the topic detail page.
Analyze partition distribution
Review the Per broker view to verify:
- Partitions are evenly distributed across all brokers
- Leadership is balanced (no single broker leads most partitions)
- No brokers are excluded from the topic
- Replica assignments provide proper fault tolerance
The cluster efficiency graph in risk analysis identifies partition distribution issues. For VIP topics, prioritize resolving distribution problems to avoid broker hotspots and performance bottlenecks.
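The leadership and coverage checks can be sketched from a topic's replica assignments. The example below is illustrative only: the `leader_share` and `excluded_brokers` functions are hypothetical, and the sample data treats the first replica in each assignment as the current leader, which holds only while the preferred leader is in charge.

```python
from collections import Counter


def leader_share(leaders: list[int]) -> dict[int, float]:
    """Fraction of partitions led by each broker id."""
    counts = Counter(leaders)
    return {broker: n / len(leaders) for broker, n in counts.items()}


def excluded_brokers(cluster: set[int], replicas: list[list[int]]) -> set[int]:
    """Brokers in the cluster that hold no replica of the topic."""
    used = {broker for assignment in replicas for broker in assignment}
    return cluster - used


# Made-up assignments for a six-partition topic on a four-broker cluster.
replicas = [[1, 2, 3], [1, 3, 2], [1, 2, 3], [1, 3, 2], [2, 1, 3], [3, 1, 2]]
leaders = [assignment[0] for assignment in replicas]  # assumes the first replica leads
shares = leader_share(leaders)
missing = excluded_brokers({1, 2, 3, 4}, replicas)
```

In this sample broker 1 leads four of six partitions and broker 4 holds no replicas, which are the two patterns the list above asks you to look for.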
Check for load imbalance
Switch to Per partition view and compare:
- Partition sizes across all partitions
- Message counts per partition
- Offset ranges (begin offset to end offset)
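Offset ranges give a quick estimate of per-partition message counts, since end offset minus begin offset approximates the number of retained messages. On compacted or transactional topics the offset range overstates the count, so treat it as an upper bound. The `message_counts` and `hot_partitions` functions and the 1.5x factor are hypothetical.

```python
def message_counts(offsets: dict[int, tuple[int, int]]) -> dict[int, int]:
    """Approximate retained messages per partition from (begin, end) offsets."""
    return {p: end - begin for p, (begin, end) in offsets.items()}


def hot_partitions(counts: dict[int, int], factor: float = 1.5) -> list[int]:
    """Partitions holding more than `factor` times the mean message count."""
    mean = sum(counts.values()) / len(counts)
    return sorted(p for p, c in counts.items() if c > factor * mean)


# Made-up (begin offset, end offset) pairs for four partitions.
offsets = {0: (0, 1000), 1: (0, 1100), 2: (500, 5500), 3: (0, 900)}
counts = message_counts(offsets)
hot = hot_partitions(counts)
```

A single partition far above the mean, as with partition 2 here, often indicates a skewed message key.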
Track performance metrics
Monitor VIP topic performance over time:
Navigate to the VIP topic
Go to Topics and select the VIP topic you want to monitor. Click the Monitoring tab on the topic detail page.
Review produce metrics
Analyze produce patterns:
- Messages in per second - Produce rate over time
- Bytes in per second - Data volume throughput
- Look for unusual spikes, drops or patterns
- Establish baseline performance for capacity planning
Review consume metrics
Analyze consume patterns:
- Messages out per second - Consume rate across all consumer groups
- Bytes out per second - Data volume being consumed
- Compare consume rate to produce rate to identify accumulation
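When comparing rates, note that a topic read by several consumer groups is consumed once per group, so the comparison is most meaningful per group. The sketch below assumes you have a per-group consume rate available, which may not match how the Monitoring tab aggregates its metrics; the `backlog_growth` function and the sample rates are hypothetical.

```python
def backlog_growth(produce_rate: float, consume_rates: dict[str, float]) -> dict[str, float]:
    """Messages per second by which each lagging group falls further behind."""
    return {
        group: produce_rate - rate
        for group, rate in consume_rates.items()
        if rate < produce_rate
    }


# Made-up rates in messages per second.
produce_rate = 500.0
consume_rates = {"billing": 500.0, "analytics": 350.0}
growing = backlog_growth(produce_rate, consume_rates)
```

A group that stays in this result over a sustained period is accumulating lag and will eventually breach any lag threshold.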
Troubleshooting
Why does a low-volume topic appear as a VIP topic?
VIP topic identification is based on both message volume and consumer count. A topic may appear as VIP due to high consumer count (configuration topics read by many applications, control plane topics, event notification topics) or because it’s business-critical despite lower volume (compliance, regulatory or SLA requirements). Review the specific topic context to verify the VIP designation is appropriate for your organization’s needs.
Should all VIP topics have the same configuration?
No. VIP topics should share minimum standards (replication factor of 3, configured alerting, documented ownership, proper RBAC permissions) but configurations should be tailored to each topic’s requirements. Partition count, retention, cleanup policy, compression and security settings should be based on specific throughput needs, data lifecycle requirements and sensitivity classification. For example, a high-volume transaction topic may need 50 partitions and 7-day retention, while a configuration topic may need 3 partitions and 90-day retention.
Document organization-wide minimum standards for VIP topics while allowing flexibility for topic-specific optimizations.
What if VIP topic health score is poor?
A poor health score indicates configuration or operational issues. Address them systematically:
- Identify issues - check all recommendations in the VIP topics section and review the risk analysis.
- Prioritize by impact - critical (data loss risk with RF < 3); high (under-replicated partitions); medium (load imbalance); low (cluster efficiency issues).
- Take action - for risk of data loss, follow the replication remediation steps. For cluster efficiency, follow partition troubleshooting steps. For load imbalance, resolve issues with partition skew.
- Verify improvement - monitor the health score after remediation to confirm resolution.
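The prioritization step can be sketched as a sort over the four impact levels listed above. The `SEVERITY` mapping, the issue names and the `prioritize` function are hypothetical labels for this illustration, not identifiers used by Console.

```python
# Lower number means higher priority, following the order given above.
SEVERITY = {
    "data_loss_risk": 0,      # critical: RF < 3
    "under_replicated": 1,    # high
    "load_imbalance": 2,      # medium
    "cluster_efficiency": 3,  # low
}


def prioritize(issues: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Sort (topic, issue) pairs so the most severe issues come first."""
    return sorted(issues, key=lambda issue: SEVERITY[issue[1]])


# Made-up findings across three topics.
issues = [
    ("clickstream", "load_imbalance"),
    ("orders", "under_replicated"),
    ("payments", "data_loss_risk"),
    ("orders", "cluster_efficiency"),
]
ordered = prioritize(issues)
```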