Step 5: Create Data Mirror Task
Creating Your First Data Mirror Task
Now that you have set up your source database and ClickHouse target, you're ready to create a data mirror task. This task will continuously synchronize data from your source to ClickHouse, keeping your analytics data up-to-date in real-time.
Step-by-Step Guide
1. Navigate to the Tasks Section
In the ClickPipes dashboard, click on Tasks in the left navigation menu, then click the Create button.
2. Select Source and Target
In the task creation wizard:
- Select your source connection
- Select your ClickHouse target connection
3. Select Tables to Mirror
Choose which tables you want to mirror from your source database:
- You can select individual tables or use the Select All option
- For each table, you can customize:
- Which columns to include
- Whether to sync full data or incremental changes only
- Transformation rules (optional)
4. Configure Mirror Settings
Configure various task-related settings, such as:
- Performance Configuration:
- Source Table Concurrency: Control the number of tables processed simultaneously
- Single Table Read Concurrency: Optimize reading performance for large tables
- Target Write Batch Size: Adjust batch sizes to balance performance and resource usage
- Resources and Monitoring:
- Memory Usage: Set memory limits for the task
- Scheduled Execution: Configure task execution time and frequency
- Task Alerts: Set up notification rules for exception scenarios
5. Start the Task
Review your settings and click Start Task to begin the synchronization process.
6. Monitor Progress
Once your task is running, you can monitor its progress:
- The dashboard shows real-time metrics including:
- Records processed
- Current throughput
- Sync status for each table
- Latency
- Wait for the initial synchronization to complete. Depending on your data volume, this could take from minutes to hours.
Next Steps
After your data is successfully mirrored to ClickHouse, you can:
- Query your data directly in ClickHouse for analytics
- Create Data APIs to expose your data through HTTP endpoints
- Connect BI tools to your ClickHouse instance for visualization