Volcengine MongoDB Sharded Cluster¶
Volcengine MongoDB Sharded Cluster Metrics display, including CPU usage, memory usage, connections, latency, OPS, etc.
Configuration¶
Install Func¶
It is recommended to activate the TrueWatch Integration - Extensions - DataFlux Func (Automata): All prerequisites are automatically installed, please continue with the script installation.
If deploying Func manually, refer to Manual Deployment of Func
Install Script¶
Note: Please prepare the required Volcengine AK in advance (for simplicity, you can directly grant the global read-only permission
ReadOnlyAccess
)
To synchronize the monitoring data of MongoDB cloud resources, we install the corresponding collection script: "TrueWatch Integration (Volcengine-MongoDB Collection)" (ID: integration_volcengine_mongodb_sharded_cluster
)
Click 【Install】, then enter the corresponding parameters: Volcengine AK, Volcengine account name.
Click 【Deploy Startup Script】, the system will automatically create the Startup
script set and configure the corresponding startup script.
After enabling, you can see the corresponding automatic trigger configuration in 「Manage / Automatic Trigger Configuration」. Click 【Execute】 to immediately execute it once without waiting for the scheduled time. Wait a moment, and you can view the execution task records and corresponding logs.
If you want to collect corresponding logs, you also need to enable the corresponding log collection script. If you want to collect bills, you need to enable the cloud bill collection script.
We have collected some configurations by default, see the Metrics section for details.
Configure Custom Cloud Object Metrics
Verification¶
- In 「Manage / Automatic Trigger Configuration」, confirm whether the corresponding task has the corresponding automatic trigger configuration, and you can also check the corresponding task records and logs to see if there are any exceptions.
- In TrueWatch, 「Infrastructure / Custom」, check if the asset information exists.
- In TrueWatch, 「Metrics」, check if there is corresponding monitoring data.
Metrics¶
After configuring the Volcengine-MongoDB Sharded Cluster monitoring, the default Measurement is as follows. You can collect more Metrics through configuration Volcengine MongoDB Monitoring Metrics Details
MetricName |
Subnamespace |
Metric Name | MetricUnit | Dimension |
---|---|---|---|---|
LogDiskUsage |
config |
Log Disk Usage | Bytes(SI) | ResourceID,Node |
RunningConcurrentWriteRequest |
config |
Current Write Concurrent Requests | Count | ResourceID,Node |
RunningConcurrentReadRequest |
config |
Current Read Concurrent Requests | Count | ResourceID,Node |
CommandOperationPerSec |
config |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
ReplicationDelay |
config |
Primary-Secondary Delay | Second | ResourceID,Node |
NetworkReceiveThroughput |
config |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUsage |
config |
Total Disk Usage | Bytes(SI) | ResourceID,Node |
UpdateOperationPerSec |
config |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
MaxDiskConfigured |
config |
Configured Maximum Disk Space | Bytes(SI) | ResourceID,Node |
TimeOutCursor |
config |
Cursor Timeouts | Count | ResourceID,Node |
CurrConn |
config |
Current Connections | Count | ResourceID,Node |
DataDiskUsage |
config |
Data Disk Usage | Bytes(SI) | ResourceID,Node |
ReadIntoCachePerSec |
config |
Data Read Into Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUtil |
config |
Total Disk Utilization | Percent | ResourceID,Node |
GlobalWaitWriteLockQueue |
config |
Global Write Lock Wait Queue Length | Count | ResourceID,Node |
TotalOpenCursor |
config |
Total Open Cursors | Count | ResourceID,Node |
NetworkTransmitThroughput |
config |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GlobalWaitReadLockQueue |
config |
Global Read Lock Wait Queue Length | Count | ResourceID,Node |
AvailConcurrentReadRequest |
config |
Available Read Concurrent Requests | Count | ResourceID,Node |
MemUtil |
config |
Memory Utilization | Percent | ResourceID,Node |
GlobalWaitTotalLockQueue |
config |
Global Lock Wait Queue Total Length | Count | ResourceID,Node |
CpuUtil |
config |
CPU Utilization | Percent | ResourceID,Node |
GetmoreOperationPerSec |
config |
GETMORE Operations Per Second |
Count/Second | ResourceID,Node |
DeleteOperationPerSec |
config |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
AvailConcurrentWriteRequest |
config |
Available Write Concurrent Requests | Count | ResourceID,Node |
QueryOperationPerSec |
config |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
WrittenFromCachePerSec |
config |
Data Written From Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
NetworkRequestPerSec |
config |
Network Requests Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
config |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
SlowOpCount |
config |
Slow Query Count | Count | ResourceID,Node |
OplogAvailTime |
config |
Oplog Available Time |
Second | ResourceID,Node |
AggregatedCpuUtil |
instance |
CPU Utilization | Percent | ResourceID |
AggregatedMemUtil |
instance |
Memory Utilization | Percent | ResourceID |
AggregatedTotalDiskUtil |
instance |
Total Disk Space Utilization | Percent | ResourceID |
ChunkNumber |
instance |
Shard Chunk Count | Count | ResourceID,shard |
NetworkTransmitThroughput |
mongos |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GetmoreOperationPerSec |
mongos |
GETMORE Operations Per Second |
Count/Second | ResourceID,Node |
MemUtil |
mongos |
Memory Utilization | Percent | ResourceID,Node |
CpuUtil |
mongos |
CPU Utilization | Percent | ResourceID,Node |
QueryOperationPerSec |
mongos |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
DeleteOperationPerSec |
mongos |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
mongos |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
NetworkRequestPerSec |
mongos |
Network Requests Per Second | Count/Second | ResourceID,Node |
NetworkReceiveThroughput |
mongos |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
CommandOperationPerSec |
mongos |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
CurrConn |
mongos |
Current Connections | Count | ResourceID,Node |
UpdateOperationPerSec |
mongos |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
ReadIntoCachePerSec |
shard |
Data Read Into Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUtil |
shard |
Total Disk Utilization | Percent | ResourceID,Node |
GlobalWaitWriteLockQueue |
shard |
Global Write Lock Wait Queue Length | Count | ResourceID,Node |
TotalOpenCursor |
shard |
Total Open Cursors | Count | ResourceID,Node |
NetworkTransmitThroughput |
shard |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GlobalWaitReadLockQueue |
shard |
Global Read Lock Wait Queue Length | Count | ResourceID,Node |
AvailConcurrentReadRequest |
shard |
Available Read Concurrent Requests | Count | ResourceID,Node |
DataDiskUsage |
shard |
Data Disk Usage | Bytes(SI) | ResourceID,Node |
GlobalWaitTotalLockQueue |
shard |
Global Lock Wait Queue Total Length | Count | ResourceID,Node |
CpuUtil |
shard |
CPU Utilization | Percent | ResourceID,Node |
GetmoreOperationPerSec |
shard |
GETMORE Operations Per Second |
Count/Second | ResourceID,Node |
MemUtil |
shard |
Memory Utilization | Percent | ResourceID,Node |
AvailConcurrentWriteRequest |
shard |
Available Write Concurrent Requests | Count | ResourceID,Node |
QueryOperationPerSec |
shard |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
DeleteOperationPerSec |
shard |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
NetworkRequestPerSec |
shard |
Network Requests Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
shard |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
WrittenFromCachePerSec |
shard |
Data Written From Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
RunningConcurrentWriteRequest |
shard |
Current Write Concurrent Requests | Count | ResourceID,Node |
LogDiskUsage |
shard |
Log Disk Usage | Bytes(SI) | ResourceID,Node |
CommandOperationPerSec |
shard |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
ReplicationDelay |
shard |
Primary-Secondary Delay | Second | ResourceID,Node |
NetworkReceiveThroughput |
shard |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
RunningConcurrentReadRequest |
shard |
Current Read Concurrent Requests | Count | ResourceID,Node |
UpdateOperationPerSec |
shard |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
MaxDiskConfigured |
shard |
Configured Maximum Disk Space | Bytes(SI) | ResourceID,Node |
TimeOutCursor |
shard |
Cursor Timeouts | Count | ResourceID,Node |
CurrConn |
shard |
Current Connections | Count | ResourceID,Node |
TotalDiskUsage |
shard |
Total Disk Usage | Bytes(SI) | ResourceID,Node |
SlowOpCount |
shard |
Slow Query Count | Count | ResourceID,Node |
OplogAvailTime |
shard |
Oplog Available Time |
Second | ResourceID,Node |
Objects¶
The collected Volcengine MongoDB object data structure can be seen in 「Infrastructure - Custom」
{
"category": "custom_object",
"fields": {
"NodeSpec": "rds.mysql.d1.n.1c1g",
"TimeZone": "UTC +08:00",
...
},
"measurement": "volcengine_mongodb_replica_set",
"tags": {
"AllowListVersion": "initial",
"DBEngineVersion": "MongoDB_5_7",
"InstanceId": "mysql-xxx",
"InstanceName": "mysql-xxx",
"InstanceStatus": "Running",
"InstanceType": "DoubleNode",
"LowerCaseTableNames": "1",
"NodeNumber": "2",
"ProjectName": "default",
"RegionId": "cn-beijing",
"StorageSpace": "20",
"StorageType": "LocalSSD",
"SubnetId": "subnet-xxx",
"VpcId": "vpc-xxx",
"ZoneId": "cn-beijing-a",
"name": "mysql-xxx"
}
}