Volcengine MongoDB Sharded Cluster¶
Volcengine MongoDB sharded cluster metrics display, including CPU usage, memory usage, connections, latency, OPS, etc..
Configuration¶
Install Func¶
It is recommended to activate the TrueWatch integration - extensions - DataFlux Func (Automata): all prerequisites are automatically installed, please proceed with the script installation.
If deploying Func manually, refer to Manual Func Deployment
Install Script¶
Note: Please prepare the required Volcengine AK in advance (for simplicity, you can grant the global read-only permission
ReadOnlyAccess
).
To synchronize the monitoring data of MongoDB cloud resources, we install the corresponding collection script: 「TrueWatch Integration (Volcengine-MongoDB Collection)」(ID: integration_volcengine_mongodb_sharded_cluster
).
After clicking 【Install】, enter the corresponding parameters: Volcengine AK, Volcengine account name.
Click 【Deploy Startup Script】, the system will automatically create the Startup
script set and configure the corresponding startup scripts.
After enabling, you can see the corresponding automatic trigger configuration in 「Manage / Automatic Trigger Configuration」. Click 【Execute】 to immediately execute once without waiting for the scheduled time. After a short wait, you can check the execution task records and corresponding logs.
If you want to collect corresponding logs, you also need to enable the corresponding log collection script. If you want to collect bills, you need to enable the cloud bill collection script.
Verification¶
- In 「Manage / Automatic Trigger Configuration」, confirm whether the corresponding task has the corresponding automatic trigger configuration, and check the corresponding task records and logs to see if there are any exceptions.
- In TrueWatch, check if asset information exists in 「Infrastructure / Custom」.
- In TrueWatch, check if there is corresponding monitoring data in 「Metrics」.
Metrics¶
After configuring Volcengine-MongoDB sharded cluster monitoring, the default Measurement is as follows. More metrics can be collected through configuration Volcengine MongoDB Monitoring Metrics Details
MetricName |
Subnamespace |
Metric Name | MetricUnit | Dimension |
---|---|---|---|---|
LogDiskUsage |
config |
Log Disk Usage | Bytes(SI) | ResourceID,Node |
RunningConcurrentWriteRequest |
config |
Current Write Concurrent Requests | Count | ResourceID,Node |
RunningConcurrentReadRequest |
config |
Current Read Concurrent Requests | Count | ResourceID,Node |
CommandOperationPerSec |
config |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
ReplicationDelay |
config |
Primary-Secondary Delay | Second | ResourceID,Node |
NetworkReceiveThroughput |
config |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUsage |
config |
Total Disk Usage | Bytes(SI) | ResourceID,Node |
UpdateOperationPerSec |
config |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
MaxDiskConfigured |
config |
Configured Maximum Available Disk Space | Bytes(SI) | ResourceID,Node |
TimeOutCursor |
config |
Cursor Timeouts | Count | ResourceID,Node |
CurrConn |
config |
Current Connections | Count | ResourceID,Node |
DataDiskUsage |
config |
Data Disk Usage | Bytes(SI) | ResourceID,Node |
ReadIntoCachePerSec |
config |
Data Read into Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUtil |
config |
Total Disk Utilization | Percent | ResourceID,Node |
GlobalWaitWriteLockQueue |
config |
Global Write Lock Wait Queue Length | Count | ResourceID,Node |
TotalOpenCursor |
config |
Total Open Cursors | Count | ResourceID,Node |
NetworkTransmitThroughput |
config |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GlobalWaitReadLockQueue |
config |
Global Read Lock Wait Queue Length | Count | ResourceID,Node |
AvailConcurrentReadRequest |
config |
Available Read Concurrent Requests | Count | ResourceID,Node |
MemUtil |
config |
Memory Utilization | Percent | ResourceID,Node |
GlobalWaitTotalLockQueue |
config |
Global Lock Wait Queue Total Length | Count | ResourceID,Node |
CpuUtil |
config |
CPU Utilization | Percent | ResourceID,Node |
GetmoreOperationPerSec |
config |
GETMORE Operations Per Second | Count/Second | ResourceID,Node |
DeleteOperationPerSec |
config |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
AvailConcurrentWriteRequest |
config |
Available Write Concurrent Requests | Count | ResourceID,Node |
QueryOperationPerSec |
config |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
WrittenFromCachePerSec |
config |
Data Written from Cache to Disk Per Second | Bytes/Second(SI) | ResourceID,Node |
NetworkRequestPerSec |
config |
Network Requests Processed Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
config |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
SlowOpCount |
config |
Slow Query Count | Count | ResourceID,Node |
OplogAvailTime |
config |
Oplog Available Time | Second | ResourceID,Node |
AggregatedCpuUtil |
instance |
CPU Utilization | Percent | ResourceID |
AggregatedMemUtil |
instance |
Memory Utilization | Percent | ResourceID |
AggregatedTotalDiskUtil |
instance |
Total Disk Space Utilization | Percent | ResourceID |
ChunkNumber |
instance |
Shard Chunk Count | Count | ResourceID,shard |
NetworkTransmitThroughput |
mongos |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GetmoreOperationPerSec |
mongos |
GETMORE Operations Per Second | Count/Second | ResourceID,Node |
MemUtil |
mongos |
Memory Utilization | Percent | ResourceID,Node |
CpuUtil |
mongos |
CPU Utilization | Percent | ResourceID,Node |
QueryOperationPerSec |
mongos |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
DeleteOperationPerSec |
mongos |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
mongos |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
NetworkRequestPerSec |
mongos |
Network Requests Processed Per Second | Count/Second | ResourceID,Node |
NetworkReceiveThroughput |
mongos |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
CommandOperationPerSec |
mongos |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
CurrConn |
mongos |
Current Connections | Count | ResourceID,Node |
UpdateOperationPerSec |
mongos |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
ReadIntoCachePerSec |
shard |
Data Read into Cache Per Second | Bytes/Second(SI) | ResourceID,Node |
TotalDiskUtil |
shard |
Total Disk Utilization | Percent | ResourceID,Node |
GlobalWaitWriteLockQueue |
shard |
Global Write Lock Wait Queue Length | Count | ResourceID,Node |
TotalOpenCursor |
shard |
Total Open Cursors | Count | ResourceID,Node |
NetworkTransmitThroughput |
shard |
Network Output Rate | Bytes/Second(SI) | ResourceID,Node |
GlobalWaitReadLockQueue |
shard |
Global Read Lock Wait Queue Length | Count | ResourceID,Node |
AvailConcurrentReadRequest |
shard |
Available Read Concurrent Requests | Count | ResourceID,Node |
DataDiskUsage |
shard |
Data Disk Usage | Bytes(SI) | ResourceID,Node |
GlobalWaitTotalLockQueue |
shard |
Global Lock Wait Queue Total Length | Count | ResourceID,Node |
CpuUtil |
shard |
CPU Utilization | Percent | ResourceID,Node |
GetmoreOperationPerSec |
shard |
GETMORE Operations Per Second | Count/Second | ResourceID,Node |
MemUtil |
shard |
Memory Utilization | Percent | ResourceID,Node |
AvailConcurrentWriteRequest |
shard |
Available Write Concurrent Requests | Count | ResourceID,Node |
QueryOperationPerSec |
shard |
QUERY Operations Per Second | Count/Second | ResourceID,Node |
DeleteOperationPerSec |
shard |
DELETE Operations Per Second | Count/Second | ResourceID,Node |
NetworkRequestPerSec |
shard |
Network Requests Processed Per Second | Count/Second | ResourceID,Node |
InsertOperationPerSec |
shard |
INSERT Operations Per Second | Count/Second | ResourceID,Node |
WrittenFromCachePerSec |
shard |
Data Written from Cache to Disk Per Second | Bytes/Second(SI) | ResourceID,Node |
RunningConcurrentWriteRequest |
shard |
Current Write Concurrent Requests | Count | ResourceID,Node |
LogDiskUsage |
shard |
Log Disk Usage | Bytes(SI) | ResourceID,Node |
CommandOperationPerSec |
shard |
COMMAND Operations Per Second | Count/Second | ResourceID,Node |
ReplicationDelay |
shard |
Primary-Secondary Delay | Second | ResourceID,Node |
NetworkReceiveThroughput |
shard |
Network Input Rate | Bytes/Second(SI) | ResourceID,Node |
RunningConcurrentReadRequest |
shard |
Current Read Concurrent Requests | Count | ResourceID,Node |
UpdateOperationPerSec |
shard |
UPDATE Operations Per Second | Count/Second | ResourceID,Node |
MaxDiskConfigured |
shard |
Configured Maximum Available Disk Space | Bytes(SI) | ResourceID,Node |
TimeOutCursor |
shard |
Cursor Timeouts | Count | ResourceID,Node |
CurrConn |
shard |
Current Connections | Count | ResourceID,Node |
TotalDiskUsage |
shard |
Total Disk Usage | Bytes(SI) | ResourceID,Node |
SlowOpCount |
shard |
Slow Query Count | Count | ResourceID,Node |
OplogAvailTime |
shard |
Oplog Available Time | Second | ResourceID,Node |
Objects¶
The collected Volcengine MongoDB object data structure can be seen in 「Infrastructure - Custom」
{
"category": "custom_object",
"fields": {
"NodeSpec": "rds.mysql.d1.n.1c1g",
"TimeZone": "UTC +08:00",
...
},
"measurement": "volcengine_mongodb_replica_set",
"tags": {
"AllowListVersion": "initial",
"DBEngineVersion": "MongoDB_5_7",
"InstanceId": "mysql-xxx",
"InstanceName": "mysql-xxx",
"InstanceStatus": "Running",
"InstanceType": "DoubleNode",
"LowerCaseTableNames": "1",
"NodeNumber": "2",
"ProjectName": "default",
"RegionId": "cn-beijing",
"StorageSpace": "20",
"StorageType": "LocalSSD",
"SubnetId": "subnet-xxx",
"VpcId": "vpc-xxx",
"ZoneId": "cn-beijing-a",
"name": "mysql-xxx"
}
}