Tencent Cloud CVM
Use the "Cloud Sync" series of script packages in the script market to synchronize cloud monitoring and cloud asset data to TrueWatch.
Configuration
Install Func
We recommend activating the TrueWatch Integration - Extensions - DataFlux Func (Automata) option: all prerequisites are installed automatically, and you can proceed directly to the script installation.
If you deploy Func yourself, refer to Self-deploying Func.
Install Script
Note: Prepare a Tencent Cloud AK that meets the requirements in advance (for simplicity, you can grant the global read-only permission `ReadOnlyAccess`).
- Log in to the Func console, click 【Script Market】, enter the TrueWatch script market, and search for `integration_tencentcloud_cvm`.
- Click 【Install】, then enter the corresponding parameters: Tencent Cloud AK ID, AK Secret, and account name.
- Click 【Deploy Startup Script】. The system automatically creates the `Startup` script set and configures the corresponding startup script.
- After enabling, you can see the corresponding automatic trigger configuration in 「Management / Automatic Trigger Configuration」. Click 【Execute】 to run it immediately without waiting for the scheduled time. After a short while, check the task execution records and the corresponding logs.
Verification
- In 「Management / Automatic Trigger Configuration」, confirm that the corresponding task has an automatic trigger configuration, and check the task records and logs for exceptions.
- In TrueWatch, check in 「Infrastructure / Custom」 whether the asset information exists.
- In TrueWatch, check in 「Metrics」 whether there is corresponding monitoring data.
Metrics
After configuring Tencent Cloud - Cloud Monitoring, the default measurements are as follows. You can collect more metrics through configuration. For details, see Tencent Cloud Monitoring Metrics Details.
CPU Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
CpuUsage | CPU Utilization | The percentage of CPU occupied in real-time during machine operation | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
CpuLoadavg | CPU One-Minute Average Load | The average number of tasks using and waiting to use CPU in 1 minute (Windows machines do not have this metric) | - | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
Cpuloadavg5m | CPU Five-Minute Average Load | The average number of tasks using and waiting to use CPU in 5 minutes (Windows machines do not have this metric) | - | InstanceId | 60s, 300s, 3600s |
Cpuloadavg15m | CPU Fifteen-Minute Average Load | The average number of tasks using and waiting to use CPU in 15 minutes (Windows machines do not have this metric) | - | InstanceId | 60s, 300s, 3600s |
BaseCpuUsage | Base CPU Utilization | Base CPU utilization is collected and reported by the host machine, and data can be viewed without installing monitoring components. Sub-machines can still continuously collect and report data under high load conditions. | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
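
As a rough illustration of how the metric name, dimension, and statistical granularity columns fit together, the sketch below builds a request body in the shape used by Tencent Cloud's `GetMonitorData` API (`Namespace`, `MetricName`, `Period`, `Instances`). The `GRANULARITY` lookup and the `build_request` helper are hypothetical names introduced only for this example, and the lookup covers just the CPU metrics listed above. The script package itself may construct its requests differently.

```python
# Allowed statistical granularities (seconds), copied from the CPU table above.
GRANULARITY = {
    "CpuUsage": (10, 60, 300, 3600, 86400),
    "CpuLoadavg": (10, 60, 300, 3600, 86400),
    "Cpuloadavg5m": (60, 300, 3600),
    "Cpuloadavg15m": (60, 300, 3600),
    "BaseCpuUsage": (10, 60, 300, 3600, 86400),
}


def build_request(metric, instance_id, period):
    """Return a GetMonitorData-style request body (hypothetical helper)."""
    allowed = GRANULARITY[metric]
    if period not in allowed:
        raise ValueError(
            f"{metric} does not support a {period}s period; allowed: {allowed}"
        )
    return {
        "Namespace": "QCE/CVM",  # Cloud Monitor namespace for CVM
        "MetricName": metric,
        "Period": period,
        # Every metric in this document uses InstanceId as its only dimension.
        "Instances": [
            {"Dimensions": [{"Name": "InstanceId", "Value": instance_id}]}
        ],
    }


request = build_request("CpuUsage", "ins-bahxxxx", 60)
print(request["MetricName"], request["Period"])
```

Requesting `Cpuloadavg5m` with a 10-second period raises `ValueError` here, which mirrors the table: the five- and fifteen-minute load averages are not available at 10s granularity.
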
GPU Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
GpuMemTotal | GPU Total Memory | GPU total memory | MB | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuMemUsage | GPU Memory Utilization | GPU memory utilization | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuMemUsed | GPU Memory Usage | Evaluate the load on memory usage | MB | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuPowDraw | GPU Power Usage | GPU power usage | W | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuPowLimit | GPU Total Power | GPU total power | W | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuPowUsage | GPU Power Utilization | GPU power utilization | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuTemp | GPU Temperature | Evaluate GPU heat dissipation status | °C | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
GpuUtil | GPU Utilization | Evaluate the computing power consumed by the load, the percentage of non-idle state | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
Network Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
LanOuttraffic | Internal Network Outbound Bandwidth | The average outbound traffic per second of the internal network card | Mbps | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
LanIntraffic | Internal Network Inbound Bandwidth | The average inbound traffic per second of the internal network card | Mbps | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
LanOutpkg | Internal Network Outbound Packet Rate | The average outbound packet rate per second of the internal network card | Packets/sec | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
LanInpkg | Internal Network Inbound Packet Rate | The average inbound packet rate per second of the internal network card | Packets/sec | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
WanOuttraffic | External Network Outbound Bandwidth | The average outbound traffic rate per second of the external network. The minimum granularity data is the total outbound traffic in 10 seconds divided by 10 seconds. This data is the sum of the outbound/inbound bandwidth of EIP+CLB+CVM | Mbps | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
WanIntraffic | External Network Inbound Bandwidth | The average inbound traffic rate per second of the external network. The minimum granularity data is the total inbound traffic in 10 seconds divided by 10 seconds. This data is the sum of the outbound/inbound bandwidth of EIP+CLB+CVM | Mbps | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
WanOutpkg | External Network Outbound Packet Rate | The average outbound packet rate per second of the external network card | Packets/sec | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
WanInpkg | External Network Inbound Packet Rate | The average inbound packet rate per second of the external network card | Packets/sec | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
AccOuttraffic | External Network Outbound Traffic | The average outbound traffic per second of the external network card | MB | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
TcpCurrEstab | TCP Connection Count | The number of TCP connections in the ESTABLISHED state | Count | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
TimeOffset | UTC Time and NTP Time Difference of the Sub-machine | The difference between the UTC time and NTP time of the sub-machine | Seconds | InstanceId | 60s, 300s, 3600s, 86400s |
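
The external bandwidth metrics are described as the total traffic in a 10-second window divided by 10 seconds. A minimal worked example of that arithmetic follows. The function name `average_bandwidth_mbps` is hypothetical, and the conversion assumes the counter is in bytes and that 1 Mbps means 1,000,000 bits per second; neither assumption is confirmed by the source.

```python
def average_bandwidth_mbps(total_bytes, window_seconds=10):
    """Average bandwidth over one sampling window, in Mbps."""
    bits = total_bytes * 8
    # Assumption: 1 Mbps = 10^6 bit/s (decimal, not 1024-based).
    return bits / window_seconds / 1_000_000


# 12,500,000 bytes sent during one 10-second window.
print(average_bandwidth_mbps(12_500_000))
```
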
Memory Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
MemUsed | Memory Usage | The amount of memory actually used by the user, excluding the memory occupied by buffers and system caches. Calculated as total memory minus available memory, where available memory includes buffers and cached | MB | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
MemUsage | Memory Utilization | The actual memory utilization by the user, excluding the memory occupied by buffers and system caches. Calculated as the ratio of memory actually used by the user (excluding buffers, cache, and free memory) to total memory | % | InstanceId | 10s, 60s, 300s, 3600s, 86400s |
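
The two memory metrics are related by simple arithmetic, sketched below with made-up input values. The helper name `memory_metrics` is hypothetical, and "available" is taken to mean free memory plus buffers and cached, following the description in the table.

```python
def memory_metrics(total_mb, available_mb):
    """Return (MemUsed in MB, MemUsage in %) from total and available memory.

    available_mb is assumed to include buffers and cached memory.
    """
    used_mb = total_mb - available_mb
    usage_pct = used_mb / total_mb * 100
    return used_mb, usage_pct


# Illustrative values only: a 2048 MB instance with 1536 MB available.
mem_used, mem_usage = memory_metrics(total_mb=2048, available_mb=1536)
print(mem_used, mem_usage)
```
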
Disk Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
CvmDiskUsage | Disk Utilization | The percentage of disk used capacity to total capacity (all disks) | % | InstanceId | 60s, 300s |
RDMA Monitoring
Metric Name | Metric Chinese Name | Description | Unit | Dimensions | Statistical Granularity |
---|---|---|---|---|---|
RdmaIntraffic | RDMA Network Card Receive Bandwidth | RDMA network card receive bandwidth | Mbps | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaOuttraffic | RDMA Network Card Send Bandwidth | RDMA network card send bandwidth | Mbps | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaInpkt | RDMA Network Card Inbound Packet Rate | RDMA network card inbound packet rate | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaOutpkt | RDMA Network Card Outbound Packet Rate | RDMA network card outbound packet rate | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
CnpCount | CNP Statistics | Congestion Notification Packet Statistics | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
EcnCount | ECN Statistics | Explicit Congestion Notification Statistics | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaPktDiscard | End-to-End Packet Loss | End-to-end packet loss | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaOutOfSequence | Receiver Out-of-Order Error Rate | Receiver out-of-order error rate | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
RdmaTimeoutCount | Sender Timeout Error Rate | Sender timeout error rate | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
TxPfcCount | TX PFC Statistics | TX PFC Statistics | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
RxPfcCount | RX PFC Statistics | RX PFC Statistics | Packets/sec | InstanceId | 60s, 300s, 3600s, 86400s |
Objects
The collected Tencent Cloud CVM object data structure can be viewed in 「Infrastructure - Custom」.
{
  "measurement": "tencentcloud_cvm",
  "tags": {
    "name"              : "ins-bahxxxx",
    "RegionId"          : "ap-shanghai",
    "Zone"              : "ap-shanghai-1",
    "InstanceId"        : "ins-bahxxxx",
    "InstanceChargeType": "POSTPAID_BY_HOUR",
    "InstanceType"      : "SA2.MEDIUM2",
    "OsName"            : "TencentOS Server 3.1 (TK4)"
  },
  "fields": {
    "CPU"               : 2,
    "Memory"            : 2,
    "InstanceState"     : "RUNNING",
    "PublicIpAddresses" : "{Public IP Data}",
    "PrivateIpAddresses": "{Private IP Data}",
    "SystemDisk"        : "{System Disk JSON Data}",
    "DataDisks"         : "{Data Disk JSON Data}",
    "Placement"         : "{Region JSON Data}",
    "ExpiredTime"       : "2022-05-07T01:51:38Z",
    "message"           : "{Instance JSON Data}"
  }
}
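
To show how a consumer might read such an object, here is a minimal sketch that decodes one of the JSON-string fields and computes the time remaining until `ExpiredTime`. It assumes the placeholder fields (such as `SystemDisk`) arrive as JSON-encoded strings, which the structure above suggests but does not confirm. The `SystemDisk` payload and the `days_until_expiry` helper are invented for illustration.

```python
import json
from datetime import datetime, timezone

# Trimmed copy of the object above. The SystemDisk payload is an invented
# placeholder, since the source only shows "{System Disk JSON Data}".
obj = {
    "measurement": "tencentcloud_cvm",
    "tags": {"name": "ins-bahxxxx", "RegionId": "ap-shanghai"},
    "fields": {
        "CPU": 2,
        "Memory": 2,
        "InstanceState": "RUNNING",
        "SystemDisk": '{"DiskType": "CLOUD_PREMIUM", "DiskSize": 50}',
        "ExpiredTime": "2022-05-07T01:51:38Z",
    },
}


def days_until_expiry(fields, now):
    """Whole days between `now` and the instance's ExpiredTime (UTC)."""
    expires = datetime.strptime(fields["ExpiredTime"], "%Y-%m-%dT%H:%M:%SZ")
    expires = expires.replace(tzinfo=timezone.utc)
    return (expires - now).days


system_disk = json.loads(obj["fields"]["SystemDisk"])  # JSON string -> dict
now = datetime(2022, 5, 1, tzinfo=timezone.utc)  # fixed date for a repeatable result
print(system_disk["DiskSize"], days_until_expiry(obj["fields"], now))
```

A fixed `now` is used so the result is reproducible; in practice you would pass `datetime.now(timezone.utc)`.
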