Ranger Tagsync
Collect Ranger Tagsync metrics
Configuration
1. Ranger Tagsync Configuration
1.1 Download jmx-exporter
Download URL: https://github.com/prometheus/jmx_exporter
1.2 Download jmx script
Download URL: https://github.com/lrwh/jmx-exporter/blob/main/common.yml
1.3 Ranger Tagsync Startup Parameter Adjustment
Add the following to the Ranger Tagsync startup parameters:
{JAVA_GC_ARGS} -javaagent:/opt/jmx/jmx_exporter-1.0.1.jar=localhost:17144:/opt/jmx/common.yml
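The agent argument has the form `-javaagent:<jar>=<host>:<port>:<config>`, and the host and port chosen here determine the metrics URL that the collector scrapes later. A minimal Python sketch of that relationship follows. The helper name `metrics_url_from_javaagent` is hypothetical, and the sketch assumes the host is given explicitly and the exporter serves the `/metrics` path used later in this guide.

```python
def metrics_url_from_javaagent(arg: str) -> str:
    """Derive the scrape URL from a -javaagent argument (hypothetical helper)."""
    prefix = "-javaagent:"
    if not arg.startswith(prefix):
        raise ValueError("not a -javaagent argument")
    # Split "<jar path>=<host>:<port>:<config path>" at the first "=".
    jar_path, sep, options = arg[len(prefix):].partition("=")
    if not sep:
        raise ValueError("missing '=<host>:<port>:<config>' part")
    # Assumption: the host is present, so there are three ":"-separated fields.
    host, port, config_path = options.split(":", 2)
    if not port.isdigit():
        raise ValueError(f"port is not numeric: {port!r}")
    return f"http://{host}:{port}/metrics"


if __name__ == "__main__":
    arg = "-javaagent:/opt/jmx/jmx_exporter-1.0.1.jar=localhost:17144:/opt/jmx/common.yml"
    print(metrics_url_from_javaagent(arg))
```

If the port in the startup parameter is changed, the `urls` value in the collector configuration has to change with it.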
1.4 Restart Ranger Tagsync
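After the restart, it can be useful to confirm that the exporter is actually serving Tagsync metrics before moving on to DataKit. The following is a minimal sketch using only the Python standard library. It assumes the agent listens on localhost:17144 as configured in the startup parameter and that metric names carry the `hadoop_tagsync_` prefix; the function names are illustrative.

```python
from urllib.request import urlopen


def tagsync_metric_names(exposition_text: str, prefix: str = "hadoop_tagsync_") -> list[str]:
    """Return sorted metric names starting with `prefix` found in Prometheus text output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        # A sample line is "<name>{labels} <value>" or "<name> <value>".
        name = line.split("{", 1)[0].split(None, 1)[0]
        if name.startswith(prefix):
            names.add(name)
    return sorted(names)


def fetch_metrics(url: str = "http://localhost:17144/metrics", timeout: float = 5.0) -> str:
    """Download the raw metrics text from the exporter endpoint."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    found = tagsync_metric_names(fetch_metrics())
    print(f"{len(found)} Tagsync metrics exposed")
    for name in found:
        print(name)
```

An empty result usually means the agent was not loaded or the port differs from the one in the startup parameter.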
2. DataKit Collector Configuration
2.1 Install DataKit
2.2 Configure Collector
Since jmx-exporter exposes a metrics URL directly, the metrics can be collected with the prom collector.
Navigate to the conf.d/prom directory under the DataKit installation directory and copy prom.conf.sample to ranger-tagsync.conf.
cp prom.conf.sample ranger-tagsync.conf
Modify the contents of ranger-tagsync.conf as follows:
urls = ["http://localhost:17144/metrics"]
source = "ranger-tagsync"
interval = "10s"
[inputs.prom.tags]
component = "ranger-tagsync"
Adjust other settings as needed. Parameter descriptions:
- urls: the jmx-exporter metrics URL; fill in the metrics URL exposed by the corresponding component
- source: collector alias; a distinct value is recommended so the data can be told apart
- keep_exist_metric_name: keep the original metric name
- interval: collection interval
- inputs.prom.tags: additional tags to add
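One detail worth keeping in mind when editing the file: it is TOML, so a key written after a table header such as `[inputs.prom.tags]` belongs to that table, and collector-level settings such as `interval` are safest placed before it. The sketch below renders the keys discussed here in that order. `render_prom_fragment` is a hypothetical helper, not part of DataKit, and the enclosing section header from the sample file is left out.

```python
def render_prom_fragment(url: str, source: str, interval: str, tags: dict[str, str]) -> str:
    """Render the collector keys discussed above as TOML text (hypothetical helper)."""
    lines = [
        f'urls = ["{url}"]',
        f'source = "{source}"',
        # Collector-level keys go before the tags table header.
        f'interval = "{interval}"',
        "[inputs.prom.tags]",
    ]
    # Everything after the header is a tag key/value pair.
    lines += [f'{key} = "{value}"' for key, value in tags.items()]
    return "\n".join(lines)


if __name__ == "__main__":
    print(render_prom_fragment(
        url="http://localhost:17144/metrics",
        source="ranger-tagsync",
        interval="10s",
        tags={"component": "ranger-tagsync"},
    ))
```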
3. Restart DataKit
Metrics
Hadoop Measurement
Ranger Tagsync metrics are reported under the Hadoop measurement. The table below describes the main Ranger Tagsync metrics.
Metric | Description | Unit |
---|---|---|
hadoop_tagsync_countevents | Total number of events that occurred | count |
hadoop_tagsync_counteventstime | Total time spent processing these events | ms |
hadoop_tagsync_countuploads | Total number of upload operations | count |
hadoop_tagsync_countuploadstime | Total time spent on upload operations | ms |
hadoop_tagsync_droppedpuball | Total number of dropped publish events | count |
hadoop_tagsync_gccounttotal | Total number of garbage collections (GC) | count |
hadoop_tagsync_gctimemax | Maximum time spent on garbage collection (GC) | ms |
hadoop_tagsync_gctimetotal | Total time spent on garbage collection (GC) | ms |
hadoop_tagsync_getgroupsavgtime | Average time to get user groups | ms |
hadoop_tagsync_getgroupsnumops | Number of operations to get user groups | count |
hadoop_tagsync_loginfailureavgtime | Average time spent on login failures | ms |
hadoop_tagsync_loginfailurenumops | Number of login failures | count |
hadoop_tagsync_loginsuccessavgtime | Average time spent on successful logins | ms |
hadoop_tagsync_loginsuccessnumops | Number of successful logins | count |
hadoop_tagsync_memorycurrent | Current memory usage | count |
hadoop_tagsync_memorymax | Maximum memory usage | count |
hadoop_tagsync_numactivesinks | Number of active sinks | count |
hadoop_tagsync_numactivesources | Number of active data sources | count |
hadoop_tagsync_numallsinks | Total number of sinks | count |
hadoop_tagsync_processorsavailable | Number of available processors | count |
hadoop_tagsync_publishavgtime | Average time spent on publish operations | ms |
hadoop_tagsync_publishnumops | Number of publish operations | count |
hadoop_tagsync_renewalfailures | Number of renewal failures | count |
hadoop_tagsync_renewalfailurestotal | Total number of renewal failures | count |
hadoop_tagsync_sink_jsonavgtime | Average time spent on JSON sink operations | ms |
hadoop_tagsync_sink_jsondropped | Number of messages dropped by the JSON sink | count |
hadoop_tagsync_sink_jsonnumops | Number of JSON sink operations | count |
hadoop_tagsync_sink_jsonqsize | Queue size of the JSON sink | count |
hadoop_tagsync_sink_prometheusavgtime | Average time spent on Prometheus sink operations | ms |
hadoop_tagsync_sink_prometheusdropped | Number of messages dropped by the Prometheus sink | count |
hadoop_tagsync_sink_prometheusnumops | Number of Prometheus sink operations | count |
hadoop_tagsync_sink_prometheusqsize | Queue size of the Prometheus sink | count |
hadoop_tagsync_snapshotavgtime | Average time spent on snapshot operations | ms |
hadoop_tagsync_snapshotnumops | Number of snapshot operations | count |
hadoop_tagsync_systemloadavg | System load average | count |
hadoop_tagsync_threadsblocked | Number of blocked threads | count |
hadoop_tagsync_threadsbusy | Number of busy threads | count |
hadoop_tagsync_threadsremaining | Number of remaining threads | count |
hadoop_tagsync_threadswaiting | Number of waiting threads | count |
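
Several metrics in the table come in count and total-time pairs, for example `hadoop_tagsync_countevents` with `hadoop_tagsync_counteventstime`. Assuming both values are cumulative since process start (an assumption, not something the table states), the average time per event over an interval can be estimated from two successive samples. A minimal sketch with hypothetical sample values:

```python
from typing import Optional


def avg_ms_per_event(prev_count: float, prev_time_ms: float,
                     curr_count: float, curr_time_ms: float) -> Optional[float]:
    """Average processing time per event between two samples, in milliseconds.

    Assumes both inputs are cumulative counters. Returns None when no new
    events were recorded or a counter went backwards (e.g. after a restart).
    """
    delta_count = curr_count - prev_count
    delta_time = curr_time_ms - prev_time_ms
    if delta_count <= 0 or delta_time < 0:
        return None
    return delta_time / delta_count


if __name__ == "__main__":
    # Hypothetical readings of countevents / counteventstime taken 10s apart:
    # 20 new events that took 600 ms in total.
    print(avg_ms_per_event(100, 2500.0, 120, 3100.0))
```

The same calculation would apply to `hadoop_tagsync_countuploads` and `hadoop_tagsync_countuploadstime` under the same assumption.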