Skip to content

Ranger Tagsync

Collect Ranger Tagsync metrics

Configuration

1. Ranger Tagsync Configuration

1.1 Download jmx-exporter

Download URL: https://github.com/prometheus/jmx_exporter

1.2 Download jmx script

Download URL: https://github.com/lrwh/jmx-exporter/blob/main/common.yml

1.3 Ranger Tagsync Startup Parameter Adjustment

Add the following to the Ranger Tagsync startup parameters:

{JAVA_GC_ARGS} -javaagent:/opt/jmx/jmx_exporter-1.0.1.jar=localhost:17144:/opt/jmx/common.yml

1.4 Restart Ranger Tagsync

2. DataKit Collector Configuration

2.1 Install DataKit

2.2 Configure Collector

Since jmx-exporter directly exposes the metrics URL, it can be collected directly using the prom collector.

Navigate to conf.d/prom under the DataKit installation directory, and copy prom.conf.sample to ranger-tagsync.conf.

cp prom.conf.sample ranger-tagsync.conf

Modify the contents of ranger_tagsync.conf as follows:

  urls = ["http://localhost:17144/metrics"]
  source ="ranger-tagsync"
  [inputs.prom.tags]
    component = "ranger-tagsync" 
  interval = "10s"

Adjust other configurations as needed, parameter description:

  • urls: jmx-exporter metrics URL, fill in the metrics URL exposed by the corresponding component
  • source: Collector alias, recommended to distinguish
  • keep_exist_metric_name: Keep the metric name
  • interval: Collection interval
  • inputs.prom.tags: Add additional tags

3. Restart DataKit

Restart Datakit

Metrics

Hadoop Measurement

Ranger Tagsync metrics are located under the Hadoop Measurement, here mainly introduces the description of Ranger Tagsync related metrics

Metrics Description Unit
hadoop_tagsync_countevents Total number of events occurred count
hadoop_tagsync_counteventstime Total time spent processing these events ms
hadoop_tagsync_countuploads Total number of upload operations count
hadoop_tagsync_countuploadstime Total time spent processing upload operations count
hadoop_tagsync_droppedpuball Total number of lost publish events count
hadoop_tagsync_gccounttotal Total number of garbage collection (GC) count
hadoop_tagsync_gctimemax Maximum time spent on garbage collection (GC) ms
hadoop_tagsync_gctimetotal Total time spent on garbage collection (GC) count
hadoop_tagsync_getgroupsavgtime Average time to get user groups ms
hadoop_tagsync_getgroupsnumops Number of operations to get user groups count
hadoop_tagsync_loginfailureavgtime Average time spent on login failures ms
hadoop_tagsync_loginfailurenumops Number of login failures count
hadoop_tagsync_loginsuccessavgtime Average time spent on successful logins ms
hadoop_tagsync_loginsuccessnumops Number of successful logins count
hadoop_tagsync_memorycurrent Current memory usage count
hadoop_tagsync_memorymax Maximum memory usage count
hadoop_tagsync_numactivesinks Number of active sinks count
hadoop_tagsync_numactivesources Number of active data sources count
hadoop_tagsync_numallsinks Total number of all data sources count
hadoop_tagsync_processorsavailable Number of available processors count
hadoop_tagsync_publishavgtime Average time spent on publish operations count
hadoop_tagsync_publishnumops Number of publish operations count
hadoop_tagsync_renewalfailures Number of renewal failures count
hadoop_tagsync_renewalfailurestotal Total number of renewal failures count
hadoop_tagsync_sink_jsonavgtime Average time spent on JSON sinks count
hadoop_tagsync_sink_jsondropped Number of messages dropped by JSON sinks count
hadoop_tagsync_sink_jsonnumops Number of JSON sink operations count
hadoop_tagsync_sink_jsonqsize Queue size of JSON sinks count
hadoop_tagsync_sink_prometheusavgtime Average time spent on Prometheus sinks count
hadoop_tagsync_sink_prometheusdropped Number of messages dropped by Prometheus sinks count
hadoop_tagsync_sink_prometheusnumops Number of Prometheus sink operations count
hadoop_tagsync_sink_prometheusqsize Queue size of Prometheus sinks count
hadoop_tagsync_snapshotavgtime Average time spent on snapshot operations count
hadoop_tagsync_snapshotnumops Number of snapshot operations count
hadoop_tagsync_systemloadavg System average load count
hadoop_tagsync_threadsblocked Number of blocked threads count
hadoop_tagsync_threadsbusy Number of busy threads count
hadoop_tagsync_threadsremaining Number of remaining threads count
hadoop_tagsync_threadswaiting Number of waiting threads count