Writing data from one Prometheus to another Prometheus via remote_write

Published 2022-06-09 | Updated 2025-07-11 | Monitoring


Assume Prometheus1 is a Prometheus instance running inside a cluster, and it needs to remote-write its data to Prometheus_Core.

Enable the remote write receiver on Prometheus_Core

Prometheus_Core must be able to accept remote writes. Enable this by adding the startup flag --web.enable-remote-write-receiver:

./prometheus --web.enable-remote-write-receiver --web.config.file=web.yml --web.listen-address=0.0.0.0:9090

The remote write endpoint is /api/v1/write.

Enable authentication on Prometheus_Core

See the related post on enabling basic_auth authentication in Prometheus.
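For orientation, a minimal sketch of what the web.yml passed through --web.config.file might contain when basic auth is enabled. The user name admin matches the one used in the remote_write configuration in this post; the hash value is a placeholder, not real data, and must be replaced with a bcrypt hash (for example one generated with `htpasswd -nB admin`).

```yaml
# web.yml (sketch; loaded via --web.config.file=web.yml)
basic_auth_users:
  # Format is "username: bcrypt hash of the password".
  # The value below is a placeholder and will not work as written.
  admin: "REPLACE_WITH_BCRYPT_HASH"
```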

Configure remote_write on Prometheus1

Prometheus1 needs a remote_write entry that points at the remote write endpoint of Prometheus_Core:

remote_write:
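- url: "http://127.0.0.1:9090/api/v1/write"   # replace with the address of Prometheus_Core
  basic_auth:                   # required once authentication is enabled
    username: admin             # required once authentication is enabled
    password: xxxxxx            # required once authentication is enabled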
  remote_timeout: 30s
  tls_config:
    insecure_skip_verify: true
  queue_config:
    capacity: 500
    max_shards: 1000
    min_shards: 1
    max_samples_per_send: 100
    batch_send_deadline: 5s
    min_backoff: 30ms
    max_backoff: 100ms
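The alert rule later in this post filters on a project label, so the samples sent by Prometheus1 need to carry one. One way to do that, sketched here as an assumption rather than as the author's exact setup, is external_labels in the global section of Prometheus1: these labels are attached to every series sent through remote write. The value prometheus1 is chosen only to match the matcher used in that rule.

```yaml
# prometheus.yml on Prometheus1 (sketch)
global:
  external_labels:
    # Assumed value; it must match the project matcher in the alert rule
    # evaluated on Prometheus_Core.
    project: prometheus1
```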

Detecting that a remote-write sender is offline

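Remote write differs from federation. With federation, the sources are configured in advance on the central server. With remote write, the receiving Prometheus cannot know ahead of time who will send data to it. The absent function can be used to detect that a sender has stopped delivering data.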

Collect Prometheus's own metrics

To monitor the state of Prometheus1, it must scrape its own metrics (for example the up metric). Make sure Prometheus1 has the following configuration:

scrape_configs:
- job_name: 'prometheus'
  static_configs:
    - targets: ['localhost:9090']

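Add the following to the alerting rules on Prometheus_Core. absent returns 1 when the query matches no series. The series returned by absent does not carry the labels of the original series (with the regex matchers used here it has no labels at all), so this check is written as its own alert rule. The project label in the expression is assumed to be attached on Prometheus1, for example through external_labels.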

groups:
- name: Prometheus1 push alerts
  rules:
  - alert: "Prometheus1 push offline"
    expr: absent(up{job=~"prometheus",project=~"prometheus1"}) == 1
    for: 3m
    labels:
      severity: critical
    annotations:
      description: 'No data from Prometheus1 via remote write'
      summary: "No data from Prometheus1 via remote write"
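As a variation on the rule above, and as a sketch rather than the author's method: when absent is given only equality matchers (= instead of =~), Prometheus derives the labels of the returned series from those matchers, so the alert carries job and project and the annotation can reference them. The group name, alert name, and annotation text below are illustrative assumptions.

```yaml
groups:
- name: remote-write-senders            # hypothetical group name
  rules:
  - alert: RemoteWriteSenderOffline     # hypothetical alert name
    # Equality matchers let absent() copy job and project onto its result.
    expr: absent(up{job="prometheus", project="prometheus1"})
    for: 3m
    labels:
      severity: critical
    annotations:
      summary: "No remote-write data from {{ $labels.project }}"
```

With this form, one rule per sender is still needed, but the fired alert identifies which sender went silent without hard-coding its name in the annotation.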

Author: 张理坤

Article link: https://zahui.fan/posts/666e547f/

Copyright notice: Unless otherwise stated, all posts on this blog are licensed under CC BY-NC-SA 4.0. Please credit 杂烩饭 when reposting.
