28 Jun 2020
prometheus remote write adapter
remote write adapter
以下代码是我参考prometheus官方给出的remote write adapter修改的:
// Copyright 2016 The Prometheus Authors
// Licensed under the Apache License, Version 2.0 (the "License");
// you may not use this file except in compliance with the License.
// You may obtain a copy of the License at
//
// http://www.apache.org/licenses/LICENSE-2.0
//
// Unless required by applicable law or agreed to in writing, software
// distributed under the License is distributed on an "AS IS" BASIS,
// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
// See the License for the specific language governing permissions and
// limitations under the License.
package main
import (
"fmt"
"io/ioutil"
"log"
"net/http"
"github.com/gogo/protobuf/proto"
"github.com/golang/snappy"
"github.com/prometheus/common/model"
"github.com/prometheus/prometheus/prompb"
)
func main() {
fmt.Println("start remote write http server ...")
http.HandleFunc("/receive", func(w http.ResponseWriter, r *http.Request) {
compressed, err := ioutil.ReadAll(r.Body)
if err != nil {
http.Error(w, err.Error(), http.StatusInternalServerError)
return
}
reqBuf, err := snappy.Decode(nil, compressed)
if err != nil {
http.Error(w, err.Error(), http.StatusBadRequest)
return
}
var req prompb.WriteRequest
if err := proto.Unmarshal(reqBuf, &req); err != nil {
http.Error(w, err.Error(), http.StatusBadRequest)
return
}
for _, ts := range req.Timeseries {
m := make(model.Metric, len(ts.Labels))
for _, l := range ts.Labels {
m[model.LabelName(l.Name)] = model.LabelValue(l.Value)
}
fmt.Printf("time series data = <%v> ", m)
for _, s := range ts.Samples {
fmt.Printf("samples.Value = <%f> samples.Timestamp = <%d> seconds\n", s.Value, s.Timestamp/1000)
}
}
})
log.Fatal(http.ListenAndServe(":1234", nil))
}
enable remote write for prometheus
在prometheus的配置文件prometheus.yml中添加以下配置,启用remote write功能。
remote_write:
- url: "http://192.168.1.105:1234/receive"
这里仅保留go_info指标数据,完整的prometheus.yml如下:
# my global config
global:
scrape_interval: 60s # Set the scrape interval to every 15 seconds. Default is every 1 minute.
evaluation_interval: 15s # Evaluate rules every 15 seconds. The default is every 1 minute.
# scrape_timeout is set to the global default (10s).
# send all data to url
remote_write:
- url: "http://192.168.1.105:1234/receive"
write_relabel_configs:
- action: keep
source_labels: [__name__]
regex: go_info
# A scrape configuration containing exactly one endpoint to scrape:
# Here it's Prometheus itself.
scrape_configs:
# The job name is added as a label `job=<job_name>` to any timeseries scraped from this config.
- job_name: 'ssli-prometheus'
# scrape_interval: 20s
scrape_interval: 5s
# metrics_path defaults to '/metrics'
# scheme defaults to 'http'.
static_configs:
- targets: ['192.168.1.105:9090']
启动prometheus
启动prometheus和remote write adapter,查看remote write adapter收到的监控数据。
1 启动prometheus
$ docker run -d --rm -p 9090:9090 -v /Users/ssli/share/git/k8s_practice/prometheus/remote_write/prometheus.yml:/etc/prometheus/prometheus.yml prom/prometheus
8ff76a4d1ac80a275b755102785cfa6d014b8889b2e28f9d99f8f8a535e2fc1c
$ docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
8ff76a4d1ac8 prom/prometheus "/bin/prometheus --c…" 4 seconds ago Up 3 seconds 0.0.0.0:9090->9090/tcp angry_kapitsa
输入 http://localhost:9090/graph 可以看到prometheus的ui界面。
2 启动remote write adapter
remote write adapter会接收到prometheus发送的监控数据,可以看到每隔5s收到一条指标数据。
$ grn remote-write-adapter.go
start remote write http server ...
time series data = <go_info{instance="192.168.1.105:9090", job="ssli-prometheus", version="go1.14.4"}> samples.Value = <1.000000> samples.Timestamp = <1593350133> seconds
time series data = <go_info{instance="192.168.1.105:9090", job="ssli-prometheus", version="go1.14.4"}> samples.Value = <1.000000> samples.Timestamp = <1593350138> seconds
time series data = <go_info{instance="192.168.1.105:9090", job="ssli-prometheus", version="go1.14.4"}> samples.Value = <1.000000> samples.Timestamp = <1593350143> seconds
time series data = <go_info{instance="192.168.1.105:9090", job="ssli-prometheus", version="go1.14.4"}> samples.Value = <1.000000> samples.Timestamp = <1593350148> seconds
...
如果remote write配置了queue_config,且batch_send_deadline配置的时间比job级的scrape_interval小,那么每隔scrape_interval,remote write adapter会接收到prometheus发送的一条监控数据,监控数据之间的时间戳相差scrape_interval秒。如果batch_send_deadline配置的时间比job级的scrape_interval大,那么每隔batch_send_deadline,remote write adapter会接收到prometheus发送的多条监控数据,同样每条数据之间的时间戳相差scrape_interval秒。我理解当batch_send_deadline小于scrape_interval时,即使达到了batch_send_deadline,由于prometheus没有采集到数据,所以没有数据可发送。只有batch_send_deadline大于scrape_interval时,当达到了batch_send_deadline,prometheus会将采集的多条数据一次性发送出去。
ref
- https://github.com/prometheus/prometheus/tree/master/documentation/examples/remote_storage/example_write_adapter
- https://raw.githubusercontent.com/prometheus/prometheus/master/documentation/examples/remote_storage/example_write_adapter/server.go
- https://github.com/prometheus/prometheus/blob/master/documentation/examples/prometheus.yml
LEo
at 12:12