plugeth/metrics
turboboost55 544e4a700b
metrics: improve accuracy of CPU gauges (#26793)
This PR changes metrics collection to actually measure the time interval between collections, rather
than assume 3 seconds. I did some ad hoc profiling, and on slower hardware (eg, my Raspberry Pi 4)
I routinely saw intervals between 3.3 - 3.5 seconds, with some being as high as 4.5 seconds. This
will generally cause the CPU gauge readings to be too high, and in some cases can cause impossibly
large values for the CPU load metrics (eg. greater than 400 for a 4 core CPU).

---------

Co-authored-by: Felix Lange <fjl@twurst.com>
2023-03-07 00:29:48 +01:00
..
exp all: remove unneeded parentheses (#21921) 2021-02-02 11:32:44 +02:00
influxdb metrics/influxdb: fix time ticker leaks (#26507) 2023-01-17 13:45:35 +01:00
librato all: use http package to replace http method names (#26535) 2023-01-24 11:12:25 +02:00
prometheus all: fix some typos (#25551) 2022-08-19 09:00:21 +03:00
config.go cmd, metrics: add support for influxdb-v2 (cherry-picking from italoacasas' changes), leave existing support for v1 to maintain backwards-compatibility. (#23194) 2021-08-17 18:40:14 +02:00
counter_test.go metrics: fix issues reported by staticcheck (#20365) 2019-11-22 16:04:35 +01:00
counter.go
cpu_disabled.go all: add go:build lines (#23468) 2021-08-25 18:46:29 +02:00
cpu_enabled.go metrics: improve accuracy of CPU gauges (#26793) 2023-03-07 00:29:48 +01:00
cpu.go metrics: improve accuracy of CPU gauges (#26793) 2023-03-07 00:29:48 +01:00
cputime_nop.go metrics: improve accuracy of CPU gauges (#26793) 2023-03-07 00:29:48 +01:00
cputime_unix.go metrics: improve accuracy of CPU gauges (#26793) 2023-03-07 00:29:48 +01:00
debug_test.go
debug.go
disk_linux.go
disk_nop.go all: add go:build lines (#23468) 2021-08-25 18:46:29 +02:00
disk.go
doc.go travis: enable test suite on ARM64 (#20219) 2019-11-08 10:58:57 +02:00
ewma_test.go travis: enable test suite on ARM64 (#20219) 2019-11-08 10:58:57 +02:00
ewma.go metrics: make meter updates lock-free (#21446) 2020-08-18 11:27:04 +02:00
FORK.md
gauge_float64_test.go all: fix some typos (#25551) 2022-08-19 09:00:21 +03:00
gauge_float64.go
gauge_test.go all: fix some typos (#25551) 2022-08-19 09:00:21 +03:00
gauge.go core, metrics, p2p: switch some invalid counters to gauges 2019-09-10 14:39:07 +03:00
graphite_test.go
graphite.go
healthcheck.go
histogram_test.go metrics: fix issues reported by staticcheck (#20365) 2019-11-22 16:04:35 +01:00
histogram.go eth/protocols, metrics, p2p: add handler performance metrics 2021-03-26 14:00:06 +02:00
init_test.go
json_test.go metrics: fix issues reported by staticcheck (#20365) 2019-11-22 16:04:35 +01:00
json.go
LICENSE
log.go
memory.md
meter_test.go metrics: zero temp variable in updateMeter (#21470) 2020-08-21 11:04:36 +03:00
meter.go metrics: zero temp variable in updateMeter (#21470) 2020-08-21 11:04:36 +03:00
metrics_test.go metrics: improve reading Go runtime metrics (#25886) 2022-11-11 13:16:13 +01:00
metrics.go metrics: improve accuracy of CPU gauges (#26793) 2023-03-07 00:29:48 +01:00
opentsdb_test.go
opentsdb.go metrics: remove redundant type specifiers (#19090) 2019-02-18 13:37:31 +02:00
README.md metrics: change links in README.md to https (#20182) 2019-10-20 12:25:25 +02:00
registry_test.go all: add whitespace linter (#25312) 2022-07-25 13:14:03 +03:00
registry.go swarm/metrics: Send the accounting registry to InfluxDB (#18470) 2019-01-24 18:57:20 +01:00
resetting_sample.go eth/protocols, metrics: use resetting histograms for rare packets 2021-03-26 16:14:12 +02:00
resetting_timer_test.go
resetting_timer.go
runtimehistogram_test.go metrics: improve reading Go runtime metrics (#25886) 2022-11-11 13:16:13 +01:00
runtimehistogram.go metrics: improve reading Go runtime metrics (#25886) 2022-11-11 13:16:13 +01:00
sample_test.go all: remove deprecated uses of math.rand (#26710) 2023-02-16 14:36:58 -05:00
sample.go all: remove deprecated uses of math.rand (#26710) 2023-02-16 14:36:58 -05:00
syslog.go all: add go:build lines (#23468) 2021-08-25 18:46:29 +02:00
timer_test.go metrics: improve TestTimerFunc (#20818) 2020-03-31 15:01:16 +02:00
timer.go metrics: fix issues reported by staticcheck (#20365) 2019-11-22 16:04:35 +01:00
validate.sh
writer_test.go
writer.go

go-metrics

travis build status

Go port of Coda Hale's Metrics library: https://github.com/dropwizard/metrics.

Documentation: https://godoc.org/github.com/rcrowley/go-metrics.

Usage

Create and update metrics:

c := metrics.NewCounter()
metrics.Register("foo", c)
c.Inc(47)

g := metrics.NewGauge()
metrics.Register("bar", g)
g.Update(47)

r := NewRegistry()
g := metrics.NewRegisteredFunctionalGauge("cache-evictions", r, func() int64 { return cache.getEvictionsCount() })

s := metrics.NewExpDecaySample(1028, 0.015) // or metrics.NewUniformSample(1028)
h := metrics.NewHistogram(s)
metrics.Register("baz", h)
h.Update(47)

m := metrics.NewMeter()
metrics.Register("quux", m)
m.Mark(47)

t := metrics.NewTimer()
metrics.Register("bang", t)
t.Time(func() {})
t.Update(47)

Register() is not threadsafe. For threadsafe metric registration use GetOrRegister:

t := metrics.GetOrRegisterTimer("account.create.latency", nil)
t.Time(func() {})
t.Update(47)

NOTE: Be sure to unregister short-lived meters and timers otherwise they will leak memory:

// Will call Stop() on the Meter to allow for garbage collection
metrics.Unregister("quux")
// Or similarly for a Timer that embeds a Meter
metrics.Unregister("bang")

Periodically log every metric in human-readable form to standard error:

go metrics.Log(metrics.DefaultRegistry, 5 * time.Second, log.New(os.Stderr, "metrics: ", log.Lmicroseconds))

Periodically log every metric in slightly-more-parseable form to syslog:

w, _ := syslog.Dial("unixgram", "/dev/log", syslog.LOG_INFO, "metrics")
go metrics.Syslog(metrics.DefaultRegistry, 60e9, w)

Periodically emit every metric to Graphite using the Graphite client:


import "github.com/cyberdelia/go-metrics-graphite"

addr, _ := net.ResolveTCPAddr("tcp", "127.0.0.1:2003")
go graphite.Graphite(metrics.DefaultRegistry, 10e9, "metrics", addr)

Periodically emit every metric into InfluxDB:

NOTE: this has been pulled out of the library due to constant fluctuations in the InfluxDB API. In fact, all client libraries are on their way out. see issues #121 and #124 for progress and details.

import "github.com/vrischmann/go-metrics-influxdb"

go influxdb.InfluxDB(metrics.DefaultRegistry,
  10e9, 
  "127.0.0.1:8086", 
  "database-name", 
  "username", 
  "password"
)

Periodically upload every metric to Librato using the Librato client:

Note: the client included with this repository under the librato package has been deprecated and moved to the repository linked above.

import "github.com/mihasya/go-metrics-librato"

go librato.Librato(metrics.DefaultRegistry,
    10e9,                  // interval
    "example@example.com", // account owner email address
    "token",               // Librato API token
    "hostname",            // source
    []float64{0.95},       // percentiles to send
    time.Millisecond,      // time unit
)

Periodically emit every metric to StatHat:

import "github.com/rcrowley/go-metrics/stathat"

go stathat.Stathat(metrics.DefaultRegistry, 10e9, "example@example.com")

Maintain all metrics along with expvars at /debug/metrics:

This uses the same mechanism as the official expvar but exposed under /debug/metrics, which shows a json representation of all your usual expvars as well as all your go-metrics.

import "github.com/rcrowley/go-metrics/exp"

exp.Exp(metrics.DefaultRegistry)

Installation

go get github.com/rcrowley/go-metrics

StatHat support additionally requires their Go client:

go get github.com/stathat/go

Publishing Metrics

Clients are available for the following destinations: