lighthouse

Author	SHA1	Message	Date
blacktemplar	3f0a113c7f	ban IP addresses if too many banned peers for this IP address (#1543 ) ## Issue Addressed #1283 ## Proposed Changes All peers with the same IP will be considered banned as long as there are more than 5 (constant) peers with this IP that have a score below the ban threshold. As soon as some of those 5 peers get unbanned (through decay) and if there are then less than 5 peers with a score below the threshold the IP will be considered not banned anymore.	2020-08-21 01:41:12 +00:00
Pawan Dhananjay	bbed42f30c	Refactor attestation service (#1415 ) ## Issue Addressed N/A ## Proposed Changes Refactor attestation service to send out requests to find peers for subnets as soon as we get attestation duties. Earlier, we had much more involved logic to send the discovery requests to the discovery service only 6 slots before the attestation slot. Now that discovery is much smarter with grouped queries, the complexity in attestation service can be reduced considerably. Co-authored-by: Age Manning <Age@AgeManning.com>	2020-08-19 08:46:25 +00:00
divma	fdc6e2aa8e	Shutdown like a Sir (#1545 ) ## Issue Addressed #1494 ## Proposed Changes - Give the TaskExecutor the sender side of a channel that a task can clone to request shutting down - The receiver side of this channel is in environment and now we block until ctrl+c or an internal shutdown signal is received - The swarm now informs when it has reached 0 listeners - The network receives this message and requests the shutdown	2020-08-19 05:51:14 +00:00
Age Manning	e1e5002d3c	Fingerprint Lodestar (#1536 ) Fingerprints the Lodestar client	2020-08-18 06:28:24 +00:00
Paul Hauner	a58aa6ee55	Revert back to discv5 alpha 8 to maintain ARM support (#1531 ) ## Issue Addressed NA ## Proposed Changes See title. ## Additional Info NA	2020-08-17 10:06:08 +00:00
Age Manning	3c689a6837	Remove yamux support (#1526 ) ## Issue Addressed There is currently an issue with yamux when connecting to prysm peers. The source of the issue is currently unknown. This PR removes yamux support to force mplex negotation. We can add back yamux support once we have isolated and corrected the issue.	2020-08-17 05:05:06 +00:00
Pawan Dhananjay	850a2d5985	Persist metadata and enr across restarts (#1513 ) ## Issue Addressed Resolves #1489 ## Proposed Changes - Change starting metadata seq num to 0 according to the [spec](https://github.com/ethereum/eth2.0-specs/blob/dev/specs/phase0/p2p-interface.md#metadata). - Remove metadata field from `NetworkGlobals` - Persist metadata to disk on every update - Load metadata seq number from disk on restart - Persist enr to disk on update to ensure enr sequence number increments are persisted as well. ## Additional info Since we modified starting metadata seq num to 0 from 1, we might still see `Invalid Sequence number provided` like in #1489 from prysm nodes if they have our metadata cached.	2020-08-17 02:13:28 +00:00
divma	113b40f321	Add multiaddr support in bootnodes (#1481 ) ## Issue Addressed #1384 Only catch, as currently implemented, when dialing the multiaddr nodes, there is no way to ask the peer manager if they are already connected or dialing	2020-08-17 02:13:26 +00:00
Age Manning	99acfb50f2	Update gossipsub duplicate cache (#1524 ) This potentially handles memory leak issues by preventing adding references to already seen gossipsub messages.	2020-08-17 01:27:33 +00:00
Age Manning	c75c06cf16	Update discv5 to alpha.9 (#1517 ) ## Discovery v5 update In this update we remove the openssl dependency in favour of rust-crypto. The update also removes a series of unnecessary async functions which may improve some of the issues we have been experiencing.	2020-08-15 04:02:14 +00:00
Paul Hauner	b063df5bf9	Cross-compile to vendored x86_84, aarch64 (Raspberry Pi 4) (#1497 ) ## Issue Addressed NA ## Proposed Changes Adds support for using the [`cross`](https://github.com/rust-embedded/cross) project to produce cross-compiled binaries using Docker images. Provides quite clean and simple cross-compiles cause all the complexity is hidden in Dockerfiles. It does require you to be in the `docker` group though. ## Details - Adds shortcut commands to `Makefile` - Ensures `reqwest` and `discv5` use vendored openssl libs (i.e., static not shared). - Switches to a [commit](`284f705964`) of blst that has a renamed C function to avoid a collision with openssl (upstream issue: https://github.com/supranational/blst/issues/21). - Updates `ring` to the latest satisfiable version, since an earlier version was causing issues with `cross`. - Off-topic, but adds extra message about Windows support as suggested by Discord user. ## Additional Info - ~~Blocked on #1495~~ - There are no tests in CI for this yet for a few reasons: - I'm hesitant to add more long-running tasks. - Short-term bitrot should be avoided since we'll use it each release. - In the long term I think it would be good to automate binary creation on a release. - I observed the binaries increase in size from 50mb to 52mb after these changes.	2020-08-11 05:16:30 +00:00
divma	1a67d15701	Mitigate too many outgoing connections (#1469 ) limit simultaneous outgoing connections attempts to a reasonable top as an extra layer of protection also shift the keep alive logic of the rpc handler to avoid needing to update it by hand. I think In rare cases this could make shutting down a connection a bit faster.	2020-08-11 02:16:31 +00:00
Age Manning	cbfae87aa6	Upgrade logs (#1495 ) ## Issue Addressed #1483 ## Proposed Changes Upgrades the log to a critical if a listener fails. We are able to listen on many interfaces so a single instance is not critical. We should however gracefully shutdown the client if we have no listeners, although the client can still function solely on outgoing connections. For now a critical is raised and I leave #1494 for more sophisticated handling of this. This also updates discv5 to handle errors of binding to a UDP socket such that lighthouse is now able to handle them.	2020-08-10 05:19:51 +00:00
Age Manning	04e4389efe	Patch gossipsub (#1490 ) ## Issue Addressed Some nodes not following head, high CPU usage and HTTP API delays ## Proposed Changes Patches gossipsub. Gossipsub was using an `lru_time_cache` to check for duplicates. This contained an `O(N)` lookup for every gossipsub message to update the time cache. This was causing high cpu usage and blocking network threads. This PR introduces a custom cache without `O(N)` inserts. This also adds built in safety mechanisms to prevent gossipsub from excessively retrying connections upon failure. A maximum limit is set after which we disconnect from the node from too many failed substream connections.	2020-08-08 08:09:04 +00:00
Age Manning	08a31c5a1a	Disconnect peers (#1484 ) ## Issue Addressed Peers that connected after the peer limit may remain connected in some circumstances. This ensures peers not in the peer manager's list get disconnected. Further logging is also added to track this behaviour.	2020-08-08 06:08:44 +00:00
Age Manning	a1f9769040	Libp2p update (#1482 ) Updates to latest libp2p master. This now has native noise support. This PR - Removes secio support - Prioritises mplex over yamux	2020-08-08 02:17:32 +00:00
divma	7d87e11e0f	Fix rpc coded response display (#1470 ) Prevent errors to be printed in debug mode	2020-08-06 04:29:23 +00:00
Pawan Dhananjay	983f768034	Remove ssz encoding support from rpc (#1457 ) ## Issue Addressed Partially resolves #1422 ## Proposed Changes Remove ssz encoding from req/resp in rpc.	2020-08-06 04:29:19 +00:00
Age Manning	09a615b2c0	Lighthouse crate v0.2.0 bump (#1450 ) ## Description This PR marks Lighthouse v0.2.0. This release marks the stable version of Lighthouse, ready for the approaching Medalla testnet.	2020-08-06 03:43:05 +00:00
divma	924ba66218	Update v0.12.2 gossip params (#1449 ) ## Issue Addressed #1422	2020-08-06 00:04:33 +00:00
Paul Hauner	f26adc0a36	Lighthouse v0.2.0 (Medalla) (#1452 ) ## Issue Addressed NA ## Proposed Changes - Moves the git-based versioning we were doing into the `lighthouse_version` crate in `common`. - Removes the `beacon_node/version` crate, replacing it with `lighthouse_version`. - Bumps the version to `v0.2.0`. ## Additional Info There are now two types of version string: 1. `const VERSION: &str = Lighthouse/v0.2.0-1419501f2+` 1. `version_with_platform() = Lighthouse/v0.2.0-1419501f2+/x86_64-linux` (1) is handy cause it's a `const` and shorter. (2) has platform info so it's more useful. Note that the plus-sign (`+`) indicates the the git commit is dirty (it used to be `(modified)` but I had to shorten it to fit into graffiti). These version strings are now included on: - `lighthouse --version` - `lcli --version` - `curl localhost:5052/node/version` - p2p messages when we communicate our version You can update the version by changing this constant (version is not related to a `Cargo.toml`): `b9ad7102d5/common/lighthouse_version/src/lib.rs (L4-L15)`	2020-08-04 07:44:53 +00:00
divma	1bbecbcf26	Track gossip subscriptions as a metric (#1445 ) ## Issue Addressed #1399 ## Proposed Changes Set an Int gauge per topic and inc/dec when peers subscribe/unsubscribe	2020-08-04 04:18:10 +00:00
Age Manning	31707ccf45	Shift author to sigma prime on some crates (#1440 ) Shifts the author to sigma prime on some crates	2020-08-04 02:31:41 +00:00
Age Manning	1419501f2e	Update peerdb constants (#1444 ) Increases the cache for disconnected and banned peers.	2020-08-03 12:48:22 +00:00
Age Manning	37679b8898	Update score decay behaviour (#1442 )	2020-08-03 20:46:08 +10:00
Age Manning	f634f073a8	Correct issue with network message passing (#1439 ) ## Issue Addressed Sync was breaking occasionally. The root cause appears to be identify crashing as events we being sent to the protocol after nodes were banned. Have not been able to reproduce sync issues since this update. ## Proposed Changes Only send messages to sub-behaviour protocols if the peer manager thinks the peer is connected. All other messages are dropped.	2020-08-03 09:35:53 +00:00
Age Manning	3b5da8f35f	Gossipsub update (#1432 ) ## Issue Addressed The most recent gossipsub update had an issue where some privacy settings lead to not sending a sequence number with the message. Although Lighthouse treats these as valid (based on current configuration) other clients may not. This corrects gossipsub to send sequence numbers where expected and based on the configuration settings.	2020-08-02 13:19:56 +00:00
divma	4d77784bb8	Rate limit RPC requests (#1402 ) ## Issue Addressed #1056 ## Proposed Changes - Add a rate limiter to the RPC behaviour. This also means the rate limiting occurs just before the door to the application level, so the number of connections a peer opens does not affect this (this would happen in the future if put on the handler) - The algorithm used is the leaky bucket as a meter / token bucket implemented the GCRA way - Each protocol has its own limit. Due to the way the algorithm works, the "small" protocols have a hard limit, while bbrange and bbroot allow [burstiness](https://www.wikiwand.com/en/Burstiness). This is so that a peer can't request hundreds of individual requests expecting only one block in a short period of time, it also allows a peer to send two half size requests instead of one with max if they want to without getting limited, and.. it also allows a peer to request a batch of the maximum size and then send _appropriately spaced_ requests of really small sizes. From what I've seen in sync this is plausible when reaching the target slot. ## Additional Info Needs to be heavily tested	2020-07-31 05:47:09 +00:00
Age Manning	a37e75f44b	Downgrade sync and rpc warn logs (#1417 ) * Downgrade sycn and rpc warn logs * Correct warning	2020-07-30 13:52:44 +10:00
Age Manning	febb300a2d	Limit incoming connection requests (#1413 ) ## Issue Addressed Limits the number of incoming connections and adjusts the buffer sizes in libp2p	2020-07-29 06:39:30 +00:00
Age Manning	395d99ce03	Sync update (#1412 ) ## Issue Addressed Recurring sync loop and invalid batch downloading ## Proposed Changes Shifts the batches to include the first slot of each epoch. This ensures the finalized is always downloaded once a chain has completed syncing. Also add in logic to prevent re-dialing disconnected peers. Non-performant peers get disconnected during sync, this prevents re-connection to these during sync. ## Additional Info N/A	2020-07-29 05:25:10 +00:00
Age Manning	ba0f3daf9d	Gossipsub update (#1400 ) ## Issue Addressed N/A ## Proposed Changes This provides a number of corrections and improvements to gossipsub. Specifically - Enables options for greater privacy around the message author - Provides greater flexibility on message validation - Prevents unvalidated messages from being gossiped - Shifts the duplicate cache to a time-based cache inside gossipsub - Updates the message-id to handle bytes - Bug fixes related to mesh maintenance and topic subscription. This should improve our attestation inclusion rate.	2020-07-29 03:40:22 +00:00
realbigsean	09b40b7a5e	Discover query grouping (#1364 ) ## Issue Addressed #1281 ## Proposed Changes Groups queries for specific subnets into groups of up to 3. ## Additional Info	2020-07-29 02:43:50 +00:00
divma	9ae9df806c	Fix clippy lints rpc (#1401 ) ## Issue Addressed #1388 partially (eth2_libp2p & network) ## Proposed Changes TLDR at the end - Complex types are 3 on the handlers/Behaviours but the types are `Poll<ComplexType>` where `ComplexType` comes from the traits of libp2p. Those, I don't thing are worth an alias. A couple more were from using tokio combinators and were removed writing things the async way and using [`BoxFuture`](https://docs.rs/futures/0.3.5/futures/future/type.BoxFuture.html) - The cognitive complexity.. I tried to address those before (they come from the poll functions too) and tbh they are cognitively simpler to understand the way they are now. Moving separate parts to functions doesn't add much since that code is not repeated and they all do early returns. If moved those returns would now need to be wrapped in an Option, probably, and checked to be returned again. I would leave them like that but that's just preference. - Too many arguments: They are not easily put together in a wrapping struct since the parameters don't relate semantically (Ex: fn new with a log, a reference to the chain, a peer, etc) but some may differ. - Needless returns were indeed needless ## Additional Info TLDR: removed needless return, used BoxFuture and async, left the rest untouched since those lgtm	2020-07-28 01:39:42 +00:00
Paul Hauner	b73c497be2	Support multiple BLS implementations (#1335 ) ## Issue Addressed NA ## Proposed Changes - Refactor the `bls` crate to support multiple BLS "backends" (e.g., milagro, blst, etc). - Removes some duplicate, unused code in `common/rest_types/src/validator.rs`. - Removes the old "upgrade legacy keypairs" functionality (these were unencrypted keys that haven't been supported for a few testnets, no one should be using them anymore). ## Additional Info Most of the files changed are just inconsequential changes to function names. ## TODO - [x] Optimization levels - [x] Infinity point: https://github.com/supranational/blst/issues/11 - [x] Ensure milagro and blst are tested via CI - [x] What to do with unsafe code? - [x] Test infinity point in signature sets	2020-07-25 02:03:18 +00:00
blacktemplar	23a8f31f83	Fix clippy warnings (#1385 ) ## Issue Addressed NA ## Proposed Changes Fixes most clippy warnings and ignores the rest of them, see issue #1388.	2020-07-23 14:18:00 +00:00
divma	ba10c80633	Refactor inbound substream logic with async (#1325 ) ## Issue Addressed #1112 The logic is slightly different but still valid wrt to error handling. - Inbound state is either Busy with a future that return the subtream (and info about the processing) - The state machine works as follows: - `Idle` with pending responses => `Busy` - `Busy` => finished ? if so and there are new pending responses then `Busy`, if not then `Idle` => not finished remains `Busy` - Add an `InboundInfo` for readability - Other stuff: - Close inbound substreams when all expected responses are sent - Remove the error variants from `RPCCodedResponse` and use the codes instead - Fix various spelling mistakes because I got sloppy last time Sorry for the delay Co-authored-by: Age Manning <Age@AgeManning.com>	2020-07-23 12:30:43 +00:00
blacktemplar	3c4daec9af	replace max_peers cli argument by target_peers and use excess peers above target_peers capped by a new constant PEER_EXCESS_FACTOR (relative to target_peers) (#1383 )	2020-07-23 13:55:36 +10:00
blacktemplar	f61a7113ac	Do not send regular status updates during syncing (#1375 )	2020-07-22 15:39:56 +10:00
Age Manning	0620f54f2f	Update discv5 and remove discovery retries (#1373 )	2020-07-21 14:19:55 +10:00
Pawan Dhananjay	1c90c816b7	Close outbound stream after sending (#1369 )	2020-07-21 10:57:44 +10:00
Age Manning	e940dcea47	Updates libp2p and noise hack (#1367 )	2020-07-21 10:54:47 +10:00
Age Manning	4a01f44206	Downgrade libp2p and gosispsub (#1358 ) Downgrades libp2p and the gossipsub updates. This looks to resolve the CPU usage issue we have been seeing. The root cause is likely inside the latest gossipsub updates, which will be addressed in a later PR	2020-07-15 05:04:09 +00:00
Age Manning	e6a8635b38	Discovery update (#1349 ) * Improve logging * Discovery update	2020-07-11 12:35:59 +10:00
Akihito Nakano	9ae218bfac	Remove outdated comments (#1336 )	2020-07-11 10:33:11 +10:00
Pawan Dhananjay	275148a152	Fix query expiry computation (#1346 )	2020-07-09 11:43:48 +10:00
Age Manning	025b262e01	Peer scoring updates (#1342 ) * Prevent banned peers from interacting with the behaviour * Update smallvec * Add logs * Correct log update	2020-07-07 20:08:05 +10:00
Age Manning	f500b24242	Update smallvec (#1339 )	2020-07-07 16:57:27 +10:00
Age Manning	5bc8fea2e0	Activate peer scoring (#1284 ) * Initial score structure * Peer manager update * Updates to dialing * Correct tests * Correct typos and remove unused function * Integrate scoring into the network crate * Clean warnings * Formatting * Shift core functionality into the behaviour * Temp commit * Shift disconnections into the behaviour * Temp commit * Update libp2p and gossipsub * Remove gossipsub lru cache * Correct merge conflicts * Modify handler and correct tests * Update enr network globals on socket update * Apply clippy lints * Add new prysm fingerprint * More clippy fixes	2020-07-07 10:13:16 +10:00
Age Manning	5977c00edb	Update libp2p and gossipsub (#1324 ) * Update libp2p and gossipsub * Remove gossipsub lru cache	2020-07-06 20:34:40 +10:00

1 2

64 Commits