lighthouse

Author	SHA1	Message	Date
Paul Hauner	7c0b2755c2	Don't requeue already-known RPC blocks (#4214 ) ## Issue Addressed NA ## Proposed Changes Adds an additional check to a feature introduced in #4179 to prevent us from re-queuing already-known blocks that could be rejected immediately. ## Additional Info Ideally this would have been included in v4.1.0, however we came across it too late to release it safely. We decided that the safest path forward is to release without this check and then patch it in the next version. The lack of this check should only result in a very minor performance impact (the impact is totally negligible in my assessment).	2023-05-15 07:22:04 +00:00
Paul Hauner	714ed53839	Add a flag for storing invalid blocks (#4194 ) ## Issue Addressed NA ## Proposed Changes Adds a flag to store invalid blocks on disk for teh debugz. Only some invalid blocks are stored, those which: - Were received via gossip (rather than RPC, for instance) - This keeps things simple to start with and should capture most blocks. - Passed gossip verification - This reduces the ability for random people to fill up our disk. A proposer signature is required to write something to disk. ## Additional Info It's possible that we'll store blocks that aren't necessarily invalid, but we had an internal error during verification. Those blocks seem like they might be useful sometimes.	2023-05-15 07:22:03 +00:00
ethDreamer	46db30416d	Implement Overflow LRU Cache for Pending Blobs (#4203 ) * All Necessary Objects Implement Encode/Decode * Major Components for LRUOverflowCache Implemented * Finish Database Code * Add Maintenance Methods * Added Maintenance Service * Persist Blobs on Shutdown / Reload on Startup * Address Clippy Complaints * Add (emum_behaviour = "tag") to ssz_derive * Convert Encode/Decode Implementations to "tag" * Started Adding Tests * Added a ton of tests * 1 character fix * Feature Guard Minimal Spec Tests * Update beacon_node/beacon_chain/src/data_availability_checker.rs Co-authored-by: realbigsean <seananderson33@GMAIL.com> * Address Sean's Comments * Add iter_raw_keys method * Remove TODOs --------- Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-05-12 10:08:24 -04:00
Jack McPherson	6235e452e1	Do not attempt to resubscribe to core topics (#4271 ) This commit adds a check to the networking service when handling core gossipsub topic subscription requests. If the BN is already subscribed to the core topics, we won't attempt to resubscribe. ## Issue Addressed #4258 ## Proposed Changes - In the networking service, check if we're already subscribed to all of the core gossipsub topics and, if so, do nothing ## Additional Info N/A	2023-05-08 07:15:26 +00:00
Age Manning	35ca086269	Backfill blocks only to the WSP by default (#4082 ) ## Limit Backfill Sync This PR transitions Lighthouse from syncing all the way back to genesis to only syncing back to the weak subjectivity point (~ 5 months) when syncing via a checkpoint sync. There are a number of important points to note with this PR: - Firstly and most importantly, this PR fundamentally shifts the default security guarantees of checkpoint syncing in Lighthouse. Prior to this PR, Lighthouse could verify the checkpoint of any given chain by ensuring the chain eventually terminates at the corresponding genesis. This guarantee can still be employed via the new CLI flag --genesis-backfill which will prompt lighthouse to the old behaviour of downloading all blocks back to genesis. The new behaviour only checks the proposer signatures for the last 5 months of blocks but cannot guarantee the chain matches the genesis chain. - I have not modified any of the peer scoring or RPC responses. Clients syncing from gensis, will downscore new Lighthouse peers that do not possess blocks prior to the WSP. This is by design, as Lighthouse nodes of this form, need a mechanism to sort through peers in order to find useful peers in order to complete their genesis sync. We therefore do not discriminate between empty/error responses for blocks prior or post the local WSP. If we request a block that a peer does not posses, then fundamentally that peer is less useful to us than other peers. - This will make a radical shift in that the majority of nodes will no longer store the full history of the chain. In the future we could add a pruning mechanism to remove old blocks from the db also. Co-authored-by: Paul Hauner <paul@paulhauner.com>	2023-05-05 03:49:23 +00:00
Akihito Nakano	edbb47dd03	Update igd to v0.12.1 (#4257 ) ## Issue Addressed https://github.com/sigp/lighthouse/issues/4171 ## Proposed Changes Through [this PR](https://github.com/sbstp/rust-igd/pull/56) in rust-igd, `igd` v0.12.1 no longer panics if there is an issue while searching for a gateway. So updating igd makes lighthouse emit a helpful log instead of panicking. ## Additional Info No CHANGELOG exists in rust-igd. 👀 Here is the commit history between v0.11.1 and v0.12.1. No breaking changes. https://github.com/sbstp/rust-igd/compare/v0.11.1...v0.12.1	2023-05-03 04:12:14 +00:00
Age Manning	616bee6757	Maintain trusted peers (#4159 ) ## Issue Addressed #4150 ## Proposed Changes Maintain trusted peers in the pruning logic. ~~In principle the changes here are not necessary as a trusted peer has a max score (100) and all other peers can have at most 0 (because we don't implement positive scores). This means that we should never prune trusted peers unless we have more trusted peers than the target peer count.~~ This change shifts this logic to explicitly never prune trusted peers which I expect is the intuitive behaviour. ~~I suspect the issue in #4150 arises when a trusted peer disconnects from us for one reason or another and then we remove that peer from our peerdb as it becomes stale. When it re-connects at some large time later, it is no longer a trusted peer.~~ Currently we do disconnect trusted peers, and this PR corrects this to maintain trusted peers in the pruning logic. As suggested in #4150 we maintain trusted peers in the db and thus we remember them even if they disconnect from us.	2023-05-03 04:12:10 +00:00
realbigsean	9db6b39dc3	fix check on max request size (#4250 )	2023-05-02 19:14:02 -04:00
Michael Sproul	c11638c36c	Split common crates out into their own repos (#3890 ) ## Proposed Changes Split out several crates which now exist in separate repos under `sigp`. - [`ssz` and `ssz_derive`](https://github.com/sigp/ethereum_ssz) - [`tree_hash` and `tree_hash_derive`](https://github.com/sigp/tree_hash) - [`ethereum_hashing`](https://github.com/sigp/ethereum_hashing) - [`ethereum_serde_utils`](https://github.com/sigp/ethereum_serde_utils) - [`ssz_types`](https://github.com/sigp/ssz_types) For the published crates see: https://crates.io/teams/github:sigp:crates-io?sort=recent-updates. ## Additional Info - [x] Need to work out how to handle versioning. I was hoping to do 1.0 versions of several crates, but if they depend on `ethereum-types 0.x` that is not going to work. EDIT: decided to go with 0.5.x versions. - [x] Need to port several changes from `tree-states`, `capella`, `eip4844` branches to the external repos.	2023-04-28 01:15:40 +00:00
ethDreamer	c1d47da02d	Update `engine_api` to latest version (#4223 ) * Update Engine API to Latest * Get Mock EE Working * Fix Mock EE * Update Engine API Again * Rip out get_blobs_bundle Stuff * Fix Test Harness * Fix Clippy Complaints * Fix Beacon Chain Tests	2023-04-27 14:18:21 -04:00
Age Manning	7456e1e8fa	Separate BN for block proposals (#4182 ) It is a well-known fact that IP addresses for beacon nodes used by specific validators can be de-anonymized. There is an assumed risk that a malicious user may attempt to DOS validators when producing blocks to prevent chain growth/liveness. Although there are a number of ideas put forward to address this, there a few simple approaches we can take to mitigate this risk. Currently, a Lighthouse user is able to set a number of beacon-nodes that their validator client can connect to. If one beacon node is taken offline, it can fallback to another. Different beacon nodes can use VPNs or rotate IPs in order to mask their IPs. This PR provides an additional setup option which further mitigates attacks of this kind. This PR introduces a CLI flag --proposer-only to the beacon node. Setting this flag will configure the beacon node to run with minimal peers and crucially will not subscribe to subnets or sync committees. Therefore nodes of this kind should not be identified as nodes connected to validators of any kind. It also introduces a CLI flag --proposer-nodes to the validator client. Users can then provide a number of beacon nodes (which may or may not run the --proposer-only flag) that the Validator client will use for block production and propagation only. If these nodes fail, the validator client will fallback to the default list of beacon nodes. Users are then able to set up a number of beacon nodes dedicated to block proposals (which are unlikely to be identified as validator nodes) and point their validator clients to produce blocks on these nodes and attest on other beacon nodes. An attack attempting to prevent liveness on the eth2 network would then need to preemptively find and attack the proposer nodes which is significantly more difficult than the default setup. This is a follow on from: #3328 Co-authored-by: Michael Sproul <michael@sigmaprime.io> Co-authored-by: Paul Hauner <paul@paulhauner.com>	2023-04-26 01:12:36 +00:00
Pawan Dhananjay	7a36d004e4	Subscribe blob topics (#4224 )	2023-04-22 09:21:09 -04:00
Pawan Dhananjay	b6c0e91c05	Merge branch 'eip4844' into deneb-free-blobs	2023-04-21 14:34:50 -07:00
Pawan Dhananjay	689c0f76d3	Merge branch 'unstable' into eip4844	2023-04-21 14:13:25 -07:00
Pawan Dhananjay	895bbd6c03	Gossip conditions deneb (#4164 ) * Add all gossip conditions * Handle some gossip errors * Update beacon_node/beacon_chain/src/blob_verification.rs Co-authored-by: Divma <26765164+divagant-martian@users.noreply.github.com> * Add an ObservedBlobSidecars cache --------- Co-authored-by: Divma <26765164+divagant-martian@users.noreply.github.com>	2023-04-20 18:26:20 -04:00
Paul Hauner	48843ba198	Check lateness of block before requeuing it (#4208 ) ## Issue Addressed NA ## Proposed Changes Avoids reprocessing loops introduced in #4179. (Also somewhat related to #4192). Breaks the re-queue loop by only re-queuing when an RPC block is received before the attestation creation deadline. I've put `proposal_is_known` behind a closure to avoid interacting with the `observed_proposers` lock unnecessarily. ## Additional Info NA	2023-04-19 04:23:20 +00:00
Paul Hauner	dd124b2d68	Address observed proposers behaviour (#4192 ) ## Issue Addressed NA ## Proposed Changes Apply two changes to code introduced in #4179: 1. Remove the `ERRO` log for when we error on `proposer_has_been_observed()`. We were seeing a lot of this in our logs for finalized blocks and it's a bit noisy. 1. Use `false` rather than `true` for `proposal_already_known` when there is an error. If a block raises an error in `proposer_has_been_observed()` then the block must be invalid, so we should process (and reject) it now rather than queuing it. For reference, here is one of the offending `ERRO` logs: ``` ERRO Failed to check observed proposers block_root: 0x5845…878e, source: rpc, error: FinalizedBlock { slot: Slot(5410983), finalized_slot: Slot(5411232) } ``` ## Additional Info NA	2023-04-14 06:37:16 +00:00
Michael Sproul	a3669abac5	Avoid processing redundant RPC blocks (#4179 ) ## Proposed Changes We already make some attempts to avoid processing RPC blocks when a block from the same proposer is already being processed through gossip. This PR strengthens that guarantee by using the existing cache for `observed_block_producers` to inform whether an RPC block's processing should be delayed.	2023-04-13 07:05:02 +00:00
Pawan Dhananjay	3b117f4bf6	Add a flag to disable peer scoring (#4135 ) ## Issue Addressed N/A ## Proposed Changes Adds a flag for disabling peer scoring. This is useful for local testing and testing small networks for new features.	2023-04-12 01:48:19 +00:00
Diva M	911a63559b	Merge branch 'eip4844' into deneb-free-blobs	2023-04-05 13:33:33 -05:00
Pawan Dhananjay	1b8225c76d	Revert upgrade to tokio utils to reprocessing queue (#4167 )	2023-04-05 11:43:39 -05:00
Diva M	32f9ba04d7	fix merge conflict	2023-04-04 12:10:51 -05:00
Diva M	3c1a22ceaf	Merge commit '1e029ce5384e911390a513e2d1885532f34a8b2b' into eip4844	2023-04-04 11:56:54 -05:00
Jimmy Chen	2de3451011	Rate limiting backfill sync (#3936 ) ## Issue Addressed #3212 ## Proposed Changes - Introduce a new `rate_limiting_backfill_queue` - any new inbound backfill work events gets immediately sent to this FIFO queue without any processing - Spawn a `backfill_scheduler` routine that pops a backfill event from the FIFO queue at specified intervals (currently halfway through a slot, or at 6s after slot start for 12s slots) and sends the event to `BeaconProcessor` via a `scheduled_backfill_work_tx` channel - This channel gets polled last in the `InboundEvents`, and work event received is wrapped in a `InboundEvent::ScheduledBackfillWork` enum variant, which gets processed immediately or queued by the `BeaconProcessor` (existing logic applies from here) Diagram comparing backfill processing with / without rate-limiting: https://github.com/sigp/lighthouse/issues/3212#issuecomment-1386249922 See this comment for @paulhauner's explanation and solution: https://github.com/sigp/lighthouse/issues/3212#issuecomment-1384674956 ## Additional Info I've compared this branch (with backfill processing rate limited to to 1 and 3 batches per slot) against the latest stable version. The CPU usage during backfill sync is reduced by ~5% - 20%, more details on this page: https://hackmd.io/@jimmygchen/SJuVpJL3j The above testing is done on Goerli (as I don't currently have hardware for Mainnet), I'm guessing the differences are likely to be bigger on mainnet due to block size. ### TODOs - [x] Experiment with processing multiple batches per slot. (need to think about how to do this for different slot durations) - [x] Add option to disable rate-limiting, enabed by default. - [x] (No longer required now we're reusing the reprocessing queue) Complete the `backfill_scheduler` task when backfill sync is completed or not required	2023-04-03 03:02:55 +00:00
realbigsean	deec9c51ba	clean up blob by root response (#4136 )	2023-03-28 12:49:32 -04:00
realbigsean	d24e5cc22a	clean up blobs by range response (#4137 )	2023-03-28 12:49:19 -04:00
realbigsean	af974dc0b8	use block wrapper in sync pairing (#4131 )	2023-03-26 19:18:54 -04:00
realbigsean	a5addf661c	Rename eip4844 to deneb (#4129 ) * rename 4844 to deneb * rename 4844 to deneb * move excess data gas field * get EF tests working * fix ef tests lint * fix the blob identifier ef test * fix accessed files ef test script * get beacon chain tests passing	2023-03-26 11:49:16 -04:00
Pawan Dhananjay	b276af98b7	Rework block processing (#4092 ) * introduce availability pending block * add intoavailableblock trait * small fixes * add 'gossip blob cache' and start to clean up processing and transition types * shard memory blob cache * Initial commit * Fix after rebase * Add gossip verification conditions * cache cleanup * general chaos * extended chaos * cargo fmt * more progress * more progress * tons of changes, just tryna compile * everything, everywhere, all at once * Reprocess an ExecutedBlock on unavailable blobs * Add sus gossip verification for blobs * Merge stuff * Remove reprocessing cache stuff * lint * Add a wrapper to allow construction of only valid `AvailableBlock`s * rename blob arc list to blob list * merge cleanuo * Revert "merge cleanuo" This reverts commit 5e98326878c77528d0c4668c5a4db4a4b0fbaeaa. * Revert "Revert "merge cleanuo"" This reverts commit 3a4009443a5812b3028abe855079307436dc5419. * fix rpc methods * move beacon block and blob to eth2/types * rename gossip blob cache to data availability checker * lots of changes * fix some compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * cargo fmt * use a common data structure for block import types * fix availability check on proposal import * refactor the blob cache and split the block wrapper into two types * add type conversion for signed block and block wrapper * fix beacon chain tests and do some renaming, add some comments * Partial processing (#4) * move beacon block and blob to eth2/types * rename gossip blob cache to data availability checker * lots of changes * fix some compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * fix compilation issues * cargo fmt * use a common data structure for block import types * fix availability check on proposal import * refactor the blob cache and split the block wrapper into two types * add type conversion for signed block and block wrapper * fix beacon chain tests and do some renaming, add some comments * cargo update (#6) --------- Co-authored-by: realbigsean <sean@sigmaprime.io> Co-authored-by: realbigsean <seananderson33@gmail.com>	2023-03-24 17:30:41 -04:00
Diva M	25a2d8f078	Merge branch 'eip4844' into deneb-free-blobs	2023-03-24 14:38:29 -05:00
Diva M	1b9cfcc11b	Merge branch 'unstable' into eip4844	2023-03-24 13:32:50 -05:00
Diva M	7fad926b65	Merge commit '65a5eb829264cb279ed66814c961991ae3a0a04b' into eip4844	2023-03-24 13:24:21 -05:00
ethDreamer	d1e653cfdb	Update Blob Storage Structure (#4104 ) * Initial Changes to Blob Storage * Add Arc to SignedBlobSidecar Definition	2023-03-21 15:33:06 -04:00
Paul Hauner	59e45fe349	Reduce verbosity of reprocess queue logs (#4101 ) ## Issue Addressed NA ## Proposed Changes Replaces #4058 to attempt to reduce `ERRO Failed to send scheduled attestation` spam and provide more information for diagnosis. With this PR we achieve: - When dequeuing attestations after a block is received, send only one log which reports `n` failures (rather than `n` logs reporting `n` failures). - Make a distinction in logs between two separate attestation dequeuing events. - Add more information to both log events to help assist with troubleshooting. ## Additional Info NA	2023-03-21 05:15:00 +00:00
ethDreamer	65a5eb8292	Reconstruct Payloads using Payload Bodies Methods (#4028 ) ## Issue Addressed * #3895 Co-authored-by: ethDreamer <37123614+ethDreamer@users.noreply.github.com> Co-authored-by: Michael Sproul <michael@sigmaprime.io>	2023-03-19 23:15:59 +00:00
Diva M	78414333a2	Merge branch 'eip4844' into deneb-free-blobs	2023-03-17 16:39:17 -05:00
Diva M	607242c127	Merge branch 'unstable' into eip4844	2023-03-17 16:26:51 -05:00
Michael Sproul	4c2d4af6cd	Make more noise when the EL is broken (#3986 ) ## Issue Addressed Closes #3814, replaces #3818. ## Proposed Changes * Add a WARN log for the case where we are attempting to sync chain segments but can't process them because they're building on an invalid parent. The most common case where we see this is when the execution node database is corrupt, causing sync to stall mysteriously (because we're currently logging the failure only at debug level). * Additionally I've bumped up the logging for invalid execution payloads to `WARN`. This may result in some duplicate logs as we log errors from the `beacon_chain` and then again from the beacon processor. Invalid payloads and corrupt DBs _should_ be rare enough that this doesn't produce overwhelming log volume.	2023-03-17 00:44:02 +00:00
Divma	3c18e1a3a4	thread blocks and blobs to sync (#4100 ) * thread blocks and blobs to sync * satisfy dead code analysis	2023-03-16 19:20:39 -05:00
Age Manning	3d99ce25f8	Correct a race condition when dialing peers (#4056 ) There is a race condition which occurs when multiple discovery queries return at almost the exact same time and they independently contain a useful peer we would like to connect to. The condition can occur that we can add the same peer to the dial queue, before we get a chance to process the queue. This ends up displaying an error to the user: ``` ERRO Dialing an already dialing peer ``` Although this error is harmless it's not ideal. There are two solutions to resolving this: 1. As we decide to dial the peer, we change the state in the peer-db to dialing (before we add it to the queue) which would prevent other requests from adding to the queue. 2. We prevent duplicates in the dial queue This PR has opted for 2. because 1. will complicate the code in that we are changing states in non-intuitive places. Although this technically adds a very slight performance cost, its probably a cleaner solution as we can keep the state-changing logic in one place.	2023-03-16 05:44:54 +00:00
realbigsean	b303d2fb7e	lints	2023-03-15 15:32:22 -04:00
Diva M	4a39e43f96	Merge branch 'eip4844' into deneb-free-blobs	2023-03-15 12:26:30 -05:00
Divma	2c9477de43	Fix block and blob coupling in the network context (#4086 ) * update docs * introduce a temp enum to model an adjusted `BlockWrapper` and fix blob coupling * fix compilation issue * fix blob coupling in the network context * review comments	2023-03-15 11:04:45 -05:00
Jimmy Chen	2ef3ebbef3	Update SignedBlobSidecar container (#4078 )	2023-03-15 11:03:56 -05:00
Daniel Ramirez Chiquillo	1ec3041673	Remove Router/Processor Code (#4002 ) ## Issue Addressed #3938 ## Proposed Changes - `network::Processor` is deleted and all it's logic is moved to `network::Router`. - The `network::Router` module is moved to a single file. - The following functions are deleted: `on_disconnect` `send_status` `on_status_response` `on_blocks_by_root_request` `on_lightclient_bootstrap` `on_blocks_by_range_request` `on_block_gossip` `on_unaggregated_attestation_gossip` `on_aggregated_attestation_gossip` `on_voluntary_exit_gossip` `on_proposer_slashing_gossip` `on_attester_slashing_gossip` `on_sync_committee_signature_gossip` `on_sync_committee_contribution_gossip` `on_light_client_finality_update_gossip` `on_light_client_optimistic_update_gossip`. This deletions are possible because the updated `Router` allows the underlying methods to be called directly.	2023-03-15 01:27:47 +00:00
Diva M	7f2e9b80bb	Merge branch 'unstable' into eip4844	2023-03-14 12:00:32 -05:00
Divma	e190ebb8a0	Support for Ipv6 (#4046 ) ## Issue Addressed Add support for ipv6 and dual stack in lighthouse. ## Proposed Changes From an user perspective, now setting an ipv6 address, optionally configuring the ports should feel exactly the same as using an ipv4 address. If listening over both ipv4 and ipv6 then the user needs to: - use the `--listen-address` two times (ipv4 and ipv6 addresses) - `--port6` becomes then required - `--discovery-port6` can now be used to additionally configure the ipv6 udp port ### Rough list of code changes - Discovery: - Table filter and ip mode set to match the listening config. - Ipv6 address, tcp port and udp port set in the ENR builder - Reported addresses now check which tcp port to give to libp2p - LH Network Service: - Can listen over Ipv6, Ipv4, or both. This uses two sockets. Using mapped addresses is disabled from libp2p and it's the most compatible option. - NetworkGlobals: - No longer stores udp port since was not used at all. Instead, stores the Ipv4 and Ipv6 TCP ports. - NetworkConfig: - Update names to make it clear that previous udp and tcp ports in ENR were Ipv4 - Add fields to configure Ipv6 udp and tcp ports in the ENR - Include advertised enr Ipv6 address. - Add type to model Listening address that's either Ipv4, Ipv6 or both. A listening address includes the ip, udp port and tcp port. - UPnP: - Kept only for ipv4 - Cli flags: - `--listen-addresses` now can take up to two values - `--port` will apply to ipv4 or ipv6 if only one listening address is given. If two listening addresses are given it will apply only to Ipv4. - `--port6` New flag required when listening over ipv4 and ipv6 that applies exclusively to Ipv6. - `--discovery-port` will now apply to ipv4 and ipv6 if only one listening address is given. - `--discovery-port6` New flag to configure the individual udp port of ipv6 if listening over both ipv4 and ipv6. - `--enr-udp-port` Updated docs to specify that it only applies to ipv4. This is an old behaviour. - `--enr-udp6-port` Added to configure the enr udp6 field. - `--enr-tcp-port` Updated docs to specify that it only applies to ipv4. This is an old behaviour. - `--enr-tcp6-port` Added to configure the enr tcp6 field. - `--enr-addresses` now can take two values. - `--enr-match` updated behaviour. - Common: - rename `unused_port` functions to specify that they are over ipv4. - add functions to get unused ports over ipv6. - Testing binaries - Updated code to reflect network config changes and unused_port changes. ## Additional Info TODOs: - use two sockets in discovery. I'll get back to this and it's on https://github.com/sigp/discv5/pull/160 - lcli allow listening over two sockets in generate_bootnodes_enr - add at least one smoke flag for ipv6 (I have tested this and works for me) - update the book	2023-03-14 01:13:34 +00:00
Diva M	ae3e5f73d6	fmt	2023-03-10 11:24:22 -05:00
Divma	140bdd370d	update code paths in the `network` crate (#4065 ) * wip * fix router * arc the byroot responses we send * add placeholder for blob verification * respond to blobs by range and blobs by root request in the most horrible and gross way ever * everything in sync is now unimplemented * fix compiation issues * http_pi change is very small, just add it * remove ctrl-c ctrl-v's docs	2023-03-10 16:52:31 +05:30
Divma	545532a883	fix rpc types to free the blobs (#4059 ) * rename to follow name in spec * use roots and indexes * wip * fix req/resp types * move blob identifier to consensus types	2023-03-07 16:28:45 -05:00
Diva M	bf40acd9df	adjust constant to spec values and names	2023-03-06 17:32:40 -05:00
Diva M	f16e82ab2c	Merge branch 'unstable' into eip4844	2023-03-03 14:14:18 -05:00
Diva M	d93753cc88	Merge branch 'unstable' into off-4844	2023-03-02 15:38:00 -05:00
Pawan Dhananjay	5b18fd92cb	Cleaner logic for gossip subscriptions for new forks (#4030 ) ## Issue Addressed Cleaner resolution for #4006 ## Proposed Changes We are currently subscribing to core topics of new forks way before the actual fork since we had just a single `CORE_TOPICS` array. This PR separates the core topics for every fork and subscribes to only required topics based on the current fork. Also adds logic for subscribing to the core topics of a new fork only 2 slots before the fork happens. 2 slots is to give enough time for the gossip meshes to form. Currently doesn't add logic to remove topics from older forks in new forks. For e.g. in the coupled 4844 world, we had to remove the `BeaconBlock` topic in favour of `BeaconBlocksAndBlobsSidecar` at the 4844 fork. It should be easy enough to add though. Not adding it because I'm assuming that #4019 will get merged before this PR and we won't require any deletion logic. Happy to add it regardless though.	2023-03-01 09:22:48 +00:00
Divma	047c7544e3	Clean capella (#4019 ) ## Issue Addressed Cleans up all the remnants of 4844 in capella. This makes sure when 4844 is reviewed there is nothing we are missing because it got included here ## Proposed Changes drop a bomb on every 4844 thing ## Additional Info Merge process I did (locally) is as follows: - squash merge to produce one commit - in new branch off unstable with the squashed commit create a `git revert HEAD` commit - merge that new branch onto 4844 with `--strategy ours` - compare local 4844 to remote 4844 and make sure the diff is empty - enjoy Co-authored-by: Paul Hauner <paul@paulhauner.com>	2023-03-01 03:19:02 +00:00
Paul Hauner	9c81be8ac4	Fix metric (#4020 )	2023-02-22 09:46:45 +11:00
Michael Sproul	066c27750a	Merge remote-tracking branch 'origin/staging' into capella-update	2023-02-17 12:05:36 +11:00
Divma	ffeb8b6e05	blacklist tests in windows (#3961 ) ## Issue Addressed Windows tests for subscription and unsubscriptions fail in CI sporadically. We usually ignore this failures, so this PR aims to help reduce the failure noise. Associated issue is https://github.com/sigp/lighthouse/issues/3960	2023-02-16 23:34:30 +00:00
realbigsean	b805fa6279	merge with upstream	2023-02-15 14:20:12 -05:00
Emilia Hane	2672cf40bb	Better fix for debug tests	2023-02-15 11:47:56 +01:00
Emilia Hane	13efd47238	fixup! Disable use of system time in tests	2023-02-15 09:20:30 +01:00
Emilia Hane	9e4abc79fb	Comment out tests that use system time	2023-02-14 14:12:50 +01:00
Emilia Hane	73c7ad73b8	Disable use of system time in tests	2023-02-14 13:33:38 +01:00
Michael Sproul	18c8cab4da	Merge remote-tracking branch 'origin/unstable' into capella-merge	2023-02-14 12:07:27 +11:00
realbigsean	d2ecbd942e	fix a couple new lints	2023-02-13 17:13:47 -05:00
realbigsean	cd8757de1c	Revert "make batch size check compile time panic" This reverts commit `68f2484efc`.	2023-02-13 16:51:55 -05:00
realbigsean	68f2484efc	make batch size check compile time panic	2023-02-13 16:51:46 -05:00
realbigsean	4c3561dcaf	make batch size check compile time panic	2023-02-13 16:50:33 -05:00
realbigsean	fc2d07b4e3	allow unused	2023-02-13 16:36:38 -05:00
realbigsean	28702c9d5d	merge upstream, add back `get_blobs` logic	2023-02-13 16:29:21 -05:00
Paul Hauner	84843d67d7	Reduce some EE and builder related ERRO logs to WARN (#3966 ) ## Issue Addressed NA ## Proposed Changes Our `ERRO` stream has been rather noisy since the merge due to some unexpected behaviours of builders and EEs. Now that we've been running post-merge for a while, I think we can drop some of these `ERRO` to `WARN` so we're not "crying wolf". The modified logs are: #### `ERRO Execution engine call failed` I'm seeing this quite frequently on Geth nodes. They seem to timeout when they're busy and it rarely indicates a serious issue. We also have logging across block import, fork choice updating and payload production that raise `ERRO` or `CRIT` when the EE times out, so I think we're not at risk of silencing actual issues. #### `ERRO "Builder failed to reveal payload"` In #3775 we reduced this log from `CRIT` to `ERRO` since it's common for builders to fail to reveal the block to the producer directly whilst still broadcasting it to the networ. I think it's worth dropping this to `WARN` since it's rarely interesting. I elected to stay with `WARN` since I really do wish builders would fulfill their API promises by returning the block to us. Perhaps I'm just being pedantic here, I could be convinced otherwise. #### `ERRO "Relay error when registering validator(s)"` It seems like builders and/or mev-boost struggle to handle heavy loads of validator registrations. I haven't observed issues with validators not actually being registered, but I see timeouts on these endpoints many times a day. It doesn't seem like this `ERRO` is worth it. #### `ERRO Error fetching block for peer ExecutionLayerErrorPayloadReconstruction` This means we failed to respond to a peer on the P2P network with a block they requested because of an error in the `execution_layer`. It's very common to see timeouts or incomplete responses on this endpoint whilst the EE is busy and I don't think it's important enough for an `ERRO`. As long as the peer count stays high, I don't think the user needs to be actively concerned about how we're responding to peers. ## Additional Info NA	2023-02-12 23:14:08 +00:00
Emilia Hane	4d3ff347a3	Fixes after rebasing eip4844	2023-02-10 15:34:58 +01:00
Emilia Hane	5437dcae9c	Fix conflicts rebasing eip4844	2023-02-10 15:34:58 +01:00
Emilia Hane	7545ae9e9b	fixup! Fix block lookup debug tests	2023-02-10 15:34:46 +01:00
Emilia Hane	6beca6defc	Fix range sync tests	2023-02-10 09:41:24 +01:00
Emilia Hane	e9e198a2b6	Fix conflicts rebasing eip4844	2023-02-10 09:41:23 +01:00
Emilia Hane	d292a3a6a8	Fix conflicts rebasing eip4844	2023-02-10 09:41:23 +01:00
Emilia Hane	09370e70d9	Fix rebase conflicts	2023-02-10 09:41:19 +01:00
Emilia Hane	8365d76277	fixup! Debug tests	2023-02-10 09:39:22 +01:00
Emilia Hane	16cb9cfca2	fixup! Debug tests	2023-02-10 09:39:22 +01:00
Emilia Hane	7220f35ff6	Debug tests	2023-02-10 09:39:21 +01:00
Emilia Hane	995b2715f2	Fix network block_lookups test	2023-02-10 09:39:21 +01:00
Emilia Hane	3676ce78b5	Fix rebase conflicts	2023-02-10 09:39:21 +01:00
Emilia Hane	56c84178f2	Fix conflicts rebasing eip4844	2023-02-08 11:44:44 +01:00
realbigsean	a42d07592c	fix compilation issues after merge	2023-02-07 12:33:29 -05:00
realbigsean	26a296246d	Merge branch 'capella' of https://github.com/sigp/lighthouse into eip4844 # Conflicts: # beacon_node/beacon_chain/src/beacon_chain.rs # beacon_node/beacon_chain/src/block_verification.rs # beacon_node/beacon_chain/src/test_utils.rs # beacon_node/execution_layer/src/engine_api.rs # beacon_node/execution_layer/src/engine_api/http.rs # beacon_node/execution_layer/src/lib.rs # beacon_node/execution_layer/src/test_utils/handle_rpc.rs # beacon_node/http_api/src/lib.rs # beacon_node/http_api/tests/fork_tests.rs # beacon_node/network/src/beacon_processor/mod.rs # beacon_node/network/src/beacon_processor/work_reprocessing_queue.rs # beacon_node/network/src/beacon_processor/worker/sync_methods.rs # beacon_node/operation_pool/src/bls_to_execution_changes.rs # beacon_node/operation_pool/src/lib.rs # beacon_node/operation_pool/src/persistence.rs # consensus/serde_utils/src/u256_hex_be_opt.rs # testing/antithesis/Dockerfile.libvoidstar	2023-02-07 12:12:56 -05:00
Paul Hauner	e062a7cf76	Broadcast address changes at Capella (#3919 ) * Add first efforts at broadcast * Tidy * Move broadcast code to client * Progress with broadcast impl * Rename to address change * Fix compile errors * Use `while` loop * Tidy * Flip broadcast condition * Switch to forgetting individual indices * Always broadcast when the node starts * Refactor into two functions * Add testing * Add another test * Tidy, add more testing * Tidy * Add test, rename enum * Rename enum again * Tidy * Break loop early * Add V15 schema migration * Bump schema version * Progress with migration * Update beacon_node/client/src/address_change_broadcast.rs Co-authored-by: Michael Sproul <micsproul@gmail.com> * Fix typo in function name --------- Co-authored-by: Michael Sproul <micsproul@gmail.com>	2023-02-07 17:13:49 +11:00
realbigsean	37e7c1d5c7	keep verification of payloads pre 4844	2023-01-27 17:59:40 +01:00
realbigsean	7c8d97c06e	remove unused import	2023-01-25 14:26:01 +01:00
GeemoCandama	f857811e5f	light client optimistic update reprocessing (#3799 ) Currently there is a race between receiving blocks and receiving light client optimistic updates (in unstable), which results in processing errors. This is a continuation of PR #3693 and seeks to progress on issue #3651 Add the parent_root to ReprocessQueueMessage::BlockImported so we can remove blocks from queue when a block arrives that has the same parent root. We use the parent root as opposed to the block_root because the LightClientOptimisticUpdate does not contain the block_root. If light_client_optimistic_update.attested_header.canonical_root() != head_block.message().parent_root() then we queue the update. Otherwise we process immediately. michaelsproul came up with this idea. The code was heavily based off of the attestation reprocessing. I have not properly tested this to see if it works as intended.	2023-01-25 14:23:33 +01:00
Michael Sproul	a4cfe50ade	Import BLS to execution changes before Capella (#3892 ) * Import BLS to execution changes before Capella * Test for BLS to execution change HTTP API * Pack BLS to execution changes in LIFO order * Remove unused var * Clippy	2023-01-25 14:21:54 +01:00
Age Manning	528f7181bc	Improve block delay metrics (#3894 ) We recently ran a large-block experiment on the testnet and plan to do a further experiment on mainnet. Although the metrics recovered from lighthouse nodes were quite useful, I think we could do with greater resolution in the block delay metrics and get some specific values for each block (currently these can be lost to large exponential histogram buckets). This PR increases the resolution of the block delay histogram buckets, but also introduces a new metric which records the last block delay. Depending on the polling resolution of the metric server, we can lose some block delay information, however it will always give us a specific value and we will not lose exact data based on poor resolution histogram buckets.	2023-01-25 14:21:53 +01:00
realbigsean	5e8d79891b	merge conflict resolution	2023-01-25 11:10:44 +01:00
Michael Sproul	c76a1971cc	Merge remote-tracking branch 'origin/unstable' into capella	2023-01-25 14:20:16 +11:00
GeemoCandama	a7351c00c0	light client optimistic update reprocessing (#3799 ) ## Issue Addressed Currently there is a race between receiving blocks and receiving light client optimistic updates (in unstable), which results in processing errors. This is a continuation of PR #3693 and seeks to progress on issue #3651 ## Proposed Changes Add the parent_root to ReprocessQueueMessage::BlockImported so we can remove blocks from queue when a block arrives that has the same parent root. We use the parent root as opposed to the block_root because the LightClientOptimisticUpdate does not contain the block_root. If light_client_optimistic_update.attested_header.canonical_root() != head_block.message().parent_root() then we queue the update. Otherwise we process immediately. ## Additional Info michaelsproul came up with this idea. The code was heavily based off of the attestation reprocessing. I have not properly tested this to see if it works as intended.	2023-01-24 22:17:50 +00:00
realbigsean	d3240c1ffb	fix common issue across blocks by range and blobs by range	2023-01-24 15:42:28 +01:00
realbigsean	18d4faf611	review updates	2023-01-24 15:30:29 +01:00
realbigsean	2225e6ac89	pass in data availability boundary to the get_blobs method	2023-01-24 14:35:07 +01:00
realbigsean	b658cc7aaf	simplify checking attester cache for block and blobs. use ResourceUnavailable according to the spec	2023-01-24 10:50:47 +01:00
Emilia Hane	e14550425d	Fix mismatched response bug	2023-01-23 13:23:04 +01:00
Emilia Hane	81a754577d	fixup! Improve error handling	2023-01-21 15:47:33 +01:00
Emilia Hane	f32f08eec0	Fix typo	2023-01-21 14:47:14 +01:00
Emilia Hane	5fc648217d	fixup! Improve error handling	2023-01-21 14:46:24 +01:00
realbigsean	cbd09dc281	finish refactor	2023-01-21 04:48:25 -05:00
Michael Sproul	d8abf2fc41	Import BLS to execution changes before Capella (#3892 ) * Import BLS to execution changes before Capella * Test for BLS to execution change HTTP API * Pack BLS to execution changes in LIFO order * Remove unused var * Clippy	2023-01-21 10:39:59 +11:00
Michael Sproul	bb0e99c097	Merge remote-tracking branch 'origin/unstable' into capella	2023-01-21 10:37:26 +11:00
Emilia Hane	f7eb89ddd9	Improve error handling	2023-01-20 21:16:47 +01:00
realbigsean	c6479444c2	don't send errors when we correctly don't have blobs	2023-01-20 21:16:47 +01:00
realbigsean	e1ce4e5b78	make explicity BlobsUnavailable error and handle it directly	2023-01-20 21:16:47 +01:00
realbigsean	f7f64eb007	fix/consolidate some error handling	2023-01-20 21:16:47 +01:00
Emilia Hane	89cb58d17b	Fix typo Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:47 +01:00
Emilia Hane	9cc25162e2	Send error message if eip4844 fork disabled Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:46 +01:00
Emilia Hane	654e59cbba	Fix rename fn bug Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:46 +01:00
Emilia Hane	b4ec4c1ccf	Less strict handling of faulty rpc req params and syntax improvement	2023-01-20 21:16:46 +01:00
Emilia Hane	9445ac70d8	Check data availability boundary in rpc request	2023-01-20 21:16:46 +01:00
realbigsean	3cb8fb7973	block wrapper refactor initial commit	2023-01-20 11:50:16 -05:00
Age Manning	f8a3b3b95a	Improve block delay metrics (#3894 ) We recently ran a large-block experiment on the testnet and plan to do a further experiment on mainnet. Although the metrics recovered from lighthouse nodes were quite useful, I think we could do with greater resolution in the block delay metrics and get some specific values for each block (currently these can be lost to large exponential histogram buckets). This PR increases the resolution of the block delay histogram buckets, but also introduces a new metric which records the last block delay. Depending on the polling resolution of the metric server, we can lose some block delay information, however it will always give us a specific value and we will not lose exact data based on poor resolution histogram buckets.	2023-01-20 00:46:56 +00:00
realbigsean	ddcd10b194	merge latest capella changes	2023-01-16 09:17:18 -05:00
realbigsean	1319683736	Update gossip_methods.rs	2023-01-13 14:59:03 -05:00
Mark Mackey	05c1291d8a	Don't Penalize Early `bls_to_execution_change`	2023-01-13 12:53:25 -06:00
realbigsean	06f71e8cce	merge capella	2023-01-12 12:51:09 -05:00
Michael Sproul	2af8110529	Merge remote-tracking branch 'origin/unstable' into capella Fixing the conflicts involved patching up some of the `block_hash` verification, the rest will be done as part of https://github.com/sigp/lighthouse/issues/3870	2023-01-12 16:22:00 +11:00
realbigsean	438126f19a	merge upstream, fix compile errors	2023-01-11 13:52:58 -05:00
Paul Hauner	830efdb5c2	Improve validator monitor experience for high validator counts (#3728 ) ## Issue Addressed NA ## Proposed Changes Myself and others (#3678) have observed that when running with lots of validators (e.g., 1000s) the cardinality is too much for Prometheus. I've seen Prometheus instances just grind to a halt when we turn the validator monitor on for our testnet validators (we have 10,000s of Goerli validators). Additionally, the debug log volume can get very high with one log per validator, per attestation. To address this, the `bn --validator-monitor-individual-tracking-threshold <INTEGER>` flag has been added to disable per-validator (i.e., non-aggregated) metrics/logging once the validator monitor exceeds the threshold of validators. The default value is `64`, which is a finger-to-the-wind value. I don't actually know the value at which Prometheus starts to become overwhelmed, but I've seen it work with ~64 validators and I've seen it not work with 1000s of validators. A default of `64` seems like it will result in a breaking change to users who are running millions of dollars worth of validators whilst resulting in a no-op for low-validator-count users. I'm open to changing this number, though. Additionally, this PR starts collecting aggregated Prometheus metrics (e.g., total count of head hits across all validators), so that high-validator-count validators still have some interesting metrics. We already had logging for aggregated values, so nothing has been added there. I've opted to make this a breaking change since it can be rather damaging to your Prometheus instance to accidentally enable the validator monitor with large numbers of validators. I've crashed a Prometheus instance myself and had a report from another user who's done the same thing. ## Additional Info NA ## Breaking Changes Note A new label has been added to the validator monitor Prometheus metrics: `total`. This label tracks the aggregated metrics of all validators in the validator monitor (as opposed to each validator being tracking individually using its pubkey as the label). Additionally, a new flag has been added to the Beacon Node: `--validator-monitor-individual-tracking-threshold`. The default value is `64`, which means that when the validator monitor is tracking more than 64 validators then it will stop tracking per-validator metrics and only track the `all_validators` metric. It will also stop logging per-validator logs and only emit aggregated logs (the exception being that exit and slashing logs are always emitted). These changes were introduced in #3728 to address issues with untenable Prometheus cardinality and log volume when using the validator monitor with high validator counts (e.g., 1000s of validators). Users with less than 65 validators will see no change in behavior (apart from the added `all_validators` metric). Users with more than 65 validators who wish to maintain the previous behavior can set something like `--validator-monitor-individual-tracking-threshold 999999`.	2023-01-09 08:18:55 +00:00
Michael Sproul	4bd2b777ec	Verify execution block hashes during finalized sync (#3794 ) ## Issue Addressed Recent discussions with other client devs about optimistic sync have revealed a conceptual issue with the optimisation implemented in #3738. In designing that feature I failed to consider that the execution node checks the `blockHash` of the execution payload before responding with `SYNCING`, and that omitting this check entirely results in a degradation of the full node's validation. A node omitting the `blockHash` checks could be tricked by a supermajority of validators into following an invalid chain, something which is ordinarily impossible. ## Proposed Changes I've added verification of the `payload.block_hash` in Lighthouse. In case of failure we log a warning and fall back to verifying the payload with the execution client. I've used our existing dependency on `ethers_core` for RLP support, and a new dependency on Parity's `triehash` crate for the Merkle patricia trie. Although the `triehash` crate is currently unmaintained it seems like our best option at the moment (it is also used by Reth, and requires vastly less boilerplate than Parity's generic `trie-root` library). Block hash verification is pretty quick, about 500us per block on my machine (mainnet). The optimistic finalized sync feature can be disabled using `--disable-optimistic-finalized-sync` which forces full verification with the EL. ## Additional Info This PR also introduces a new dependency on our [`metastruct`](https://github.com/sigp/metastruct) library, which was perfectly suited to the RLP serialization method. There will likely be changes as `metastruct` grows, but I think this is a good way to start dogfooding it. I took inspiration from some Parity and Reth code while writing this, and have preserved the relevant license headers on the files containing code that was copied and modified.	2023-01-09 03:11:59 +00:00
Emilia Hane	c44738c77b	Undo response modification in commit `597363d2f`	2023-01-06 12:42:21 +01:00
Emilia Hane	74bca46fc2	Fix bug of early termination of batch send	2023-01-06 11:45:13 +01:00
Emilia Hane	597363d2f9	Don't send empty blobs sidecar for blobs by range request	2023-01-05 16:28:59 +01:00
realbigsean	d8f7277beb	cleanup	2022-12-30 11:00:14 -05:00
sean	40c6daa34b	add pawan's suggestsion	2022-12-28 18:27:21 +00:00
realbigsean	8a70d80a2f	Revert "Revert "renames, remove , wrap BlockWrapper enum to make descontruction private"" This reverts commit `1931a442dc`.	2022-12-28 10:31:18 -05:00
realbigsean	1931a442dc	Revert "renames, remove , wrap BlockWrapper enum to make descontruction private" This reverts commit `5b3b34a9d7`.	2022-12-28 10:30:36 -05:00
realbigsean	5b3b34a9d7	renames, remove , wrap BlockWrapper enum to make descontruction private	2022-12-28 10:28:45 -05:00
realbigsean	502b5e5bf0	unused error lint	2022-12-28 09:32:29 -05:00
Diva M	6bf439befd	Merge branch 'eip4844' into empty-blobs	2022-12-23 17:38:59 -05:00
Divma	240854750c	cleanup: remove unused imports, unusued fields (#3834 )	2022-12-23 17:16:10 -05:00
realbigsean	5e11edc612	fix blob validation for empty blobs	2022-12-23 12:47:38 -05:00
Diva M	24087f104d	add the batch type to the Batch's KV	2022-12-23 10:49:46 -05:00
Diva M	901764b8f0	backfill batches need to be of just one epoch	2022-12-23 10:32:59 -05:00
realbigsean	f45d117e73	merge with capella	2022-12-23 10:21:18 -05:00
realbigsean	4d50fa36bc	Merge pull request #3829 from divagant-martian/handle-no-blob-range-response Handle peers sending no blob when the blob is empty in range responses	2022-12-23 10:15:30 -05:00
Diva M	66f9aa922d	clean up and improvements	2022-12-23 09:52:10 -05:00
Diva M	3643f5cc19	spelling	2022-12-22 17:47:36 -05:00
Diva M	48ff56d9cb	spelling	2022-12-22 17:38:55 -05:00
Diva M	e24f6c93d9	fix ctrl c'd comment	2022-12-22 17:38:16 -05:00
Diva M	fbc147e273	remove unused entry struct	2022-12-22 17:34:01 -05:00
Diva M	cd6655dba9	handle no blobs from peers instead of empty blobs in range requests	2022-12-22 17:30:04 -05:00
realbigsean	61763790d5	Merge pull request #3825 from jimmygchen/small-fixes Various small fixes to 4844 branch	2022-12-22 17:12:09 -05:00
realbigsean	33d01a7911	miscelaneous fixes on syncing, rpc and responding to peer's sync related requests (#3827 ) - there was a bug in responding range blob requests where we would incorrectly label the first slot of an epoch as a non-skipped slot if it were skipped. this bug did not exist in the code for responding to block range request because the logic error was mitigated by defensive coding elsewhere - there was a bug where a block received during range sync without a corresponding blob (and vice versa) was incorrectly interpreted as a stream termination - RPC size limit fixes. - Our blob cache was dead locking so I removed use of it for now. - Because of our change in finalized sync batch size from 2 to 1 and our transition to using exact epoch boundaries for batches (rather than one slot past the epoch boundary), we need to sync finalized sync to 2 epochs + 1 slot past our peer's finalized slot in order to finalize the chain locally. - use fork context bytes in rpc methods on both the server and client side	2022-12-21 15:50:51 -05:00
Jimmy Chen	f7bb458c5e	Fix incorrect logging	2022-12-22 02:01:11 +11:00

1 2 3 4 5 ...

811 Commits