lighthouse

Author	SHA1	Message	Date
Emilia Hane	8365d76277	fixup! Debug tests	2023-02-10 09:39:22 +01:00
Emilia Hane	16cb9cfca2	fixup! Debug tests	2023-02-10 09:39:22 +01:00
Emilia Hane	7220f35ff6	Debug tests	2023-02-10 09:39:21 +01:00
Emilia Hane	995b2715f2	Fix network block_lookups test	2023-02-10 09:39:21 +01:00
Emilia Hane	3676ce78b5	Fix rebase conflicts	2023-02-10 09:39:21 +01:00
Emilia Hane	56c84178f2	Fix conflicts rebasing eip4844	2023-02-08 11:44:44 +01:00
realbigsean	a42d07592c	fix compilation issues after merge	2023-02-07 12:33:29 -05:00
realbigsean	26a296246d	Merge branch 'capella' of https://github.com/sigp/lighthouse into eip4844 # Conflicts: # beacon_node/beacon_chain/src/beacon_chain.rs # beacon_node/beacon_chain/src/block_verification.rs # beacon_node/beacon_chain/src/test_utils.rs # beacon_node/execution_layer/src/engine_api.rs # beacon_node/execution_layer/src/engine_api/http.rs # beacon_node/execution_layer/src/lib.rs # beacon_node/execution_layer/src/test_utils/handle_rpc.rs # beacon_node/http_api/src/lib.rs # beacon_node/http_api/tests/fork_tests.rs # beacon_node/network/src/beacon_processor/mod.rs # beacon_node/network/src/beacon_processor/work_reprocessing_queue.rs # beacon_node/network/src/beacon_processor/worker/sync_methods.rs # beacon_node/operation_pool/src/bls_to_execution_changes.rs # beacon_node/operation_pool/src/lib.rs # beacon_node/operation_pool/src/persistence.rs # consensus/serde_utils/src/u256_hex_be_opt.rs # testing/antithesis/Dockerfile.libvoidstar	2023-02-07 12:12:56 -05:00
Paul Hauner	e062a7cf76	Broadcast address changes at Capella (#3919 ) * Add first efforts at broadcast * Tidy * Move broadcast code to client * Progress with broadcast impl * Rename to address change * Fix compile errors * Use `while` loop * Tidy * Flip broadcast condition * Switch to forgetting individual indices * Always broadcast when the node starts * Refactor into two functions * Add testing * Add another test * Tidy, add more testing * Tidy * Add test, rename enum * Rename enum again * Tidy * Break loop early * Add V15 schema migration * Bump schema version * Progress with migration * Update beacon_node/client/src/address_change_broadcast.rs Co-authored-by: Michael Sproul <micsproul@gmail.com> * Fix typo in function name --------- Co-authored-by: Michael Sproul <micsproul@gmail.com>	2023-02-07 17:13:49 +11:00
realbigsean	37e7c1d5c7	keep verification of payloads pre 4844	2023-01-27 17:59:40 +01:00
realbigsean	7c8d97c06e	remove unused import	2023-01-25 14:26:01 +01:00
GeemoCandama	f857811e5f	light client optimistic update reprocessing (#3799 ) Currently there is a race between receiving blocks and receiving light client optimistic updates (in unstable), which results in processing errors. This is a continuation of PR #3693 and seeks to progress on issue #3651 Add the parent_root to ReprocessQueueMessage::BlockImported so we can remove blocks from queue when a block arrives that has the same parent root. We use the parent root as opposed to the block_root because the LightClientOptimisticUpdate does not contain the block_root. If light_client_optimistic_update.attested_header.canonical_root() != head_block.message().parent_root() then we queue the update. Otherwise we process immediately. michaelsproul came up with this idea. The code was heavily based off of the attestation reprocessing. I have not properly tested this to see if it works as intended.	2023-01-25 14:23:33 +01:00
Michael Sproul	a4cfe50ade	Import BLS to execution changes before Capella (#3892 ) * Import BLS to execution changes before Capella * Test for BLS to execution change HTTP API * Pack BLS to execution changes in LIFO order * Remove unused var * Clippy	2023-01-25 14:21:54 +01:00
Age Manning	528f7181bc	Improve block delay metrics (#3894 ) We recently ran a large-block experiment on the testnet and plan to do a further experiment on mainnet. Although the metrics recovered from lighthouse nodes were quite useful, I think we could do with greater resolution in the block delay metrics and get some specific values for each block (currently these can be lost to large exponential histogram buckets). This PR increases the resolution of the block delay histogram buckets, but also introduces a new metric which records the last block delay. Depending on the polling resolution of the metric server, we can lose some block delay information, however it will always give us a specific value and we will not lose exact data based on poor resolution histogram buckets.	2023-01-25 14:21:53 +01:00
realbigsean	5e8d79891b	merge conflict resolution	2023-01-25 11:10:44 +01:00
Michael Sproul	c76a1971cc	Merge remote-tracking branch 'origin/unstable' into capella	2023-01-25 14:20:16 +11:00
GeemoCandama	a7351c00c0	light client optimistic update reprocessing (#3799 ) ## Issue Addressed Currently there is a race between receiving blocks and receiving light client optimistic updates (in unstable), which results in processing errors. This is a continuation of PR #3693 and seeks to progress on issue #3651 ## Proposed Changes Add the parent_root to ReprocessQueueMessage::BlockImported so we can remove blocks from queue when a block arrives that has the same parent root. We use the parent root as opposed to the block_root because the LightClientOptimisticUpdate does not contain the block_root. If light_client_optimistic_update.attested_header.canonical_root() != head_block.message().parent_root() then we queue the update. Otherwise we process immediately. ## Additional Info michaelsproul came up with this idea. The code was heavily based off of the attestation reprocessing. I have not properly tested this to see if it works as intended.	2023-01-24 22:17:50 +00:00
realbigsean	d3240c1ffb	fix common issue across blocks by range and blobs by range	2023-01-24 15:42:28 +01:00
realbigsean	18d4faf611	review updates	2023-01-24 15:30:29 +01:00
realbigsean	2225e6ac89	pass in data availability boundary to the get_blobs method	2023-01-24 14:35:07 +01:00
realbigsean	b658cc7aaf	simplify checking attester cache for block and blobs. use ResourceUnavailable according to the spec	2023-01-24 10:50:47 +01:00
Emilia Hane	e14550425d	Fix mismatched response bug	2023-01-23 13:23:04 +01:00
Emilia Hane	81a754577d	fixup! Improve error handling	2023-01-21 15:47:33 +01:00
Emilia Hane	f32f08eec0	Fix typo	2023-01-21 14:47:14 +01:00
Emilia Hane	5fc648217d	fixup! Improve error handling	2023-01-21 14:46:24 +01:00
realbigsean	cbd09dc281	finish refactor	2023-01-21 04:48:25 -05:00
Michael Sproul	d8abf2fc41	Import BLS to execution changes before Capella (#3892 ) * Import BLS to execution changes before Capella * Test for BLS to execution change HTTP API * Pack BLS to execution changes in LIFO order * Remove unused var * Clippy	2023-01-21 10:39:59 +11:00
Michael Sproul	bb0e99c097	Merge remote-tracking branch 'origin/unstable' into capella	2023-01-21 10:37:26 +11:00
Emilia Hane	f7eb89ddd9	Improve error handling	2023-01-20 21:16:47 +01:00
realbigsean	c6479444c2	don't send errors when we correctly don't have blobs	2023-01-20 21:16:47 +01:00
realbigsean	e1ce4e5b78	make explicity BlobsUnavailable error and handle it directly	2023-01-20 21:16:47 +01:00
realbigsean	f7f64eb007	fix/consolidate some error handling	2023-01-20 21:16:47 +01:00
Emilia Hane	89cb58d17b	Fix typo Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:47 +01:00
Emilia Hane	9cc25162e2	Send error message if eip4844 fork disabled Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:46 +01:00
Emilia Hane	654e59cbba	Fix rename fn bug Co-authored-by: realbigsean <seananderson33@GMAIL.com>	2023-01-20 21:16:46 +01:00
Emilia Hane	b4ec4c1ccf	Less strict handling of faulty rpc req params and syntax improvement	2023-01-20 21:16:46 +01:00
Emilia Hane	9445ac70d8	Check data availability boundary in rpc request	2023-01-20 21:16:46 +01:00
realbigsean	3cb8fb7973	block wrapper refactor initial commit	2023-01-20 11:50:16 -05:00
Age Manning	f8a3b3b95a	Improve block delay metrics (#3894 ) We recently ran a large-block experiment on the testnet and plan to do a further experiment on mainnet. Although the metrics recovered from lighthouse nodes were quite useful, I think we could do with greater resolution in the block delay metrics and get some specific values for each block (currently these can be lost to large exponential histogram buckets). This PR increases the resolution of the block delay histogram buckets, but also introduces a new metric which records the last block delay. Depending on the polling resolution of the metric server, we can lose some block delay information, however it will always give us a specific value and we will not lose exact data based on poor resolution histogram buckets.	2023-01-20 00:46:56 +00:00
realbigsean	ddcd10b194	merge latest capella changes	2023-01-16 09:17:18 -05:00
realbigsean	1319683736	Update gossip_methods.rs	2023-01-13 14:59:03 -05:00
Mark Mackey	05c1291d8a	Don't Penalize Early `bls_to_execution_change`	2023-01-13 12:53:25 -06:00
realbigsean	06f71e8cce	merge capella	2023-01-12 12:51:09 -05:00
Michael Sproul	2af8110529	Merge remote-tracking branch 'origin/unstable' into capella Fixing the conflicts involved patching up some of the `block_hash` verification, the rest will be done as part of https://github.com/sigp/lighthouse/issues/3870	2023-01-12 16:22:00 +11:00
realbigsean	438126f19a	merge upstream, fix compile errors	2023-01-11 13:52:58 -05:00
Paul Hauner	830efdb5c2	Improve validator monitor experience for high validator counts (#3728 ) ## Issue Addressed NA ## Proposed Changes Myself and others (#3678) have observed that when running with lots of validators (e.g., 1000s) the cardinality is too much for Prometheus. I've seen Prometheus instances just grind to a halt when we turn the validator monitor on for our testnet validators (we have 10,000s of Goerli validators). Additionally, the debug log volume can get very high with one log per validator, per attestation. To address this, the `bn --validator-monitor-individual-tracking-threshold <INTEGER>` flag has been added to disable per-validator (i.e., non-aggregated) metrics/logging once the validator monitor exceeds the threshold of validators. The default value is `64`, which is a finger-to-the-wind value. I don't actually know the value at which Prometheus starts to become overwhelmed, but I've seen it work with ~64 validators and I've seen it not work with 1000s of validators. A default of `64` seems like it will result in a breaking change to users who are running millions of dollars worth of validators whilst resulting in a no-op for low-validator-count users. I'm open to changing this number, though. Additionally, this PR starts collecting aggregated Prometheus metrics (e.g., total count of head hits across all validators), so that high-validator-count validators still have some interesting metrics. We already had logging for aggregated values, so nothing has been added there. I've opted to make this a breaking change since it can be rather damaging to your Prometheus instance to accidentally enable the validator monitor with large numbers of validators. I've crashed a Prometheus instance myself and had a report from another user who's done the same thing. ## Additional Info NA ## Breaking Changes Note A new label has been added to the validator monitor Prometheus metrics: `total`. This label tracks the aggregated metrics of all validators in the validator monitor (as opposed to each validator being tracking individually using its pubkey as the label). Additionally, a new flag has been added to the Beacon Node: `--validator-monitor-individual-tracking-threshold`. The default value is `64`, which means that when the validator monitor is tracking more than 64 validators then it will stop tracking per-validator metrics and only track the `all_validators` metric. It will also stop logging per-validator logs and only emit aggregated logs (the exception being that exit and slashing logs are always emitted). These changes were introduced in #3728 to address issues with untenable Prometheus cardinality and log volume when using the validator monitor with high validator counts (e.g., 1000s of validators). Users with less than 65 validators will see no change in behavior (apart from the added `all_validators` metric). Users with more than 65 validators who wish to maintain the previous behavior can set something like `--validator-monitor-individual-tracking-threshold 999999`.	2023-01-09 08:18:55 +00:00
Michael Sproul	4bd2b777ec	Verify execution block hashes during finalized sync (#3794 ) ## Issue Addressed Recent discussions with other client devs about optimistic sync have revealed a conceptual issue with the optimisation implemented in #3738. In designing that feature I failed to consider that the execution node checks the `blockHash` of the execution payload before responding with `SYNCING`, and that omitting this check entirely results in a degradation of the full node's validation. A node omitting the `blockHash` checks could be tricked by a supermajority of validators into following an invalid chain, something which is ordinarily impossible. ## Proposed Changes I've added verification of the `payload.block_hash` in Lighthouse. In case of failure we log a warning and fall back to verifying the payload with the execution client. I've used our existing dependency on `ethers_core` for RLP support, and a new dependency on Parity's `triehash` crate for the Merkle patricia trie. Although the `triehash` crate is currently unmaintained it seems like our best option at the moment (it is also used by Reth, and requires vastly less boilerplate than Parity's generic `trie-root` library). Block hash verification is pretty quick, about 500us per block on my machine (mainnet). The optimistic finalized sync feature can be disabled using `--disable-optimistic-finalized-sync` which forces full verification with the EL. ## Additional Info This PR also introduces a new dependency on our [`metastruct`](https://github.com/sigp/metastruct) library, which was perfectly suited to the RLP serialization method. There will likely be changes as `metastruct` grows, but I think this is a good way to start dogfooding it. I took inspiration from some Parity and Reth code while writing this, and have preserved the relevant license headers on the files containing code that was copied and modified.	2023-01-09 03:11:59 +00:00
Emilia Hane	c44738c77b	Undo response modification in commit `597363d2f`	2023-01-06 12:42:21 +01:00
Emilia Hane	74bca46fc2	Fix bug of early termination of batch send	2023-01-06 11:45:13 +01:00
Emilia Hane	597363d2f9	Don't send empty blobs sidecar for blobs by range request	2023-01-05 16:28:59 +01:00
realbigsean	d8f7277beb	cleanup	2022-12-30 11:00:14 -05:00
sean	40c6daa34b	add pawan's suggestsion	2022-12-28 18:27:21 +00:00
realbigsean	8a70d80a2f	Revert "Revert "renames, remove , wrap BlockWrapper enum to make descontruction private"" This reverts commit `1931a442dc`.	2022-12-28 10:31:18 -05:00
realbigsean	1931a442dc	Revert "renames, remove , wrap BlockWrapper enum to make descontruction private" This reverts commit `5b3b34a9d7`.	2022-12-28 10:30:36 -05:00
realbigsean	5b3b34a9d7	renames, remove , wrap BlockWrapper enum to make descontruction private	2022-12-28 10:28:45 -05:00
realbigsean	502b5e5bf0	unused error lint	2022-12-28 09:32:29 -05:00
Diva M	6bf439befd	Merge branch 'eip4844' into empty-blobs	2022-12-23 17:38:59 -05:00
Divma	240854750c	cleanup: remove unused imports, unusued fields (#3834 )	2022-12-23 17:16:10 -05:00
realbigsean	5e11edc612	fix blob validation for empty blobs	2022-12-23 12:47:38 -05:00
Diva M	24087f104d	add the batch type to the Batch's KV	2022-12-23 10:49:46 -05:00
Diva M	901764b8f0	backfill batches need to be of just one epoch	2022-12-23 10:32:59 -05:00
realbigsean	f45d117e73	merge with capella	2022-12-23 10:21:18 -05:00
realbigsean	4d50fa36bc	Merge pull request #3829 from divagant-martian/handle-no-blob-range-response Handle peers sending no blob when the blob is empty in range responses	2022-12-23 10:15:30 -05:00
Diva M	66f9aa922d	clean up and improvements	2022-12-23 09:52:10 -05:00
Diva M	3643f5cc19	spelling	2022-12-22 17:47:36 -05:00
Diva M	48ff56d9cb	spelling	2022-12-22 17:38:55 -05:00
Diva M	e24f6c93d9	fix ctrl c'd comment	2022-12-22 17:38:16 -05:00
Diva M	fbc147e273	remove unused entry struct	2022-12-22 17:34:01 -05:00
Diva M	cd6655dba9	handle no blobs from peers instead of empty blobs in range requests	2022-12-22 17:30:04 -05:00
realbigsean	61763790d5	Merge pull request #3825 from jimmygchen/small-fixes Various small fixes to 4844 branch	2022-12-22 17:12:09 -05:00
realbigsean	33d01a7911	miscelaneous fixes on syncing, rpc and responding to peer's sync related requests (#3827 ) - there was a bug in responding range blob requests where we would incorrectly label the first slot of an epoch as a non-skipped slot if it were skipped. this bug did not exist in the code for responding to block range request because the logic error was mitigated by defensive coding elsewhere - there was a bug where a block received during range sync without a corresponding blob (and vice versa) was incorrectly interpreted as a stream termination - RPC size limit fixes. - Our blob cache was dead locking so I removed use of it for now. - Because of our change in finalized sync batch size from 2 to 1 and our transition to using exact epoch boundaries for batches (rather than one slot past the epoch boundary), we need to sync finalized sync to 2 epochs + 1 slot past our peer's finalized slot in order to finalize the chain locally. - use fork context bytes in rpc methods on both the server and client side	2022-12-21 15:50:51 -05:00
Jimmy Chen	f7bb458c5e	Fix incorrect logging	2022-12-22 02:01:11 +11:00
Jimmy Chen	ccfd092845	Fix blob request logging and incorrect enum type	2022-12-22 00:22:37 +11:00
realbigsean	5de4f5b8d0	handle parent blob request edge cases correctly. fix data availability boundary check	2022-12-19 11:39:09 -05:00
Mark Mackey	3e90fb8cae	Merge branch 'unstable' into capella	2022-12-15 12:20:03 -06:00
realbigsean	1644978cdb	fix compilation	2022-12-15 10:26:10 -05:00
realbigsean	d893706e0e	merge with capella	2022-12-15 09:33:18 -05:00
Divma	63c74b37f4	send error answering bbrange requests when an error occurrs (#3800 ) ## Issue Addressed While testing withdrawals with @ethDreamer we noticed lighthouse is sending empty batches when an error occurs. As LH peer receiving this, we would consider this a low tolerance action because the peer is claiming the batch is right and is empty. ## Proposed Changes If any kind of error occurs, send a error response instead ## Additional Info Right now we don't handle such thing as a partial batch with an error. If an error is received, the whole batch is discarded. Because of this it makes little sense to send partial batches that end with an error, so it's better to do the proposed solution instead of sending empty batches.	2022-12-15 00:16:38 +00:00
Michael Sproul	991e4094f8	Merge remote-tracking branch 'origin/unstable' into capella-update	2022-12-14 13:00:41 +11:00
GeemoCandama	1b28ef8a8d	Adding light_client gossip topics (#3693 ) ## Issue Addressed Implementing the light_client_gossip topics but I'm not there yet. Which issue # does this PR address? Partially #3651 ## Proposed Changes Add light client gossip topics. Please list or describe the changes introduced by this PR. I'm going to Implement light_client_finality_update and light_client_optimistic_update gossip topics. Currently I've attempted the former and I'm seeking feedback. ## Additional Info I've only implemented the light_client_finality_update topic because I wanted to make sure I was on the correct path. Also checking that the gossiped LightClientFinalityUpdate is the same as the locally constructed one is not implemented because caching the updates will make this much easier. Could someone give me some feedback on this please? Please provide any additional information. For example, future considerations or information useful for reviewers. Co-authored-by: GeemoCandama <104614073+GeemoCandama@users.noreply.github.com>	2022-12-13 06:24:51 +00:00
realbigsean	5a42f6b067	range block or block+blob requests	2022-12-07 15:35:46 -05:00
realbigsean	a0d4aecf30	requests block + blob always post eip4844	2022-12-07 15:30:08 -05:00
realbigsean	6d4fb41b84	fix blob slot validation	2022-12-07 13:49:24 -05:00
realbigsean	6c8b1b323b	merge upstream	2022-12-07 12:27:21 -05:00
ethDreamer	1a39976715	Fixed Compiler Warnings & Failing Tests (#3771 )	2022-12-03 10:42:12 +11:00
realbigsean	8102a01085	merge with upstream	2022-12-01 11:13:07 -05:00
Mark Mackey	8a04c3428e	Merged with `unstable`	2022-11-30 17:29:10 -06:00
Diva M	979a95d62f	handle unknown parents for block-blob pairs wip handle unknown parents for block-blob pairs	2022-11-30 17:21:54 -05:00
realbigsean	2157d91b43	process single block and blob	2022-11-30 11:51:18 -05:00
realbigsean	fc9d0a512d	handle blobs by range requests	2022-11-30 10:02:29 -05:00
realbigsean	422d145902	chain segment processing for blobs	2022-11-30 09:40:15 -05:00
GeemoCandama	3534c85e30	Optimize finalized chain sync by skipping newPayload messages (#3738 ) ## Issue Addressed #3704 ## Proposed Changes Adds is_syncing_finalized: bool parameter for block verification functions. Sets the payload_verification_status to Optimistic if is_syncing_finalized is true. Uses SyncState in NetworkGlobals in BeaconProcessor to retrieve the syncing status. ## Additional Info I could implement FinalizedSignatureVerifiedBlock if you think it would be nicer.	2022-11-29 08:19:27 +00:00
Diva M	e548073602	Merge branch 'blob-syncing' into eip4844-devnet-v3	2022-11-28 15:10:50 -05:00
Diva M	805df307f6	wip	2022-11-28 14:13:12 -05:00
realbigsean	3c9e1abcb7	merge upstream	2022-11-26 10:01:57 -05:00
antondlr	e9bf7f7cc1	remove commas from comma-separated kv pairs (#3737 ) ## Issue Addressed Logs are in comma separated kv list, but the values sometimes contain commas, which breaks parsing	2022-11-25 07:57:10 +00:00
Giulio rebuffo	d5a2de759b	Added LightClientBootstrap V1 (#3711 ) ## Issue Addressed Partially addresses #3651 ## Proposed Changes Adds server-side support for light_client_bootstrap_v1 topic ## Additional Info This PR, creates each time a bootstrap without using cache, I do not know how necessary a cache is in this case as this topic is not supposed to be called frequently and IMHO we can just prevent abuse by using the limiter, but let me know what you think or if there is any caveat to this, or if it is necessary only for the sake of good practice. Co-authored-by: Pawan Dhananjay <pawandhananjay@gmail.com>	2022-11-25 05:19:00 +00:00
Michael Sproul	788b337951	Op pool and gossip for BLS to execution changes (#3726 )	2022-11-25 07:09:26 +11:00
realbigsean	1222404450	Merge branch 'blob-syncing' of https://github.com/realbigsean/lighthouse into blob-sync-kzg	2022-11-24 07:46:04 -05:00
Divma	bf5005244e	Blob syncing (#24 ) * add a rt is_blob_batch * use the mixed type everywhere * glue * more glue * minor fixes * fix range tests * filling in the gaps * moore filling in the gaps	2022-11-24 07:45:38 -05:00
realbigsean	beddcfaac2	get spec tests working and fix json serialization	2022-11-23 18:30:45 -05:00
Diva M	7ed2d35424	get it to compile	2022-11-21 14:53:33 -05:00
realbigsean	e7ee79185b	add blobs cache and fix some block production	2022-11-21 14:09:06 -05:00
realbigsean	dc87156641	block and blob handling progress	2022-11-19 16:53:34 -05:00
realbigsean	45897ad4e1	remove blob wrapper	2022-11-19 15:18:42 -05:00
Diva M	78c72158c8	toy skelleton of sync changes	2022-11-16 13:53:38 -05:00
realbigsean	7162e5e23b	add a bunch of blob coupling boiler plate, add a blobs by root request	2022-11-15 16:43:56 -05:00
realbigsean	fe04d945cc	make signed block + sidecar consensus spec	2022-11-10 14:22:30 -05:00
Divma	84c7d8cc70	Blocklookup data inconsistencies (#3677 ) ## Issue Addressed Closes #3649 ## Proposed Changes Add a regression test for the data inconsistency, catching the problem in `31e88c5533` [here](https://github.com/sigp/lighthouse/actions/runs/3379894044/jobs/5612044797#step:6:2043). When a chain is sent for processing, move it to a separate collection and now the test works, yay! ## Additional Info na	2022-11-07 06:48:34 +00:00
realbigsean	d8a49aad2b	merge with unstable fixes	2022-11-01 13:26:56 -04:00
realbigsean	8656d23327	merge with unstable	2022-11-01 13:18:00 -04:00
Pawan Dhananjay	29f2ec46d3	Couple blocks and blobs in gossip (#3670 ) * Revert "Add more gossip verification conditions" This reverts commit `1430b561c3`. * Revert "Add todos" This reverts commit `91efb9d4c7`. * Revert "Reprocess blob sidecar messages" This reverts commit `21bf3d37cd`. * Add the coupled topic * Decode SignedBeaconBlockAndBlobsSidecar correctly * Process Block and Blobs in beacon processor * Remove extra blob publishing logic from vc * Remove blob signing in vc * Ugly hack to compile	2022-11-01 10:28:21 -04:00
realbigsean	137f230344	Capella eip 4844 cleanup (#3652 ) * add capella gossip boiler plate * get everything compiling Co-authored-by: realbigsean <sean@sigmaprime.io Co-authored-by: Mark Mackey <mark@sigmaprime.io> * small cleanup * small cleanup * cargo fix + some test cleanup * improve block production * add fixme for potential panic Co-authored-by: Mark Mackey <mark@sigmaprime.io>	2022-10-26 15:15:26 -04:00
ethDreamer	255fdf0724	Added Capella Data Structures to consensus/types (#3637 ) * Ran Cargo fmt * Added Capella Data Structures to consensus/types	2022-10-13 09:37:20 -05:00
realbigsean	44515b8cbe	cargo fix	2022-10-05 17:20:54 -04:00
Pawan Dhananjay	21bf3d37cd	Reprocess blob sidecar messages	2022-10-05 02:52:26 -05:00
Pawan Dhananjay	12fe514550	Add more gossip verification functions for blobs	2022-10-04 19:17:53 -05:00
realbigsean	7527c2b455	fix RPC limit add blob signing domain	2022-10-04 14:57:29 -04:00
realbigsean	ba16a037a3	cleanup	2022-10-04 09:34:05 -04:00
realbigsean	c0dc42ea07	cargo fmt	2022-10-04 08:21:46 -04:00
Divma	4926e3967f	[DEV FEATURE] Deterministic long lived subnets (#3453 ) ## Issue Addressed #2847 ## Proposed Changes Add under a feature flag the required changes to subscribe to long lived subnets in a deterministic way ## Additional Info There is an additional required change that is actually searching for peers using the prefix, but I find that it's best to make this change in the future	2022-10-04 10:37:48 +00:00
realbigsean	8d45e48775	cargo fix	2022-10-03 21:52:16 -04:00
realbigsean	e81dbbfea4	compile	2022-10-03 21:48:02 -04:00
realbigsean	88006735c4	compile	2022-10-03 10:06:04 -04:00
realbigsean	7520651515	cargo fix and some test fixes	2022-09-29 12:43:35 -04:00
realbigsean	fe6fc55449	fix compilation errors, rename capella -> shanghai, cleanup some rebase issues	2022-09-29 12:43:13 -04:00
realbigsean	3f1e5cee78	Some gossip work	2022-09-29 12:35:53 -04:00
realbigsean	ebc0ccd02a	some more sync boilerplate	2022-09-29 12:34:09 -04:00
realbigsean	4008da6c60	sync tx blobs	2022-09-29 12:32:55 -04:00
Age Manning	01b6bf7a2d	Improve logging a little (#3619 ) Some of the logs in combination with others could be improved. It will save some time debugging by improving the wording slightly.	2022-09-29 01:50:12 +00:00
Divma	b1d2510d1b	Libp2p v0.48.0 upgrade (#3547 ) ## Issue Addressed Upgrades libp2p to v.0.47.0. This is the compilation of - [x] #3495 - [x] #3497 - [x] #3491 - [x] #3546 - [x] #3553 Co-authored-by: Age Manning <Age@AgeManning.com>	2022-09-29 01:50:11 +00:00
Divma	9bd384a573	send attnet unsubscription event on random subnet expiry (#3600 ) ## Issue Addressed 🐞 in which we don't actually unsubscribe from a random long lived subnet when it expires ## Proposed Changes Remove code addressing a specific case in which we are subscribed to all subnets and handle the removal of the long lived subnet. I don't think the special case code is particularly important as, if someone is running with that many validators to be subscribed to all subnets, it should use `--subscribe-all-subnets` instead ## Additional Info Noticed on some test nodes climbing bandwidth usage periodically (around 27hours, the time of subnet expirations) I'm running this code to test this does not happen anymore, but I think it should be good now	2022-09-23 03:52:45 +00:00
Paul Hauner	fa6ad1a11a	Deduplicate block root computation (#3590 ) ## Issue Addressed NA ## Proposed Changes This PR removes duplicated block root computation. Computing the `SignedBeaconBlock::canonical_root` has become more expensive since the merge as we need to compute the merke root of each transaction inside an `ExecutionPayload`. Computing the root for [a mainnet block](https://beaconcha.in/slot/4704236) is taking ~10ms on my i7-8700K CPU @ 3.70GHz (no sha extensions). Given that our median seen-to-imported time for blocks is presently 300-400ms, removing a few duplicated block roots (~30ms) could represent an easy 10% improvement. When we consider that the seen-to-imported times include operations after the block has been placed in the early attester cache, we could expect the 30ms to be more significant WRT our seen-to-attestable times. ## Additional Info NA	2022-09-23 03:52:42 +00:00
Marius van der Wijden	6f7d21c542	enable 4844 at epoch 3	2022-09-18 12:13:03 +02:00
Marius van der Wijden	285dbf43ed	hacky hacks	2022-09-18 11:34:46 +02:00
Marius van der Wijden	8b71b978e0	new round of hacks (config etc)	2022-09-17 23:42:49 +02:00
Daniel Knopik	750c594f5f	forgor something	2022-09-17 21:38:57 +02:00
Daniel Knopik	eab1fce0e5	Merge branch 'eip4844' of github.com:dknopik/lighthouse into eip4844	2022-09-17 20:55:36 +02:00
Daniel Knopik	76572db9d5	add network config	2022-09-17 20:55:21 +02:00
Marius van der Wijden	f43532d3de	implement handle blobs by range req	2022-09-17 20:05:51 +02:00
Marius van der Wijden	f9209e2d08	more network stuff	2022-09-17 16:39:40 +02:00
Marius van der Wijden	aeb52ff186	network stuff	2022-09-17 16:10:42 +02:00
Daniel Knopik	292a16a6eb	gossip boilerplate	2022-09-17 14:58:27 +02:00
Paul Hauner	2cd3e3a768	Avoid duplicate committee cache loads (#3574 ) ## Issue Addressed NA ## Proposed Changes I have observed scenarios on Goerli where Lighthouse was receiving attestations which reference the same, un-cached shuffling on multiple threads at the same time. Lighthouse was then loading the same state from database and determining the shuffling on multiple threads at the same time. This is unnecessary load on the disk and RAM. This PR modifies the shuffling cache so that each entry can be either: - A committee - A promise for a committee (i.e., a `crossbeam_channel::Receiver`) Now, in the scenario where we have thread A and thread B simultaneously requesting the same un-cached shuffling, we will have the following: 1. Thread A will take the write-lock on the shuffling cache, find that there's no cached committee and then create a "promise" (a `crossbeam_channel::Sender`) for a committee before dropping the write-lock. 1. Thread B will then be allowed to take the write-lock for the shuffling cache and find the promise created by thread A. It will block the current thread waiting for thread A to fulfill that promise. 1. Thread A will load the state from disk, obtain the shuffling, send it down the channel, insert the entry into the cache and then continue to verify the attestation. 1. Thread B will then receive the shuffling from the receiver, be un-blocked and then continue to verify the attestation. In the case where thread A fails to generate the shuffling and drops the sender, the next time that specific shuffling is requested we will detect that the channel is disconnected and return a `None` entry for that shuffling. This will cause the shuffling to be re-calculated. ## Additional Info NA	2022-09-16 08:54:03 +00:00
tim gretler	98815516a1	Support histogram buckets (#3391 ) ## Issue Addressed #3285 ## Proposed Changes Adds support for specifying histogram with buckets and adds new metric buckets for metrics mentioned in issue. ## Additional Info Need some help for the buckets. Co-authored-by: Michael Sproul <micsproul@gmail.com>	2022-09-13 01:57:44 +00:00
Divma	473abc14ca	Subscribe to subnets only when needed (#3419 ) ## Issue Addressed We currently subscribe to attestation subnets as soon as the subscription arrives (one epoch in advance), this makes it so that subscriptions for future slots are scheduled instead of done immediately. ## Proposed Changes - Schedule subscriptions to subnets for future slots. - Finish removing hashmap_delay, in favor of [delay_map](https://github.com/AgeManning/delay_map). This was the only remaining service to do this. - Subscriptions for past slots are rejected, before we would subscribe for one slot. - Add a new test for subscriptions that are not consecutive. ## Additional Info This is also an effort in making the code easier to understand	2022-09-05 00:22:48 +00:00
Paul Hauner	661307dce1	Separate committee subscriptions queue (#3508 ) ## Issue Addressed NA ## Proposed Changes As we've seen on Prater, there seems to be a correlation between these messages ``` WARN Not enough time for a discovery search subnet_id: ExactSubnet { subnet_id: SubnetId(19), slot: Slot(3742336) }, service: attestation_service ``` ... and nodes falling 20-30 slots behind the head for short periods. These nodes are running ~20k Prater validators. After running some metrics, I can see that the `network_recv` channel is processing ~250k `AttestationSubscribe` messages per minute. It occurred to me that perhaps the `AttestationSubscribe` messages are "washing out" the `SendRequest` and `SendResponse` messages. In this PR I separate the `AttestationSubscribe` and `SyncCommitteeSubscribe` messages into their own queue so the `tokio::select!` in the `NetworkService` can still process the other messages in the `network_recv` channel without necessarily having to clear all the subscription messages first. ~~I've also added filter to the HTTP API to prevent duplicate subscriptions going to the network service.~~ ## Additional Info - Currently being tested on Prater	2022-08-30 05:47:31 +00:00
Michael Sproul	66eca1a882	Refactor op pool for speed and correctness (#3312 ) ## Proposed Changes This PR has two aims: to speed up attestation packing in the op pool, and to fix bugs in the verification of attester slashings, proposer slashings and voluntary exits. The changes are bundled into a single database schema upgrade (v12). Attestation packing is sped up by removing several inefficiencies: - No more recalculation of `attesting_indices` during packing. - No (unnecessary) examination of the `ParticipationFlags`: a bitfield suffices. See `RewardCache`. - No re-checking of attestation validity during packing: the `AttestationMap` provides attestations which are "correct by construction" (I have checked this using Hydra). - No SSZ re-serialization for the clunky `AttestationId` type (it can be removed in a future release). So far the speed-up seems to be roughly 2-10x, from 500ms down to 50-100ms. Verification of attester slashings, proposer slashings and voluntary exits is fixed by: - Tracking the `ForkVersion`s that were used to verify each message inside the `SigVerifiedOp`. This allows us to quickly re-verify that they match the head state's opinion of what the `ForkVersion` should be at the epoch(s) relevant to the message. - Storing the `SigVerifiedOp` on disk rather than the raw operation. This allows us to continue track the fork versions after a reboot. This is mostly contained in this commit 52bb1840ae5c4356a8fc3a51e5df23ed65ed2c7f. ## Additional Info The schema upgrade uses the justified state to re-verify attestations and compute `attesting_indices` for them. It will drop any attestations that fail to verify, by the logic that attestations are most valuable in the few slots after they're observed, and are probably stale and useless by the time a node restarts. Exits and proposer slashings and similarly re-verified to obtain `SigVerifiedOp`s. This PR contains a runtime killswitch `--paranoid-block-proposal` which opts out of all the optimisations in favour of closely verifying every included message. Although I'm quite sure that the optimisations are correct this flag could be useful in the event of an unforeseen emergency. Finally, you might notice that the `RewardCache` appears quite useless in its current form because it is only updated on the hot-path immediately before proposal. My hope is that in future we can shift calls to `RewardCache::update` into the background, e.g. while performing the state advance. It is also forward-looking to `tree-states` compatibility, where iterating and indexing `state.{previous,current}_epoch_participation` is expensive and needs to be minimised.	2022-08-29 09:10:26 +00:00
Divma	8c69d57c2c	Pause sync when EE is offline (#3428 ) ## Issue Addressed #3032 ## Proposed Changes Pause sync when ee is offline. Changes include three main parts: - Online/offline notification system - Pause sync - Resume sync #### Online/offline notification system - The engine state is now guarded behind a new struct `State` that ensures every change is correctly notified. Notifications are only sent if the state changes. The new `State` is behind a `RwLock` (as before) as the synchronization mechanism. - The actual notification channel is a [tokio::sync::watch](https://docs.rs/tokio/latest/tokio/sync/watch/index.html) which ensures only the last value is in the receiver channel. This way we don't need to worry about message order etc. - Sync waits for state changes concurrently with normal messages. #### Pause Sync Sync has four components, pausing is done differently in each: - Block lookups: Disabled while in this state. We drop current requests and don't search for new blocks. Block lookups are infrequent and I don't think it's worth the extra logic of keeping these and delaying processing. If we later see that this is required, we can add it. - Parent lookups: Disabled while in this state. We drop current requests and don't search for new parents. Parent lookups are even less frequent and I don't think it's worth the extra logic of keeping these and delaying processing. If we later see that this is required, we can add it. - Range: Chains don't send batches for processing to the beacon processor. This is easily done by guarding the channel to the beacon processor and giving it access only if the ee is responsive. I find this the simplest and most powerful approach since we don't need to deal with new sync states and chain segments that are added while the ee is offline will follow the same logic without needing to synchronize a shared state among those. Another advantage of passive pause vs active pause is that we can still keep track of active advertised chain segments so that on resume we don't need to re-evaluate all our peers. - Backfill: Not affected by ee states, we don't pause. #### Resume Sync - Block lookups: Enabled again. - Parent lookups: Enabled again. - Range: Active resume. Since the only real pause range does is not sending batches for processing, resume makes all chains that are holding read-for-processing batches send them. - Backfill: Not affected by ee states, no need to resume. ## Additional Info QUESTION: Originally I made this to notify and change on synced state, but @pawanjay176 on talks with @paulhauner concluded we only need to check online/offline states. The upcheck function mentions extra checks to have a very up to date sync status to aid the networking stack. However, the only need the networking stack would have is this one. I added a TODO to review if the extra check can be removed Next gen of #3094 Will work best with #3439 Co-authored-by: Pawan Dhananjay <pawandhananjay@gmail.com>	2022-08-24 23:34:56 +00:00
Divma	f4ffa9e0b4	Handle processing results of non faulty batches (#3439 ) ## Issue Addressed Solves #3390 So after checking some logs @pawanjay176 got, we conclude that this happened because we blacklisted a chain after trying it "too much". Now here, in all occurrences it seems that "too much" means we got too many download failures. This happened very slowly, exactly because the batch is allowed to stay alive for very long times after not counting penalties when the ee is offline. The error here then was not that the batch failed because of offline ee errors, but that we blacklisted a chain because of download errors, which we can't pin on the chain but on the peer. This PR fixes that. ## Proposed Changes Adds a missing piece of logic so that if a chain fails for errors that can't be attributed to an objectively bad behavior from the peer, it is not blacklisted. The issue at hand occurred when new peers arrived claiming a head that had wrongfully blacklisted, even if the original peers participating in the chain were not penalized. Another notable change is that we need to consider a batch invalid if it processed correctly but its next non empty batch fails processing. Now since a batch can fail processing in non empty ways, there is no need to mark as invalid previous batches. Improves some logging as well. ## Additional Info We should do this regardless of pausing sync on ee offline/unsynced state. This is because I think it's almost impossible to ensure a processing result will reach in a predictable order with a synced notification from the ee. Doing this handles what I think are inevitable data races when we actually pause sync This also fixes a return that reports which batch failed and caused us some confusion checking the logs	2022-08-12 00:56:38 +00:00

1 2 3 4 5 ...

733 Commits