lighthouse

Author	SHA1	Message	Date
Paul Hauner	11c4968ea0	Do spec check before waiting for genesis (#1962 )	2020-11-25 02:00:11 +11:00
Age Manning	b6eff50ffa	Add lighthouse boot nodes (#1960 )	2020-11-25 00:05:53 +11:00
Paul Hauner	61277e3a72	Add mainnet genesis state (#1959 ) * Add mainnet genesis state * Add compressed, remove uncompressed	2020-11-24 23:21:00 +11:00
Mehdi Zerouali	ead6be074e	Remove experimental software warning (#1957 ) ## Proposed Changes Remove warning message on startup.	2020-11-24 10:29:41 +00:00
Mehdi Zerouali	011cea93b3	Update security details in README (#1956 ) ## Proposed Changes Introduces a few minor changes to the README, mainly updating mentions about security.	2020-11-24 10:29:39 +00:00
Michael Sproul	20339ade01	Refine and test slashing protection semantics (#1885 ) ## Issue Addressed Closes #1873 ## Proposed Changes Fixes the bug in slashing protection import (#1873) by pruning the database upon import. Also expands the test generator to cover this case and a few others which are under discussion here: https://ethereum-magicians.org/t/eip-3076-validator-client-interchange-format-slashing-protection/4883 ## Additional Info Depending on the outcome of the discussion on Eth Magicians, we can either wait for consensus before merging, or merge our preferred solution and patch things later.	2020-11-24 07:21:14 +00:00
Paul Hauner	84b3387d09	Add Prysm and Teku boot nodes (#1953 ) ## Issue Addressed NA ## Proposed Changes - Adds Prysm and Teku's boot nodes. The boot ENR were collected from [this Prysm PR](https://github.com/prysmaticlabs/prysm/pull/7925/files#diff-c20494db2dc1354ad056bcacaa192681386854bf036fdeef375dfe57336f27a7R42). ## Additional Info NA	2020-11-24 06:02:28 +00:00
Paul Hauner	e504645767	Update validator guide for mainnet (#1951 ) ## Issue Addressed NA ## Proposed Changes Updates the validator guide to provide instructions for mainnet users. ## Additional Info - ~~Blocked on #1751~~	2020-11-24 04:42:17 +00:00
realbigsean	a171fb8843	check if the slashing protection database is locked before creating keys (#1949 ) ## Issue Addressed Closes #1790 ## Proposed Changes Make a new method that creates an empty transaction with `TransactionBehavior::Exclusive` to check whether the slashing protection is locked. Call this method before attempting to create or import new validator keystores. ## Additional Info N/A Co-authored-by: realbigsean <seananderson33@gmail.com>	2020-11-24 03:25:40 +00:00
divma	6f890c398e	Sync Bug fixes (#1950 ) ## Issue Addressed Two issues related to empty batches - The chain's target was not being advanced when the batch was successful and empty and the chain didn't have an optimistic batch - Not switching finalized chains. We now switch finalized chains, requiring a minimum amount of work first	2020-11-24 02:11:31 +00:00
Paul Hauner	21617aa87f	Change --testnet flag to --network (#1751 ) ## Issue Addressed - Resolves #1689 ## Proposed Changes TBC ## Additional Info NA	2020-11-23 23:54:03 +00:00
Michael Sproul	7d644103c6	Tweak slasher DB schema and pruning (#1948 ) ## Issue Addressed Resolves #1890 ## Proposed Changes Change the slasher database schema to key indexed attestations by `(target_epoch, indexed_attestation_root)` instead of just `indexed_attestation_root`. This allows more straight-forward pruning (linear scan), that is also "re-entrant". By re-entrant, we mean that a pruning pass that gets stuck because of a `MapFull` error can attempt to commit midway, and be resumed later without issue. The previous pruning strategy for indexed attestations did not have this property. There was also a flaw in the previous pruning that could leave "zombie" indexed attestations in the database (ones not referenced by any attester record), which could build up and contribute to bloat (although in practice I think they occur quite infrequently). ## Additional Info During testing I noticed that a `MapFull` error can still occur during the commit of the transaction itself, which is irritating, but not unbearable. This PR should at least reduce the frequency with which users need to manually resize their DB, and if the `MapFull` on commit rears its ugly head too often we could use a dynamic strategy (temporarily increase the size of the map until the transaction commits). The extra bytes for the epoch make the database a bit heavier, so the size estimate docs have been updated to reflect this. This is also a breaking schema change, so anyone using a v0 database from a few hours ago will need to drop it and update 😅	2020-11-23 21:33:51 +00:00
Michael Sproul	5828ff1204	Implement slasher (#1567 ) This is an implementation of a slasher that lives inside the BN and can be enabled via `lighthouse bn --slasher`. Features included in this PR: - [x] Detection of attester slashing conditions (double votes, surrounds existing, surrounded by existing) - [x] Integration into Lighthouse's attestation verification flow - [x] Detection of proposer slashing conditions - [x] Extraction of attestations from blocks as they are verified - [x] Compression of chunks - [x] Configurable history length - [x] Pruning of old attestations and blocks - [x] More tests Future work: * Focus on a slice of history separate from the most recent N epochs (e.g. epochs `current - K` to `current - M`) * Run out-of-process * Ingest attestations from the chain without a resync Design notes are here https://hackmd.io/@sproul/HJSEklmPL	2020-11-23 03:43:22 +00:00
Paul Hauner	59b2247ab8	Improve UX whilst VC is waiting for genesis (#1915 ) ## Issue Addressed - Resolves #1424 ## Proposed Changes Add a `GET lighthouse/staking` that returns 200 if the node is ready to stake (i.e., `--eth1` flag is present) or a 404 otherwise. Whilst the VC is waiting for the genesis time to start (i.e., when the genesis state is known), check the `lighthouse/staking` endpoint and log an error if the node isn't configured for staking. ## Additional Info NA	2020-11-23 01:00:22 +00:00
Paul Hauner	65b1cf2af1	Add flag to import all attestations (#1941 ) ## Issue Addressed NA ## Proposed Changes Adds the `--import-all-attestations` flag which tells the `network::AttestationService` to import/aggregate all attestations after verification (instead of only ones for subnets that are relevant to local validators). This is useful for testing/debugging and also for creating back-up nodes that should be all cached up and ready for any validator. ## Additional Info NA	2020-11-22 23:58:25 +00:00
divma	d0cbf3111a	move sync state to the chains KV (#1940 ) ## Issue Addressed We have a log saying we add a peer to a chain, and another one in case the chain is not syncing. To avoid needing to pair these two (and to reduce log entries), simply log the chain's syncing state in the chain's KV	2020-11-22 23:58:23 +00:00
Michael Sproul	426b3001e0	Fix race condition in seen caches (#1937 ) ## Issue Addressed Closes #1719 ## Proposed Changes Lift the internal `RwLock`s and `Mutex`es from the `Observed*` data structures to resolve the race conditions described in #1719. Most of this work was done by @paulhauner on his `lift-locks` branch, I merely updated it for the current `master` and checked over it. ## Additional Info I think it would be prudent to test this on a testnet or two before mainnet launch, just to be sure that the extra lock contention doesn't negatively impact performance.	2020-11-22 23:02:51 +00:00
Paul Hauner	0b556c4405	Fix metrics http server error messages (#1946 ) ## Issue Addressed - Resolves #1945 ## Proposed Changes - As per #1945, fix a log message from the metrics server that was falsely claiming to be from the api server. - Ensure successful api request logs are published to debug, not trace. This is something I've wanted to do for a while. ## Additional Info NA	2020-11-22 03:39:13 +00:00
Paul Hauner	48f73b21e6	Expand eth1 block cache, add more logs (#1938 ) ## Issue Addressed NA ## Proposed Changes - Caches later blocks than is required by `ETH1_FOLLOW_DISTANCE`. - Adds logging to `warn` if the eth1 cache is insufficiently primed. - Use `max_by_key` instead of `max_by` in `BeaconChain::Eth1Chain` since it's simpler. - Rename `voting_period_start_timestamp` to `voting_target_timestamp` for accuracy. ## Additional Info The reason for eating into the `ETH1_FOLLOW_DISTANCE` and caching blocks that are closer to the head is due to possibility for `SECONDS_PER_ETH1_BLOCK` to be incorrect (as is the case for the Pyrmont testnet on Goerli). If `SECONDS_PER_ETH1_BLOCK` is too short, we'll skip back too far from the head and skip over blocks that would be valid [`is_candidate_block`](https://github.com/ethereum/eth2.0-specs/blob/v1.0.0/specs/phase0/validator.md#eth1-data) blocks. This was the case on the Pyrmont testnet and resulted in Lighthouse choosing blocks that were about 30 minutes older than is ideal.	2020-11-21 00:26:15 +00:00
Kirk Baird	3b405f10ea	Ensure deposit signatures do not use aggregate functions (#1935 ) ## Issue Addressed Resolves #1333 ## Proposed Changes - Remove `deposit_signature_set()` function - Prevent deposits from being in `SignatureSets` - Use `Signature.verify()` to verify deposit signatures rather than a signature set which uses `fast_aggregate_verify()` ## Additional Info n/a	2020-11-20 03:37:20 +00:00
divma	d727e55abe	Move some rpc processing to the beacon_processor (#1936 ) ## Issue Addressed `BlocksByRange` requests were the main culprit of a series of timeouts to peer's requests in general because they produce build up in the router's processor. Those were moved to the blocking executor but a task is being spawned for each; also not ideal since the amount of resources we give to those is not controlled ## Proposed Changes - Move `BlocksByRange` and `BlocksByRoots` to the `beacon_processor`. The processor crafts the responses and sends them. - Move too the processing of `StatusMessage`s from other peers. This is a fast operation but it can also build up and won't scale if we keep it in the router (processing one at the time). These don't need to send an answer, so there is no harm in processing them "later" if that were to happen. Sending responses to status requests is still in the router, so we answer as soon as we see them. - Some "extras" that are basically clean up: - Split the `Worker` logic in sync methods (chain processing and rpc blocks), gossip methods (the majority of methods) and rpc methods (the new ones) - Move the `status_message` function previously provided by the router's processor to a more central place since it is used by the router, sync, network_context and beacon_processor - Some spelling ## Additional Info What's left to decide/test more thoroughly is the length of the queues and the priority rules. @paulhauner suggested at some point to put status above attestations, and @AgeManning had described an importance of "protecting gossipsub" so my solution is leaving status requests in the router and RPC methods below attestations. Slashings and Exits are at the end.	2020-11-19 23:33:44 +00:00
Pawan Dhananjay	e47739047d	Add additional libp2p tests (#1867 ) ## Issue Addressed N/A ## Proposed Changes Adds tests for the eth2_libp2p crate.	2020-11-19 22:32:09 +00:00
Michael Sproul	37369c6a56	Document system requirements (#1934 ) ## Proposed Changes Document some minimal and recommended system specs for running Lighthouse on mainnet with a modest number of validators.	2020-11-19 21:23:56 +00:00
Kirk Baird	c5e97b9bf7	Add validation to kdf parameters (#1930 ) ## Issue Addressed Closes #1906 Closes #1907 ## Proposed Changes - Emits warnings when the KDF parameters are too low. - Returns errors when the KDF parameters are high enough to pose a potential DoS threat. - Validates AES IV length is 128 bits, errors if empty, warnings otherwise. ## Additional Info NIST advice used for PBKDF2 ranges https://nvlpubs.nist.gov/nistpubs/Legacy/SP/nistspecialpublication800-132.pdf. Scrypt ranges are based on the maximum value of the `u32` (i.e. 4GB of memory). The minimum range has been set to anything below the default fields.	2020-11-19 08:52:51 +00:00
Herman Junge	1a530e5a93	[Remote signer] Add signer consumer lib (#1763 ) Adds a library `common/remote_signer_consumer`	2020-11-19 04:04:52 +00:00
Kirk Baird	3db9072fee	Reject invalid utf-8 characters during encryption (#1928 ) ## Issue Addressed Closes #1889 ## Proposed Changes - Error when passwords use invalid UTF-8 characters during encryption. - Add some tests ## Additional Info I've decided to error when bad characters are used to create/encrypt a keystore but think we should allow them during decryption since either the keystore was created - with invalid UTF-8 characters (possibly by another client or someone whose password is random bytes) in which case we'd want them to be able to decrypt their keystore using the right key. - without invalid characters then the password checksum would almost certainly fail. Happy to add them to decryption if we want to make the decryption more trigger happy 😋 , it would only be a one line change and would tell the user which character index is causing the issue. See https://eips.ethereum.org/EIPS/eip-2335#password-requirements	2020-11-19 00:37:43 +00:00
realbigsean	79fd9b32b9	Update pool/attestations and committees endpoints (#1899 ) ## Issue Addressed Catching up on a few eth2 spec updates: ## Proposed Changes - adding query params to the `GET pool/attestations` endpoint - allowing the `POST pool/attestations` endpoint to accept an array of attestations - batching attestation submission - moving `epoch` from a path param to a query param in the `committees` endpoint ## Additional Info Co-authored-by: realbigsean <seananderson33@gmail.com>	2020-11-18 23:31:39 +00:00
blacktemplar	3408de8151	Avoid string initialization in network metrics and replace by &str where possible (#1898 ) ## Issue Addressed NA ## Proposed Changes Removes most of the temporary string initializations in network metrics and replaces them by directly using `&str`. This further improves on PR https://github.com/sigp/lighthouse/pull/1895. For the subnet id handling the current approach uses a build script to create a static map. This has the disadvantage that the build script hardcodes the number of subnets. If we want to use more than 64 subnets we need to adjust this in the build script. ## Additional Info We still have some string initializations for the enum `PeerKind`. To also replace that by `&str` I created a PR in the libp2p dependency: https://github.com/sigp/rust-libp2p/pull/91. Either we wait with merging until this dependency PR is merged (and all conflicts with the newest libp2p version are resolved) or we just merge as is and I will create another PR when the dependency is ready.	2020-11-18 23:31:37 +00:00
Paul Hauner	bcc7f6b143	Add new flag to set blocks per eth1 query (#1931 ) ## Issue Addressed NA ## Proposed Changes Users on Discord (and @protolambda) have experienced this error (or variants of it): ``` Failed to update eth1 cache: GetDepositLogsFailed("Eth1 node returned error: {\"code\":-32005,\"message\":\"query returned more than 10000 results\"}") ``` This PR allows users to reduce the span of blocks searched for deposit logs and therefore reduce the size of the return result. Hopefully experimentation with this flag can lead to finding a better default value. ## Additional Info NA	2020-11-18 22:18:59 +00:00
Herman Junge	0c2c2cef93	Add lighthouse bootnodes (#1929 ) Gotta pump those github profile green squares!	2020-11-18 07:07:45 +00:00
Paul Hauner	7e4ee58729	Bump to v0.3.5 (#1927 ) ## Issue Addressed NA ## Proposed Changes - Bump version to `v0.3.5` - Run `cargo update` ## Additional Info NA	2020-11-18 00:44:28 +00:00
Paul Hauner	103103e72e	Address queue congestion in migrator (#1923 ) ## Issue Addressed Should address #1917 ## Proposed Changes Stops the `BackgroundMigrator` rx channel from backing up with big `BeaconState` messages. Looking at some logs from my Medalla node, we can see a discrepancy between the head finalized epoch and the migrator finalized epoch: ``` Nov 17 16:50:21.606 DEBG Head beacon block slot: 129214, root: 0xbc7a…0b99, finalized_epoch: 4033, finalized_root: 0xf930…6562, justified_epoch: 4035, justified_root: 0x206b…9321, service: beacon Nov 17 16:50:21.626 DEBG Batch processed service: sync, processed_blocks: 43, last_block_slot: 129214, chain: 8274002112260436595, first_block_slot: 129153, batch_epoch: 4036 Nov 17 16:50:21.626 DEBG Chain advanced processing_target: 4036, new_start: 4036, previous_start: 4034, chain: 8274002112260436595, service: sync Nov 17 16:50:22.162 DEBG Completed batch received awaiting_batches: 5, blocks: 47, epoch: 4048, chain: 8274002112260436595, service: sync Nov 17 16:50:22.162 DEBG Requesting batch start_slot: 129601, end_slot: 129664, downloaded: 0, processed: 0, state: Downloading(16Uiu2HAmG3C3t1McaseReECjAF694tjVVjkDoneZEbxNhWm1nZaT, 0 blocks, 1273), epoch: 4050, chain: 8274002112260436595, service: sync Nov 17 16:50:22.654 DEBG Database compaction complete service: beacon Nov 17 16:50:22.655 INFO Starting database pruning new_finalized_epoch: 2193, old_finalized_epoch: 2192, service: beacon ``` I believe this indicates that the migrator rx has a backed-up queue of `MigrationNotification` items which each contain a `BeaconState`. ## TODO - [x] Remove finalized state requirement for op-pool	2020-11-17 23:11:26 +00:00
Michael Sproul	a60ab4eff2	Refine compaction (#1916 ) ## Proposed Changes In an attempt to fix OOM issues and database consistency issues observed by some users after the introduction of compaction in v0.3.4, this PR makes the following changes: * Run compaction less often: roughly every 1024 epochs, including after long periods of non-finality. I think the division check proposed by Paul is pretty solid, and ensures we don't miss any events where we should be compacting. LevelDB lacks an easy way to check the size of the DB, which would be another good trigger. * Make it possible to disable the compaction on finalization using `--auto-compact-db=false` * Make it possible to trigger a manual, single-threaded foreground compaction on start-up using `--compact-db` * Downgrade the pruning log to `DEBUG`, as it's particularly noisy during sync I would like to ship these changes to affected users ASAP, and will document them further in the Advanced Database section of the book if they prove effective.	2020-11-17 09:10:53 +00:00
Paul Hauner	ecff8807a5	Avoid some allocations in BlockSignatureVerifier (#1922 ) ## Issue Addressed NA ## Proposed Changes Avoids growing/allocating some `Vec`s. ## Additional Info NA	2020-11-17 06:31:01 +00:00
Paul Hauner	5114aee5cf	Avoid allocations on VariableList (#1921 ) ## Issue Addressed NA ## Proposed Changes Avoids lots of grow allocations when decoding a `VariableList` of fixed-length items. This is the function used for decoding the `state.validators` list. ## Additional Info NA	2020-11-17 04:28:40 +00:00
divma	398919b5d4	router: drop requests from peers that have dc'd (#1919 ) ## Issue Addressed A peer might send a lot of requests that comply with the rate limit and then disconnect. This humongous PR makes sure we don't process them if the peer is not connected	2020-11-17 02:06:21 +00:00
Pawan Dhananjay	280334b1b0	Validate eth1 chain id (#1877 ) ## Issue Addressed Resolves #1815 ## Proposed Changes Adds extra validation for eth1 chain id apart from the existing check for eth1 network id.	2020-11-16 23:10:42 +00:00
Łukasz Sroka	4d732a1f1d	Added fn to count unicode characters (#1903 ) ## Issue Addressed Password length check too short (https://github.com/sigp/lighthouse/issues/1880) ## Proposed Changes I've added a function that counts the number of Unicode characters, instead of calling String::len() Co-authored-by: Paul Hauner <paul@paulhauner.com>	2020-11-16 09:30:34 +00:00
Age Manning	49c4630045	Performance improvement for db reads (#1909 ) This PR adds a number of improvements: - Downgrade a warning log when we ignore blocks for gossipsub processing - Revert a correction to improve logging of peer score changes - Shift syncing DB reads off the core-executor allowing parallel processing of large sync messages - Correct the timeout logic of RPC chunk sends, giving more time before timing out RPC outbound messages.	2020-11-16 07:28:30 +00:00
Paul Hauner	646c049df2	Add link to Lighthouse mailing list (#1913 ) ## Issue Addressed Resolves #1851 ## Proposed Changes Adds a link to the Lighthouse mailing list. ## Additional Info NA	2020-11-16 06:28:11 +00:00
Paul Hauner	836eaf559b	Check whistle-blower index (#1911 ) ## Issue Addressed - Resolves #1910 ## Proposed Changes See #1910 ## Additional Info NA	2020-11-16 06:28:09 +00:00
Paul Hauner	fe71f25c3a	Add Pyrmont testnet (#1904 ) ## Issue Addressed NA ## Proposed Changes - Replace Zinken with Pyrmont (Zinken has been sun-setted). - Ensure Mainnet is built in the build script. ## Additional Info NA	2020-11-16 05:11:35 +00:00
divma	eb56140582	Update logs + do not downscore peers if WE time out (#1901 ) ## Issue Addressed - RPC Errors were being logged twice: first in the peer manager and then again in the router, so leave just the peer manager's one - The "reduce peer count" warn message gets thrown to the user for every missed chunk, so instead print it when the request times out and also do not include there info that is not relevant to the user - The processor didn't have the service tag so add it - Impl `KV` for status message - Do not downscore peers if we are the ones that timed out Other small improvements	2020-11-16 04:06:14 +00:00
realbigsean	6a7d221f72	add slot validation to attestation_data endpoint (#1888 ) ## Issue Addressed Resolves #1801 ## Proposed Changes Verify queries to `attestation_data` are for no later than `current_slot + 1`. If they are later than this, return a 400. Co-authored-by: realbigsean <seananderson33@gmail.com>	2020-11-16 02:59:35 +00:00
divma	8a16548715	Misc Peer sync info adjustments (#1896 ) ## Issue Addressed #1856 ## Proposed Changes - For clarity, the router's processor now only decides if a peer is compatible and it disconnects it or sends it to sync accordingly. No logic here regarding how useful the peer is. - Update peer_sync_info's rules - Add an `IrrelevantPeer` sync status to account for incompatible peers (maybe this should be "IncompatiblePeer" now that I think about it?); this state is updated upon receiving an internal goodbye in the peer manager - Misc code cleanups - Reduce the need to create `StatusMessage`s (and thus, `Arc` accesses ) - Add missing calls to update the global sync state The overall effect should be: - More peers recognized as Behind, and fewer as Unknown - Peers identified as incompatible	2020-11-13 09:00:10 +00:00
Michael Sproul	46a06069c6	Release v0.3.4 (#1894 ) ## Proposed Changes Bump version to v0.3.4 and update dependencies with `cargo update`. Co-authored-by: Michael Sproul <michael@sigmaprime.io>	2020-11-13 06:06:35 +00:00
Age Manning	c00e6c2c6f	Small network adjustments (#1884 ) ## Issue Addressed - Asymmetric pings - Currently with symmetric ping intervals, lighthouse nodes race each other to ping often ending in simultaneous ping connections. This shifts the ping interval to be asymmetric based on inbound/outbound connections - Correct inbound/outbound peer-db registering - It appears we were accounting inbound as outbound and vice versa in the peerdb, this has been corrected - Improved logging There is likely more to come - I'll leave this open as we investigate further testnets	2020-11-13 06:06:33 +00:00
Paul Hauner	8772c02fa0	Reduce temp allocations in network metrics (#1895 ) ## Issue Addressed Using `heaptrack` I could see that ~75% of Lighthouse temporary allocations are caused by temporary string allocations here. ## Proposed Changes Reduces temporary `String` allocations when updating metrics in the `network` crate. The solution isn't perfect since we rebuild our caches with each call, but it's a significant improvement. ## Additional Info NA	2020-11-13 04:19:38 +00:00
blacktemplar	c7ac967d5a	handle peer state transitions on gossipsub score changes + refactoring (#1892 ) ## Issue Addressed NA ## Proposed Changes Correctly handles peer state transitions on gossipsub changes + refactors handling of peer state transitions into one function used for lighthouse score changes and gossipsub score changes. Co-authored-by: Age Manning <Age@AgeManning.com>	2020-11-13 03:15:03 +00:00
realbigsean	cb26c15eb6	Peer endpoint updates (#1893 ) ## Issue Addressed N/A ## Proposed Changes - rename `address` -> `last_seen_p2p_address` - state and direction filters for `peers` endpoint - metadata count addition to `peers` endpoint - add `peer_count` endpoint Co-authored-by: realbigsean <seananderson33@gmail.com>	2020-11-13 02:02:41 +00:00
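
Note on 4d732a1f1d (#1903): the commit replaces a `String::len()` call with a count of Unicode characters. A minimal sketch of why the two differ is below. The helper name `char_count` and the sample password are hypothetical and not taken from the Lighthouse source; the sketch assumes "characters" means Unicode scalar values as yielded by `str::chars`, which may not match the exact rule the PR uses.

```rust
/// Hypothetical helper (not Lighthouse's actual function): counts Unicode
/// scalar values instead of UTF-8 bytes.
fn char_count(s: &str) -> usize {
    s.chars().count()
}

fn main() {
    // Two of the eight characters (U+00E4, U+00F6) need two bytes each in UTF-8.
    let password = "p\u{e4}ssw\u{f6}rd";

    // `len()` reports the byte length, so a byte-based minimum-length check
    // would treat this password as longer than it looks to the user.
    println!("bytes: {}", password.len());
    println!("chars: {}", char_count(password));
}
```

Counting scalar values is still not the same as counting user-perceived characters (grapheme clusters), so a combining sequence would count as more than one.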

3952 Commits