84 lines
		
	
	
		
			3.5 KiB
		
	
	
	
		
			Go
		
	
	
	
	
	
			
		
		
	
	
			84 lines
		
	
	
		
			3.5 KiB
		
	
	
	
		
			Go
		
	
	
	
	
	
// Copyright 2017 The go-ethereum Authors
 | 
						|
// This file is part of the go-ethereum library.
 | 
						|
//
 | 
						|
// The go-ethereum library is free software: you can redistribute it and/or modify
 | 
						|
// it under the terms of the GNU Lesser General Public License as published by
 | 
						|
// the Free Software Foundation, either version 3 of the License, or
 | 
						|
// (at your option) any later version.
 | 
						|
//
 | 
						|
// The go-ethereum library is distributed in the hope that it will be useful,
 | 
						|
// but WITHOUT ANY WARRANTY; without even the implied warranty of
 | 
						|
// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
 | 
						|
// GNU Lesser General Public License for more details.
 | 
						|
//
 | 
						|
// You should have received a copy of the GNU Lesser General Public License
 | 
						|
// along with the go-ethereum library. If not, see <http://www.gnu.org/licenses/>.
 | 
						|
 | 
						|
/*
 | 
						|
Package pot (proximity order tree) implements a container similar to a binary tree.
 | 
						|
The elements are generic Val interface types.
 | 
						|
 | 
						|
Each fork in the trie is itself a value. Values of the subtree contained under
 | 
						|
a node all share the same order when compared to other elements in the tree.
 | 
						|
 | 
						|
Example of proximity order is the length of the common prefix over bitvectors.
 | 
						|
(which is equivalent to the reverse rank of order of magnitude of the MSB first X
 | 
						|
OR distance over finite set of integers).
 | 
						|
 | 
						|
Methods take a comparison operator (pof, proximity order function) to compare two
 | 
						|
value types. The default pof assumes Val to be or project to a byte slice using
 | 
						|
the reverse rank on the MSB first XOR logarithmic disctance.
 | 
						|
 | 
						|
If the address space if limited, equality is defined as the maximum proximity order.
 | 
						|
 | 
						|
The container offers applicative (funcional) style methods on PO trees:
 | 
						|
* adding/removing en element
 | 
						|
* swap (value based add/remove)
 | 
						|
* merging two PO trees (union)
 | 
						|
 | 
						|
as well as iterator accessors that respect proximity order
 | 
						|
 | 
						|
When synchronicity of membership if not 100% requirement (e.g. used as a database
 | 
						|
of network connections), applicative structures have the advantage that nodes
 | 
						|
are immutable therefore manipulation does not need locking allowing for
 | 
						|
concurrent retrievals.
 | 
						|
For the use case where the entire container is supposed to allow changes by
 | 
						|
concurrent routines,
 | 
						|
 | 
						|
Pot
 | 
						|
* retrieval, insertion and deletion by key involves log(n) pointer lookups
 | 
						|
* for any item retrieval  (defined as common prefix on the binary key)
 | 
						|
* provide synchronous iterators respecting proximity ordering  wrt any item
 | 
						|
* provide asynchronous iterator (for parallel execution of operations) over n items
 | 
						|
* allows cheap iteration over ranges
 | 
						|
* asymmetric concurrent merge (union)
 | 
						|
 | 
						|
Note:
 | 
						|
* as is, union only makes sense for set representations since which of two values
 | 
						|
with equal keys survives is random
 | 
						|
* intersection is not implemented
 | 
						|
* simple get accessor is not implemented (but derivable from EachNeighbour)
 | 
						|
 | 
						|
Pinned value on the node implies no need to copy keys of the item type.
 | 
						|
 | 
						|
Note that
 | 
						|
* the same set of values allows for a large number of alternative
 | 
						|
POT representations.
 | 
						|
* values on the top are accessed faster than lower ones and the steps needed to
 | 
						|
retrieve items has a logarithmic distribution.
 | 
						|
 | 
						|
As a consequence one can organise the tree so that items that need faster access
 | 
						|
are torwards the top. In particular for any subset where popularity has a power
 | 
						|
distriution that is independent of proximity order (content addressed storage of
 | 
						|
chunks), it is in principle possible to create a pot where the steps needed to
 | 
						|
access an item is inversely proportional to its popularity.
 | 
						|
Such organisation is not implemented as yet.
 | 
						|
 | 
						|
TODO:
 | 
						|
* overwrite-style merge
 | 
						|
* intersection
 | 
						|
* access frequency based optimisations
 | 
						|
 | 
						|
*/
 | 
						|
package pot
 |