Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

H. BrakmićBitcoin and Lightning Network on Raspberry Pihttps://doi.org/10.1007/978-1-4842-5522-3_2

2. Bitcoin Technology

Harris Brakmić¹

(1)

Troisdorf, Germany

In this chapter we will learn the basics of blockchain, addresses, transactions, and Bitcoin’s embedded scripting language. We will generate addresses, create transactions, and send them through the network to explore the inner workings of Bitcoin’s scripting engine.

Blockchain

Bitcoin’s blockchain is an append-only database containing an ordered, back-linked list of blocks that’s being replicated over tens of thousands of nodes, which continuously validate new blocks and update their copies according to the current consensus. Very often we can read descriptions of blockchains as decentralized and distributed databases, which I consider only partially correct. Although their decentralized nature is obvious due to the fact that every node maintains its own copy and only accepts those blocks which obey the Consensus Rules embedded in Bitcoin’s code, the second part of the definition isn’t quite correct. Bitcoin’s blockchain isn’t distributed but replicated , because every node must do the same work by executing the same Consensus Rules, before a block gets accepted as valid.

If the blockchain was distributed, the participating nodes wouldn’t do the whole work, but only a part of it, because execution of Consensus Rules would be distributed over many nodes thus lowering the pressure on individual nodes. This, however, would be contrary to Bitcoin’s design, where every node must execute every task individually in order to make changes in the blockchain exceptionally expensive. When every node has to do the same work and validate every transaction, then any attempt to manipulate those entries would become exceptionally expensive for any attacker, because one would have to cheat all of the nodes at the same time. Therefore, we should speak of blockchains as decentralized and replicated structures. The blockchain structure comprises of blocks that carry transactions which in most cases describe transfers of funds between two parties, but there also can be more complex variants with more than two participants as we will see in later chapters. As every block is linked to its predecessor, we call this structure back-linked (Figure 2-1).

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig1_HTML.jpg — Figure 2-1
Blockchain structure

The link between two blocks is the hash value of the header of the preceeding block as we saw it in the previous chapter. Besides the header, every block also contains a list of transactions which carry information about the ownership of coins. A transaction typically assigns funds from one party to another. The technology behind Bitcoin’s blockchain implementation is based on Google’s LevelDB,¹ which is a very fast on-disk key-value store. Although the word “blockchain” associates with a structure containing chained elements, a better description of it would be a stack, with elements put on top of each other. In Figure 2-2 we see a stack of blocks, each back-linked to its predecessor.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig2_HTML.jpg — Figure 2-2
A back-linked stack of blocks

The topmost block is also called a “tip”. Later, when we start learning how to use Bitcoin’s RPC API,² we will execute commands that give us detailed information about a blockchain’s state. Another term often used is “parent block” which means the predecessor of a block. As every block contains a hash as its identifier and links back to its predecessor, this block is then being called “parent” block.

In the previous chapter, we have seen that all blocks in Bitcoin’s blockchain ultimately go back to the very first block, the Genesis Block. Every block only has a single parent, but there could exist more than one “child block” that is not yet a confirmed member of the blockchain. Each time a new candidate block arrives in the network, there is also a mining entity behind it that wants to win the racing game. Getting blocks included in the chain is the only way to get rewards. And sometimes it comes to situations where multiple miners have created competing blocks which are valid according to Consensus Rules, but because they’re representing different possible states of the future blockchain, nodes validating them will have to decide which of them will ultimately become the next “tip”.

In such a situation, nodes would temporarily accept every valid block by creating several variants of the blockchain as shown in Figure 2-3. And as miners would continue their racing game by producing more blocks, the ultimate decision will be done later after new blocks have arrived. Those blocks would then extend one of the possible variants of the chain. And because a block can only have one parent, validating nodes would automatically select the variant whose upmost block is the parent of the newly arrived block. In the end, the longest chain variant would win, because it stands for the most work being done. Other variants of the blockchain, also called “forks”, will get discarded.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig3_HTML.jpg — Figure 2-3
A temporary fork with an orphaned and a winning chain

These operations are nothing exceptional as the design of Bitcoin allows for this flexibility under the assumption that the majority of mining nodes is honest and not willingly manipulating the chain. But as we have already seen in the previous chapter, the competition in Bitcoin and its aligned system of rewards and punishments is keeping all of its participants acting honestly, because any other behavior would ultimately lead to severe monetary losses. This behavior is also a practical example on how the Emergent Consensus happens in the blockchain. By allowing several forks to exist, the Bitcoin protocol leaves enough room for nodes to find a solution which would be very hard to achieve if there was some hard-coded rule that would ultimately decide on what kind of block “deserves” to become the tip of a valid chain.

Moreover, such a rule would ultimately lead to a dangerous partition of the network, which would render Bitcoin unusable as currency, because several variants of “Bitcoin” could exist at the same time. To prevent this the Emergent Consensus exists, which allows participating nodes to resolve any “disputes” automatically. As Bitcoin nodes can join and leave the network anytime, there is always the question: What is the current state of the chain? As we will see on the following pages, a node must first take care of discovering³ as many peer nodes as possible and also informing them about its own existence, before it can start processing any blocks. Also, it must get enough information about the current state of the chain, for example, about the current height and which services other nodes support. A decentralized network is forcing every node to find its own way “out of the dark”, before it can start validating blocks and transactions.

Peer-to-Peer Networking

As no Bitcoin node can function without sharing blockchain information, the question is, how does it find others in the first place? The answer lies in the peer-to-peer, or P2P, network that nodes use to discover each other and exchange data. By default, every node carries a few hard-coded IP addresses and DNS entries that help it kick off the initial discovery procedure. Those nodes, also called seed-nodes , contain information about further nodes, which our node can then use to create a more detailed topology of the network. Being decentralized, Bitcoin’s network has a flat structure without any servers so that every node must find its neighbors on its own, by trying to connect with other IP addresses.

A completely new node would use the eight hard-coded seed-nodes⁴ to send them certain information about itself by using the version-message⁵ that contains these entries:

The protocol version it’s following, like 70002
The services it supports, like NODE_NETWORK or NODE_SEGWIT
The current time
The IP address of the node it contacted
Its own IP address
A string describing the local client version, like /Satoshi:0.18.0/
Current blockchain tip⁶

The contacted node would then analyze this message and send back an acknowledgement message called verack .⁷ After the node has contacted some of the hard-coded seed-nodes, it’d then receive a list of IP addresses of other validating nodes, which it can then use to expand its network topology. The seed-nodes are basically special DNS servers⁸ that mimic the DNS protocol by answering queries on port 53, the default DNS port. Each time a node sends a query on port 53, they send a list of IP addresses back. However, merely querying for IP addresses isn’t enough as every node also tries to make other nodes aware of its existence. Therefore, a node would also send addr-messages ⁹ to its peers, which then would forward it to their own peers, thus expanding the reach of the new node. Also, a node can query other nodes by sending getaddr-messages to receive its peers’ address lists. The strategy of this protocol is clear: to automatically discover, expand, and update the knowledge about the network.

One can also manually define or even disable DNS and peer discovery. To disable DNS completely, the Bitcoin client or daemon can be started with the flag dnsseed=0. To add further nodes, one can add their IP addresses and ports in bitcoin.conf, the configuration file of the Bitcoin client and daemon, with the entry addnode.¹⁰ If peers are using the default TCP port 8333, then only the IP address is needed. If only a single seed-node is needed, the flag seednode can be used to define the DNS node to be contacted at the next client or daemon start.

To query information about connected peers, the command getpeerinfo can be used. Its output delivers a list of known peers and their client information. The contents returned look like this:

{

"id": 0,

"addr": "167.71.192.31:8333",

"addrlocal": "35.200.105.117:59664",

"addrbind": "192.168.0.87:59664",

"services": "000000000000040d",

"relaytxes": true,

"lastsend": 1567747871,

"lastrecv": 1567747886,

"bytessent": 1574683,

"bytesrecv": 1831044,

"conntime": 1567533523,

"timeoffset": 0,

"pingtime": 0.187978,

"minping": 0.170567,

"version": 70015,

"subver": "/Satoshi:0.18.0/"

}

After a node has discovered multiple peers, it will start exchanging its data with them. In case of a completely new node, all it will have to offer is the embedded Genesis Block that starts at chain height 0. It will send a getblocks message as shown in Figure 2-4, which contains its highest known block hash. Its peers will find the hash in their own copies of the blockchain and respond with an inv message (inventory) containing the first 500 hashes of blocks that came after the last known block of the new node. The queried node then takes the hash provided and tries to find its position in its own copy. If it was able to locate it, it would then generate a list of 500 block hashes that are descendants of the hash from querying node.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig4_HTML.jpg — Figure 2-4
Finding other nodes

The reason why peers answer with only 500 hashes is to prevent overwhelming the asking node. The asking node will then use those 500 hashes to send further getblocks messages to other nodes, which then in turn will send another 500 hashes that are located on higher positions in the chain.

This way a node can easily send many queries that can be combined into an ordered line of blocks. To get full block data, a node would have to send a message called getdata that contains the hash of the block whose data should be returned.

The whole procedure of querying and updating peers might seem too complex, but one must not forget that in decentralized networks, every participant can go offline anytime without warning. Therefore, nodes are continuously updating and validating their connections to ensure that they always have the most reliable channels, because only those can ensure access to most recent blockchain data. The quest for peer nodes has always been a vital part of Bitcoin’s network, and in the past, it has even used the IRC chat protocol for such tasks. The modern variants of Bitcoin Core client no longer use IRC, but the old source code is still available as an interesting artifact from Bitcoin’s past.¹¹

Node Types

Another important aspect in Bitcoin’s P2P network is the specialization of nodes. Although the protocol itself doesn’t distinguish between various types of nodes, there are configuration options available to maintain and operate nodes with different specializations. We have already mentioned the term Full Node ,¹² without having properly defined it. A Full Node is any node that maintains its own wallet, has a full blockchain copy, and validates transactions without relying on any external service or node. Throughout this book we will be building Full Nodes and later also Lightning Nodes that are based on them.

But these aren’t the only available types of Bitcoin nodes. There are also SPV nodes (simplified payment verification) that maintain a lighter version of the full blockchain, which needs much less disk space, and are therefore dependent on other nodes with full blockchain copies. One of the use-cases for such nodes are various mobile wallet applications. As the full blockchain copy consumes more than 250GB, we couldn’t run such a node on a mobile device. But as we will see later, a mobile node only needs to download the headers of blocks which are always 80 bytes long. Currently, the space needed to save all those headers would be less than 50MB, which is not a problem for any modern smartphone. This is possible because blocks are chained together via their block headers so that the second half of each block, the transactions, can be discarded. Only when a mobile node needs information about a certain transaction it would have to issue a request against a Full Node that has the complete information saved in the blockchain. This configuration of course raises another question that has to do with the integrity of information being processed by mobile wallets. As mobile nodes would from time to time need external nodes to process their transactions, one could never achieve the same level of security and independence as possible with Full Nodes. This is a known trade-off that can be mitigated by using own Full Nodes as the only “trusted nodes ” to communicate with. Many good mobile wallets offer an option to configure one or more Full Nodes as sources of transaction information.

Another important functionality we already talked about is routing, which is part of every functioning node . And of course, the mining functionality itself is available in every Bitcoin Core client, but only a subset of running nodes in the network are actually using it. Those functionalities can be combined as not every node has to have all of them activated. On the following pages, we will explore different node types and how they are being used.

For example, a mining node that’s participating in a mining pool doesn’t have to maintain its own blockchain copy and can off-load this responsibility to the pool itself. All it needs to know is the current difficulty and the transactions waiting in the memory pool (Figure 2-5).

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig5_HTML.jpg — Figure 2-5
Mining node structure

Additionally, it uses a protocol for communication with the pool server. Very often this protocol is Stratum,¹³ but in the future, there will be other mining protocols available, for example, BetterHash,¹⁴ that’s currently in development.

It is also possible to run a mining node without joining a pool through a technique called solo mining , where a node contains a full blockchain copy and takes care of the current difficulty and block generation by itself as shown in Figure 2-6. It doesn’t need to join a mining pool as all the information it needs for successful mining is already located inside of it.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig6_HTML.jpg — Figure 2-6
A solo-mining node

Another example is a mobile application shown in Figure 2-7, where users are only interested in maintaining their wallets and keys. There is no need for them to keep the whole blockchain copy that’s currently over 200GB in size.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig7_HTML.jpg — Figure 2-7
A mobile node

Mining pool operators too run specific nodes that maintain connections between the original Bitcoin protocol and mining nodes that utilize Stratum or other mining protocols to participate in mining operations. Although this part of the network doesn’t belong to the decentralized Bitcoin network, it’s still a vital part of the overall ecosystem, as it helps maintaining the communication paths between nodes that create blocks and all others that validate them. However, when it comes to block propagation, this combined or extended Bitcoin network isn’t fast enough so that miners utilize yet another network called Fast Internet Bitcoin Relay Network or FIBRE ¹⁵, which helps minimizing latencies in communication between miners. As we already know, the faster miners propagate new blocks, the higher the chance that they’ll get selected as the next chain tip. The Full Nodes we will be building are shown in Figure 2-8. They will contain their own blockchain copy, manage their own wallets, and have routing activated but won’t participate in mining operations.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig8_HTML.jpg — Figure 2-8
A Full Node

There is also another type of Full Node which uses a pruned blockchain copy to save disk space. Such nodes have the same security level as nodes with full copies but don’t keep the whole transaction history locally. Their blockchain copy is of smaller size as they don’t keep previous transactions after they have been validated.

Signatures

To control ownership in Bitcoin, we need certain technologies from the field of cryptography, which is a branch of mathematics. Usually, when we say cryptography, we think about writing secret, encrypted texts or development of algorithms, that are very hard to break. But in Bitcoin we aren’t that much interested in encrypting information as everything that’s written in its blockchain will remain public forever, because it must be accessible for validation of transactions and blocks. What we want instead from cryptography are other, maybe lesser-known tools like digital signatures and keys .

Bitcoin is not only about technology but also about ownership. Before we can spend any bitcoin, we must have provided a proof that we own it. Before someone can send us any “coin”, he or she must have included certain information that points at us as future owners. The question of ownership and its transfer is essential to Bitcoin, and this is why it relies on proven cryptographical technologies, which have been available for decades. In fact, Bitcoin rests on several foundations that predate the World Wide Web, and even the DNS protocol.

One of those foundations is the Public Key Cryptography ,¹⁶ that was invented in the 1970s. With PKC we can generate key-pairs that comprise of two keys, one private and the other public. The private key we always keep to ourselves and never disclose it, while the public key can be used for communication with other parties. The mathematics¹⁷ behind this technique is quite complex and could easily fill several books so that we will only concentrate on its utilization. Both of the keys we can keep in our wallets, which are very often built into the software we’re using.

For example, the standard Bitcoin Core Wallet , which we’ll compile from scratch, is one such software. However, there are other types of wallets like Hardware Wallets, which are small devices that behave like air-gapped computers¹⁸ and never let the private key escape the security chip. Most often we’ll be keeping both types of keys in our wallets, but we could also keep private keys only, as the public ones can be easily generated based on private keys. This is due to the nature of PKC, which allows to generate an almost infinite number of public keys based on a single private key, but not vice versa. Therefore, one of the most important features of PKC is the fact that it is very easy to go from private key to public key, but practically impossible to find out a private key based on the information provided by a public key. However, although very important, this is not everything we need in Bitcoin. As already mentioned, we aren’t interested in creating secret messages but instead in securing ownership of funds written in the blockchain. For this we need an additional feature from PKC, the digital signatures .

A digital signature is a number generated by a hash function that’s being used in Bitcoin to prove the ownership of funds. If I want to spend bitcoin, I must create a transaction which shows that I am indeed the owner of this bitcoin, because otherwise Bitcoin’s scripting engine would refuse to process my transaction. Therefore, I’d use my private key as input data for a hash function that will generate a corresponding digital signature, which then can be later checked for validity by using my public key. With hash function we mean functions that can take any kind of input data, whatever their lengths, and return a digest value of a fixed length. No matter how long our inputs are, the returned “fingerprints” will always be of the same length but each time different. And even if we would change only a single element of input data, the generated fingerprint would change drastically. Hash functions are often being used to check the integrity of documents and to ensure that original data hasn’t been changed.

Another interesting property of PKC is its asymmetry: one can check a digital signature created by a private key by using a public key that’s related to this private key. The key we use to create signatures isn’t the key for checking its validity. One can easily check the results of certain operations (output), but it’s practically impossible to calculate back to original (input) values. It is easy to check a digital signature for its validity, which is the output value of an operation done with a private key, but it’s impossible to calculate the private key itself based on a signature alone. This directly gives the answer to the question on how one could ever be able to secure funds on a public blockchain. As participants never reveal their private keys and only let others check their ownership claims via public keys and signatures, there is no way for any party to steal funds from any other.

Addresses

One example of a digital signature used in Bitcoin is addresses. Most often addresses are based on hashed values of a public key. However, this is not always the case as addresses can be generated by using various inputs as we will see later. In its infancy, the Bitcoin protocol used raw public keys as addresses without hashing them in any way. Early Bitcoin Wallets could even send funds to IP addresses. This functionality was later removed as it was prone to man-in-the-middle attacks.¹⁹ Over time, more complex and also more powerful address types arrived on stage.

To generate an address based on a public key, we’d have to complete a task comprising of several steps that always begins with two hashing functions: SHA256 and RIPEMD160. Written as pseudo-source code, it’d look like this:

MyHashValue = RIPEMD160(SHA256(Public Key))

We take one of our public keys and let SHA256 generate a hash based on it. Then we use this hash as input value for RIPEMD160 function, which then returns another hash value. Basically, a hash of a hash. However, this is not the final hash value we’ll be using as our address, because in the next step, we’d have to encode it in Base58²⁰ format, which is a subset of the more widely known Base64 format. Base58 prohibits the usage of certain ambiguous characters (0, O, l, and 1) to improve readability. In most cases the Bitcoin addresses are being generated by using the Base58Check format that not only generates the final output but also checks for potential errors. Therefore, the addresses generated by it always contain four additional bytes at the end that represent the checksum of the address, which are being used for validation.

And because we have different types of data in Bitcoin, we will also need to prefix our data with a single byte that represents its version. In our case this would be a zero, because we are creating a Bitcoin address. The checksum that we generate is based on the hashed value and this version byte. To get the checksum, we use double-SHA256 function from which we then take 4 bytes and append to the hash value of our public key we generated at the beginning.

This part comprises of two steps:

Get Checksum from VersionByte+Data.
Get Base58Check format from VersionByte+Data+Checksum.

Written in pseudo-code:

Checksum = SHA256(SHA256(VersionByte + MyHashValue))

Address = Base58Check(VersionByte+HashValue+4ChecksumBytes)

The final result is a string of alphanumeric characters that begins with a 1 like this:

1Q1K26998Y5S7MfV8YwH8R1NfGo4jf73VR

This complex procedure can be graphically represented as in Figure 2-9.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig9_HTML.jpg — Figure 2-9
Generating a Bitcoin address

Here are a few examples of valid Bitcoin address-type variants with their prefixes in bold:

P2PKH: Pay-to-Public-Key-Hash²¹:

1LwQE6PHYyPDVbwmUvttH88hWiAPC74tcp

P2SH: Pay-to-Script-Hash or SegWit²²:

32LnEubWP9T9FDWJbEk3DzfycH7DnZA7Dx

Bech32 (P2WPKH or P2WSH):

bc1qlm2lv7nw9tede4gwlgq8vhdp0aqu88gphyh4jm

The first example represents the oldest address type that directly maps to a public key. This is the address we just generated with our pseudo-code. The other two address types are based on hashes generated from scripts that we will meet in later chapters. A script in Bitcoin means an operation that will be executed during a transaction and whose returned value will decide if this transaction will be accepted or not. The scripts could contain any condition or logic that return Boolean values (true/false). Later, we will learn how to write scripts, but for now we should keep in mind that P2SH and Bech32 addresses can represent more complex structures than mere public keys. Instead of having a hash of a public key, we now have a hash of a whole script, which is an advantage as hashes are of constant length no matter how long the input value was.

These address formats can be used freely and without any constraints. One can send funds from and to other address types without any problems. The reason Bitcoin has different address types is because of historical and technical reasons. For example, P2SH-type addresses aren’t based on public keys but instead on hashes of scripts, which must be executed by their respective receivers to receive funds.

To test all those key types, one can use web sites like https://bitaddress.org. However, if you are going to generate addresses for private use, you must not generate them in an online browser as there is always a security risk that someone could eavesdrop your communication and steal your private keys. Better download the page and go offline when creating new key pairs. Another way to create them is by using the command getnewaddress ²³ in the Console Window of the Bitcoin Core client or via the command line tool bitcoin-cli . You can also set the type and alias of the address to be generated when executing this command.

To generate a legacy address with bitcoin-cli , we type

bitcoin-cli getnewaddress "my_first_btc_address" legacy

The result will be an address with prefix 1, like this one:

1NgaCnCfCaVRZULduiVn2AZAy9zsAd342V

The string “my_first_btc_address” is the label for the address. To generate a P2SH-SegWit address a different flag is needed:

bitcoin-cli getnewaddress "my_p2sh_address" p2sh-segwit

The resulting address would begin with 3, like this one:

3JERmTm3qsqDRVpomiEvgj6YRVFV4Ua49s

To generate a Bech32-type address , one would use the flag bech32. Every public key is a result of a special mathematical operation called the elliptic curve multiplication ,²⁴ where the corresponding private key is being used to calculate a public key. To find out the private key of an address, we use the command dumpprivkey

dumpprivkey 3JERmTm3qsqDRVpomiEvgj6YRVFV4Ua49s

The result would be the private key prefixed with K or L:

KxRWVQ2nLLoptGHKCK14Jok91RQs6WMkmJNkoa2kjXR4x2WuRPF3

If your wallet is encrypted, you will have to unlock it first with command walletpassphrase , before you can use dumpprivkey and similar operations, which are able to change the contents of a wallet. Of course, such commands should never be used in public or on machines, which are not properly secured, as knowledge of private keys ultimately means ownership of funds associated with them. Not your keys, not your coins, is a well-known saying in Bitcoin circles. Another useful command for handling addresses in dumpwallet , which is used to export all keys to a human-readable local file. The opposite command, for importing such files, is importwallet.

The public and private keys being used in Bitcoin can be represented by using different formats. The software itself relies on raw bytes, and most of the time, we won’t have to deal with those 256-bit numbers. Instead, we will be using WIF (wallet import format) keys, which combine public key hashes, version prefixes, and checksums.

There are also testnet²⁵ and regtest ²⁶ address types, as we will see later, which are being used for testing purposes. In Table 2-1 is shown a list of address types, their Base58 prefixes, and hexadecimal counterparts.

Table 2-1

List of available address types in Bitcoin

Type	Hexa	Base58	Example
P2PKH	0x00	1	1JM25UwUUkGuKxzjjWzHGH8a556GTakJFW
P2SH	0x05	3	3GJAAhX7fnR4TDFoLR1gYMroA1ZRVF6Mzg
Private key compressed (WIF)	0c80	K or L	KyiAsiqteZL7yg5qzuPN5HvWHZrE5MbAUtr44YfmE3KpKVcWNhqu
Private key uncompressed (WIF)	0c80	5	5JhEdxAiep1fATDEa6TiTf3wFDc2Q6rifhMjv2AcY6j3JqdGnMX

Transactions

Without transactions there would be no Bitcoin. Everything revolves around transactions as only through them the ownership and transfer of funds are made possible. All other parts of the Bitcoin ecosystem are involved in securing, propagating, and processing transactions. Bitcoin transactions are both network signals and blockchain entries, which deal with transfer of funds between parties. Although easily accessible via various blockchain explorers, transactions we see there have little in common with real transactions as they appear in network messages or get written in the blockchain.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig10_HTML.jpg — Figure 2-10
A transaction shown on blockchain.info web explorer²⁷

We will now try to “translate” the data shown in Figure 2-10 into entries as defined by the Bitcoin protocol. Here we see an address on the left that’s transferring around 12.54BTC to two addresses. We also see some data under the two target addresses, which describes inputs, outputs, and fees. This is quite a lot of information at once, so let’s dissect this transaction by using the commands available via the Bitcoin Client or Bitcoin CLI. In general, all options available via the graphical client are also available via the command line interface (CLI), and very often it’s recommended to use the CLI, as this tool can be used to create more complex shell scripts and doesn’t require a GUI system. To list the raw data that constitutes the above transaction, we use the command getrawtransaction . Afterwards, we use decoderawtransaction to turn this long hexadecimal string into a human-readable JSON²⁸ object, which contains following values ordered as key-value pairs. Each pair has a name (key) and a value assigned to it. This is the JSON object referencing the transaction:

{

"txid": "d6c92078059fcea673ac07246552c976be6cee5d37990486100dfdde220a2134",

"hash": "d6c92078059fcea673ac07246552c976be6cee5d37990486100dfdde220a2134",

"version": 2,

"size": 223,

"vsize": 223,

"weight": 892,

"locktime": 0,

"vin": [

{

"txid": "e5244f26b62119a2bc45e6c7c8975243662e487f7e9369a9dcc4fafa0f8190c4",

"vout": 0,

"scriptSig": {

"asm": "3044022051acb984fe96ebc690ac7987c4531875bb39466bb557b33ecc952d514842407c02202d691613c6974f53381d938b83b883a795e81e40ef07d8917eb9ce115bcc0c0d[ALL] 03ebbc9c06350d95f1a50273b3bbb46f51827168afea87b8240bc0c2adc520202e",

"hex": "473044022051acb984fe96ebc690ac7987c4531875bb39466bb557b33ecc952d514842407c02202d691613c6974f53381d938b83b883a795e81e40ef07d8917eb9ce115bcc0c0d012103ebbc9c06350d95f1a50273b3bbb46f51827168afea87b8240bc0c2adc520202e"

"sequence": 4294967295

}

"vout": [

{

"value": 11.94413003,

"n": 0,

"scriptPubKey": {

"asm": "OP_DUP OP_HASH160 c6ca904d6c12bd983b983ae21b7699dde96a3a5f OP_EQUALVERIFY OP_CHECKSIG",

"hex": "76a914c6ca904d6c12bd983b983ae21b7699dde96a3a5f88ac",

"reqSigs": 1,

"type": "pubkeyhash",

"addresses": [

"1K87TMWzsQLzdZb4kUmKAcrCWnQqdVW5QJ"

]

}

{

"value": 0.59808344,

"n": 1,

"scriptPubKey": {

"asm": "OP_HASH160 1574b4d35af35f2b7ec7cbe96c9f85adc91a0521 OP_EQUAL",

"hex": "a9141574b4d35af35f2b7ec7cbe96c9f85adc91a052187",

"reqSigs": 1,

"type": "scripthash",

"addresses": [

"33eTvUJYuaMLqVh2ejG83UAcYeAgCwgNWW"

]

}

]

}

There’s plenty of information as we see, but where does the transfer of funds happen? We see some addresses, but there is no direct way of recognizing who gets what. To answer this question, one has to describe how Bitcoin transactions actually work. As we already know, there are no “accounts” in Bitcoin, that participants could use for spending or receiving funds. Being decentralized, Bitcoin offers no option to set up any kind of registration authority for its users. And because there are no “user accounts”, the first problem we approach is: How can we safely send funds from one party to another, if there are no “parties” at all? One part of the answer lies in the asymmetric cryptography and digital signatures, which take care of assigning funds to their owners and prohibiting access to anyone, who’s not able to provide a valid proof of ownership. As we have seen already, digital signatures are related to private keys but can’t be used to calculate them. Only owners of private keys are able to access funds assigned to their corresponding signatures. Another important fact is the way Bitcoin protocol sees those funds. Everything that’s not been spent is part of the global UTXO set (unspent transaction output). In fact, there are no “coins” in Bitcoin. They only exist as a user-friendly concept that’s constructed by applications which implement the Bitcoin protocol. The rest of the answer we will discover shortly.

All the network sees is a set of unspent outputs from every owner, regardless when the last transaction happened. Just like with “coins”, Bitcoin knows nothing about addresses, as they’re mere user-friendly concepts and not directly visible in the blockchain. There is no way for anyone to go from one address to another or track any inputs or outputs. The only structure Bitcoin sees are small programs that get executed by its internal scripting engine. Bitcoin’s functionality relies on an embedded programming language that never got a proper name so that we call it “Script”. Later we will learn a bit more about it, but for now, we should keep in mind that all Bitcoin does is execute small scripts (programs), which ultimately decide about ownership of funds.

When we want to spend our coins, we have to give exact information which of the parts of the available UTXOs belong to us, that is, we have to provide a valid proof of ownership to the network so we can access those funds. Spoken more abstractly, we could say that creating transactions in Bitcoin is actually changing the state of the UTXO set. There is actually no option to move any coins in Bitcoin, because there are not only no coins available, but also because there isn’t anything that could be moved at all. The only option we have, and this is the essence of every transaction, is to change the ownership information regarding some part of the UTXO set. What moves in Bitcoin is the ownership objects and not the objects themselves.

In many ways, it would be better to use the land metaphor to describe ownership in Bitcoin than metallic coins. Just like a person can own a piece of land and hand it over to someone else, without ever being able to move it, the same happens with bitcoins, which one “sends” to someone else. In most cases, the balance is being calculated by the software used to access Bitcoin’s network. The sum of all unspent outputs is presented as a certain number of bitcoins, which is just a more convenient way to present the current UTXO state to the user.

And just like any other currency, bitcoins too can be divided into smaller parts. The division can go down to eight decimal positions. The smallest unit is called Satoshi, named after Bitcoin’s pseudonymous creator. But one should keep in mind that bitcoins never move inside the blockchain. Moreover, the transactions keep a record of the current state of the global UTXO set . And some of those pieces of UTXO belong to us, because we can provide valid proofs of ownership by using our private keys and digital signatures as shown in Figure 2-11.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig11_HTML.jpg — Figure 2-11
The global UTXO set

Another important rule regarding UTXOs is that they can only be consumed once and as a whole. There is no way to split up an UTXO before including it in a transaction. This of course raises the question of what should be done when an UTXO contains a value that’s higher than the one we would like to spend. If I own two bitcoins, which come from some previous transaction, but want to spend only one in a future transaction, what would happen to my other bitcoin? The rule in Bitcoin protocol states that any bitcoins left automatically get included in transaction fees paid to the miner of the respective block. But we of course wouldn’t want to lose our remaining bitcoins.

If we look at the JSON object shown before, we notice the vin entry. This is the input transaction , that consumes all UTXOs, which we will be using to create a new output transaction . Before we can create any outputs, we have to consume some inputs. This is the source of our funds, that we will assign later, in the vout-section of the JSON object. There, we see two entries, each with a different value and addresses . This is the place where the previous input funds get sent to new addresses. In most cases, one of the entries will be a so-called change address ²⁹ that’s sending the difference back to the creator of transaction. Usually, the software we use would automatically generates such an address. This is the place where we take care of getting back all the bitcoins that we don’t want to send (or let miners take them from us). By following this strategy, the Bitcoin protocol elegantly avoids any kind of accounting logic, because everything that deals with the ownership of funds can be maintained by using transactions alone.

However, there are no rules without exceptions, and in this case the exception is a special transaction we already named in the previous chapter: coinbase.³⁰ This transaction has no input UTXO as we would maybe expect. Instead, its role is to bring new bitcoins into existence and also collect all the fees for transactions included in the block. Both new bitcoins and fees constitute the reward for the miner who created the block. In fact, the input transaction of this special transaction is called coinbase .³¹ Here is an example from block 1000 that was generated in 2009.

"vin": [

{

"coinbase": "04ffff001d02fd04",

"sequence": 4294967295

}

"vout": [

{

"value": 50.00000000,

"n": 0,

"scriptPubKey": {

"asm": "04f5eeb2b10c944c6b9fbcfff94c35bdeecd93df977882babc7f3a2cf7f5c81d3b09a68db7f0e04f21de5d4230e75e6dbe7ad16eefe0d4325a62067dc6f369446a OP_CHECKSIG",

"hex": "4104f5eeb2b10c944c6b9fbcfff94c35bdeecd93df977882babc7f3a2cf7f5c81d3b09a68db7f0e04f21de5d4230e75e6dbe7ad16eefe0d4325a62067dc6f369446aac",

"reqSigs": 1,

"type": "pubkey",

"addresses": [

"1BW18n7MfpU35q4MTBSk8pse3XzQF8XvzT"

]

The first entry in its “vin” is coinbase , and its “vout” comprises of a single entry that spends 50BTC, which was the initial block reward before the first halving in 2012, to address 1BW18n7MfpU35q4MTBSk8pse3XzQF8XvzT. Also, we see that the given type of address is pubkey . As mentioned before, the initial protocol version only used public keys without hashing them. Nowadays, the usage of raw public keys as addresses is discouraged, and only hashes of public keys (P2PKH) or pay-to-script-hash (P2SH) should be used. Another important piece of data is the number associated with reqSigs key . This value indicates the number of required signatures that is needed to unlock the funds. In both of the JSON objects presented, there was only a single signature needed, but Bitcoin also supports so-called multisig ³² transactions, that is, transactions where multiple signatories (each with own private key) create signatures that get included in a particular transaction.

To later access those funds, a certain minimal number of parties is needed to prove their ownership of funds by providing signatures, which correspond to previously given public keys. There are various options to create such multi-signature transactions, for example, 2-of-3, 2-of-2, etc. Each combination means that at least a certain number of signatures must be given, before funds can be spent. So, 2-of-3 means that if there were three signatories who locked the funds, later at least two of them must give their consent to spend the funds. Such combinations are very useful when executing contracts created in the real world, outside of blockchain, where parties could get prevented from or deliberately refuse to participate in the unlocking procedure. Also, multi-sig transactions could be used to enhance the security of funds, as one could keep several keys on different machines, or hardware wallets. In case of theft, funds would still be safe, as the attacker would need access to multiple keys, which is less likely to succeed. We will later talk about them and their importance in the Lightning Network in more detail. For now, we will only think about two participants in a transaction: a sender and a recipient.

To complete the analysis of a Bitcoin transaction, we have to understand how bitcoins get spent in the first place. As there is no way to spend anything in Bitcoin without having received it via some input transaction, we will now check the parts that make an output transaction possible. This may at first sound illogical; when we say that to be able to send bitcoins out of our wallet, we need an input transaction first, but don’t forget that Bitcoin protocol knows no accounts. Therefore, the only way to move anything in Bitcoin is to change the state of the global UTXO set. What makes an output transaction applicable is the existence of a previous input transaction that we use to prove our ownership of bitcoins.

If we look more closely at the example transaction shown previously, we recognize that we don’t have a complete knowledge about bitcoins that came in. All we see is a transaction ID (txid) and another vout key, similar to those we saw in our own output transactions. And there is a large chunk of data under scriptSig that contains unreadable hexadecimal strings.

"vin": [

{

"txid": "e5244f26b62119a2bc45e6c7c8975243662e487f7e9369a9dcc4fafa0f8190c4",

"vout": 0,

"scriptSig": {

"sequence": 4294967295

}

The solution to this puzzle lies in the way how Bitcoin network and its nodes operate. Each time a node wants to send bitcoins, it will collect the inputs from UTXOs it controls by using its own private keys. Those inputs are coming from other output transactions of course, and by going back in time, we would basically reach the very first transaction in the Genesis Block. Everything in Bitcoin can be traced back, because everything is part of the global UTXO set. The bitcoins never change; only the state of the global UTXO set does. In our preceding case, we want to use “some bitcoins” from the transaction with id:

e5244f26b62119a2bc45e6c7c8975243662e487f7e9369a9dcc4fafa0f8190c4

But as this transaction could have many outputs, we must point at the one where our “input bitcoins” come from. For this we are using the vout key and the number 0, which means the first entry from the list of outputs coming from this transaction. Here we declare a previous output to become our own input. But as we know, we can’t simply point at things in Bitcoin and reclaim ownership. We have to provide a valid proof of claimed ownership. To successfully complete this task, we must also include another piece of code, the unlocking script, that will be executed by Bitcoin’s scripting engine. The result of this execution must be the Boolean value of TRUE.

If we retrieve and decode this transaction with getrawtransaction and decoderawtransaction commands , we’ll find the UTXO containing 12.54BTC together with the locking script at the entry scriptPubKey.

"vout": [

{

"value": 12.54321347,

"n": 0,

"scriptPubKey": {

"asm": "OP_DUP OP_HASH160 c6ca904d6c12bd983b983ae21b7699dde96a3a5f OP_EQUALVERIFY OP_CHECKSIG",

"hex": "76a914c6ca904d6c12bd983b983ae21b7699dde96a3a5f88ac",

"reqSigs": 1,

"type": "pubkeyhash",

"addresses": [

"1K87TMWzsQLzdZb4kUmKAcrCWnQqdVW5QJ"

]

}

These “input” bitcoins have been used to generate the next transaction from the example at the beginning. But they could only have been used after the puzzle given by scriptPubKey has been solved. What we see in scriptPubKey is one-half of the complete script that has to be executed by Bitcoin’s scripting engine. The script itself contains a few keywords (OP_DUP, OP_HASH160, OP_EQUALVERIFY) and a hexadecimal value, which will be discussed later. For now, it is sufficient to know that this half of the script demands that anyone who claims to be the owner of these bitcoins must provide the rest of the script so that it can be executed successfully. What’s missing right now is the other half: a signature and a corresponding public key. This data will be included when the UTXO of this transaction should become input of another transaction, that is, when someone wants to spend those funds. The value in scriptSig from the referencing transaction we discussed previously is the unlocking logic we need to spend the funds. The moment these two halves get combined and executed is when the scripting engine checks a transaction for validity. It is also important to know that there is no state during executing of scripts. We say that Bitcoin Script environment is stateless, as there are no preconditions for any script to be executed. The Engine takes the two scripts and checks if it can produce a TRUE by executing them one after another. There is nothing else that could ever influence this execution. Either a script contains everything that’s needed and it succeeds or it fails.

In the early versions of Bitcoin, both of the scripts were executed together. This was later changed due to security reasons, because it was possible for an attacker to spend funds without being the actual owner. Nowadays, the scripting engine executes scriptSig first, and if there are no errors, it executes scriptPubKey as well. If it succeeds at the end, the new transaction will be allowed to spend the referenced input bitcoins. This new transaction will later be included in one of the upcoming blocks. But if the attempted transaction fails, it will get discarded and can’t make it into a block. This is how Bitcoin ensures that only those who possess private keys can access the corresponding funds. However, unlocking scripts don’t have to be signatures and public keys only, but in most cases the unlocking script will contain such data.

In our preceding example, the owner was able to provide valid unlocking logic, and therefore the transaction that wanted to spend 12.54BTC made it into the block. Every input transaction references some output transaction, because for bitcoins to be spent, they must have come from some other transaction. The only exception to this rule is the very first transaction in the Genesis Block, where it didn’t have a preceding transaction, because there was no block before Genesis Block itself. This is also the reason why the initial transaction can never be spent, because there is no valid unlocking script for it. The irony is that the very first 50 bitcoins are locked forever. According to some calculations, four million bitcoins have been lost due to various reasons.³³

Bitcoin Script

Everything that touches the blockchain is a script.³⁴ From a purely technological perspective, we could say that Bitcoin is a giant script execution engine, because every time we send or receive funds, we do create transactions that carry little scripts of different complexities with them. In most of the cases our scripts transfer ownerships from one address to another. And the language used to execute them started its existence as an embedded part of Bitcoin’s source code. However, unlike most other programming languages, this one never was designed upfront, with a properly described grammar and syntax. It didn’t even have a name, which is the reason why it’s still simply being called “Script”. Its semantics and structure resemble an obscure language designed in the 1970s, called Forth.³⁵ Unlike most of the modern programming languages, it uses the stack as its machine abstraction. This means that it pushes and pops certain data and operators to and from the stack during execution. To describe its algorithms, it uses the Reverse Polish Notation .³⁶

For example, to write and execute the addition operation in a language similar to Script, you wouldn’t use the familiar 2 + 3 but instead 2 3 +. This notation might look pretty unusual, but as Script uses the stack as its machine abstraction, the Polish Reverse Notation is a perfect match. Let’s imagine a stack structure inside our machine’s memory that we fill with the preceding data as shown in Figure 2-12. According to our new notation, we’ll first have to push the two numbers into the stack. The next step will be to read the operator +.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig12_HTML.jpg — Figure 2-12
Pushing the elements into the stack

Now the question is, when is the operation executed? The answer is simple: as soon as our hypothetical scripting engine reads the sign +, the two operands will be popped from the stack and used in the addition operation. Subsequently, the result would be pushed back into the stack as shown in Figure 2-13.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig13_HTML.jpg — Figure 2-13
Result got pushed back into the stack

This is how stack machines work. In Bitcoin’s script there is an operator, or Op-Code, named OP_ADD , that’s being used instead of the + sign. In general, one can push as many data into the stack as the stack memory allows, but as soon as an operator is being read, the stack machine would execute it by taking as many operands as the operator expects and then later pushing back any results into the stack.

When it comes to Bitcoin’s scripting engine, as long as the last value returned is a Boolean TRUE or any other nonzero value, the execution is considered successful. The last value remaining in the stack must be of that type to mark the respective transaction as valid.

Of course, if we provide scripts with missing operands or wrong operators, we’d get an error. There’s also a memory limit regarding script sizes, so the flexibility of putting as many data as one wants is only in theory. To prevent spam and other attacks, Bitcoin’s scripting engine imposes several constraints. As every participating node must validate each script, the possibility of running “bloated” scripts would open an attack surface for DDoS-ing³⁷ the whole network.

Let’s now try to follow the execution of the locking script from scriptPubKey in the JSON-formatted transaction structure we used previously. The operators are prefixed with OP_, while the only operand is the hexadecimal value in the middle of the script.

OP_DUP OP_HASH160 c6ca904d6c12bd983b983ae21b7699dde96a3a5f OP_EQUALVERIFY OP_CHECKSIG

We see two operators: OP_DUP and OP_HASH160 (we will from now on write operators without the OP_ prefix). Then we see a hexadecimal value, probably a hash. At the end of the string, we see two more operators EQUALVERIFY and CHECKSIG . In total, we have four operators but only one operand. This script doesn’t look like it could be executed successfully. Indeed, this script could never succeed, because it is designed to remain incomplete as long as there is no valid unlocking script, which we have to take from scriptSig in the input transaction that spends those 12.54 bitcoins. As we already talked about UTXOs, to be able to spend them, one must provide a valid unlocking script. In most cases the unlocking script contains a signature and a public key of the spender, but there are also other ways to define them. We will for now stick with this option as it represents the majority of unlocking scripts. The content under the key hex in vin of the spending scriptSig look like this:

473044022051acb984fe96ebc690ac7987c4531875bb39466bb557b33ecc952d514842407c02202d691613c6974f53381d938b83b883a795e81e40ef07d8917eb9ce115bcc0c0d012103ebbc9c06350d95f1a50273b3bbb46f51827168afea87b8240bc0c2adc520202e

This is the serialized content of spender’s digital signature and public key. The term serialization means that the binary representation of data in memory gets converted into its physical form, for example, as an entry in a text file. The opposite of it, deserialization , is the process of reconstructing data in memory based on some physical representation.

The reason we use signatures and public keys here is to express a commitment to a certain transaction. We are referencing a previous transaction, whose data we want to use as inputs for our spending, or output transaction. Our transaction therefore is the message we’re sending to the network. And to prove that we are indeed allowed to spend claimed bitcoins, which are the inputs referenced by our transaction, we provide our signature and public key. In the next step, every validating node would receive our message and validate it. By utilizing the properties of the asymmetric cryptography, every node will then execute the combined locking and unlocking scripts to check for validity of our claim of ownership. This is how a complete script and operation on it would look like. The parts in bold are from our scriptSig.

[Sig] [PK] OP_DUP OP_HASH160 [PKHash] OP_EQUALVERIFY OP_CHECKSIG

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig14_HTML.jpg — Figure 2-14
Script content before execution

In Figure 2-14 we see that Bitcoin Script has prepended the signature and public key to the locking script taken from scriptPubKey . We now have the complete script that can be executed. The execution is done by reading data and commands from left to right. In the preceding figure, we see the script execution pointing at the first entry and moving further to the right. By default, the scripting engine pushes data into the stack until it reads an Op-Code. The Op-Codes don’t get pushed into the stack but executed as they aren’t data but code with certain logic embedded. Often, an Op-Code requires a certain number of elements to be popped from the stack and returns a value to be pushed back into the stack.

However, a command can never take an element from the stack that’s below the topmost element. The only way to pop the elements is one by one, beginning with the topmost element. In our case the engine pushes two elements, one after another, onto the stack: a signature and a public key. Then, after the stack pointer has read DUP, it would execute its logic, in this case it’s duplication of elements, by popping the first element, the public key, and then pushing back two public key elements into the stack, one after another of course. The pointer now moves further to the right and reads another command (Figure 2-15).

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig15_HTML.jpg — Figure 2-15
Execute DUP

The command we execute next is HASH160 as shown in Figure 2-16. This command pops one element, the Public Key, and returns its hash.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig16_HTML.jpg — Figure 2-16
Hashing the Public Key

We then push the Hash value from the script into the stack (Figure 2-17).

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig17_HTML.jpg — Figure 2-17
Pushing the hash value from script into the stack

The next command shown in Figure 2-18 is EQUALVERIFY that checks for equality between two hashes and therefore will pop two topmost values to complete its operation. The goal of this operation is to check if the expected public key hash from the locking script equals to the hash of the provided public key from the unlocking script. If the result is not TRUE, the script execution would fail immediately. Otherwise, it would continue reading the next data or Op-Code. Unlike previous operations, EQUALVERIFY³⁸ returns nothing. There exists a related operation called EQUAL which would return TRUE, but in our case we need no return value but instead a continuation of script execution that is given by using EQUALVERIFY .

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig18_HTML.jpg — Figure 2-18
Checking for hash equality

Our last step is shown in Figure 2-19. Here we have to check if the given signature corresponds to the public key. If successful, the CHECKSIG operation would push a TRUE into the stack, which would signal the Script Engine to accept the transaction and allow the UTXO in question to be spent.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig19_HTML.jpg — Figure 2-19
Checking if signature corresponds with Public Key

Although Script offers a wide range of Op-Codes and even branching in form of special IF-ELSE constructs, it is still not a Turing complete³⁹ programming language. This means that Script has no loop constructs like most other languages do (for-each, while-do, loop-until, and similar semantics). This design choice was deliberate to prevent code execution that could exhaust node’s resources. Imagine a script with a never-ending for-each loop. This would be disastrous as nodes could never complete their validations tasks, which would ultimately lead to a halt of the blockchain.

Wallets

The term wallet in Bitcoin is both a misnomer and something that means very different things. For example, the standard Bitcoin Core application is called a wallet. Additionally, the structure that is managing keys inside the application is called wallet too. And then there are hardware wallets, small devices for keeping private keys in a secure way. But irrespective of its particular usage, the term wallet still remains misleading as it suggests that it’s containing bitcoins, which is impossible as those never leave the chain. More precisely, bitcoins don’t exist at all as Bitcoin protocol only deals with UTXOs, the unspent transaction outputs. The visual representation in form of bitcoin balances is done in the application stack that is located above the raw Bitcoin protocol environment. Therefore, it’d be much better to call all those different “wallets” keychains, because that’s what they really do: keeping and managing key pairs .

There exist two variants of wallets, nondeterministic and deterministic . The nondeterministic wallets are available since the very first version of Bitcoin Core software and basically mean that each key pair is created separately from any other key pair. Such wallets are basically sets of randomly generated key pairs. This of course makes them pretty hard to manage and restore, as we would have to keep track of every key pair and also make sure of not using an address twice, which is always a bad practice as it reveals too much information about our transactions and funds. In general, the usage of nondeterministic wallets is strongly discouraged.

The deterministic wallets,⁴⁰ also called HD⁴¹ (hierarchical deterministic) wallets, are wallets, whose private keys are derived by using a one-way function that consumes a special input data called “seed”. The seed is technically a randomly generated number that later gets combined with additional data like index number and “chain codes” to generate private keys. The advantage of deterministic wallets is that one only needs seed data to restore a complete wallet, regardless how many private keys have been generated already. Unlike nondeterministic wallets, the private and public keys in deterministic wallets are derived one from another. Therefore, to recreate such a wallet, only the initial seed is needed. The rest would follow automatically.

It is also possible to reuse the seed between different software and hardware tools as long as they support the same regeneration schema. Currently, there exist two variants of deterministic wallets, Type-1 and Type-2. Type-1 wallets use their seeds to create private keys, which in turn can be used to produce further private keys, and so on. Basically, they create a linear structure with the seed at the top as shown in Figure 2-20. The technique behind this key generation lies in the application of the random number. We start with the seed that produces the first private key; the second private key then will be generated by using a random number that’s related to seed’s random number. The second private key will then be based on random data from the previous key and so on. This way we only have to keep the seed as all other random numbers will be directly or indirectly based on it anyway.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig20_HTML.jpg — Figure 2-20
Type-1 deterministic wallet

Although sufficient for private use, such wallets have a significant weak point: it is not possible to create public keys without having access to corresponding private keys. This of course makes them unusable for use-cases where keys are exposed to untrusted public, for example, merchant applications. Imagine a web-facing shop application that offers a Pay-with-Bitcoin option. To generate a public key for each new invoice, this web server would have to have access to seed and private keys simultaneously, which would make it an ideal target for attacks. In such cases we need an option to generate public keys without having access to any private key.

To overcome this problem, a new variant of nondeterministic wallets, Type-2, was created. Such wallets are able not only to generate keys based on a single seed but can create whole sequences of child-keys, which in turn can have their own child-keys and so on. Also, Type-2 wallets can generate public keys without having access to private keys, which solves the abovementioned problem as we see in Figure 2-21.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig21_HTML.jpg — Figure 2-21
Type-2 deterministic wallet

With Type-2 wallets we can create different branches that could, for example, represent divisions within a company. Each division could have its own “root” private key that would not disclose any of its siblings, thus making it impossible to guess other keys and their structures. Also, a branch could be solely based on a public key alone like in the web-facing merchant application mentioned previously.

The most important part of HD wallets is of course the seed which can be expressed by using mnemonics that are words used from a vocabulary of 2048 words, which are based on one of the supported human languages. The BIP⁴² that defines it is BIP39.⁴³ The HD wallet is defined in BIP32. The creation of a seed is done by application of several mathematical functions:

Generate a random sequence between 128 and 256 bits.
Calculate a checksum of the above sequence by taking the first 4 bytes from its SHA256 hash.
Expand the sequence by adding the checksum (it now becomes 132 bits long).
Split the sequence into equal 11-bit chunks.
Map those chunks to words from a 2048-word vocabulary.

The result is a mnemonic that would contains words like these:

strategy hunt session whale galaxy vocal skull better evoke tool inhale hybrid

The corresponding root private key would look like this:

xprv9s21ZrQH143K2doWLwAwAvkbicMRWfL7tZrtfSVUqNu8n56q9iKUH8G7zChrm1maSVVBt44DyktN6GXEag8L2sY6ee8FofNRGdZwEfikPSM

As we see, this private key looks quite differently from those we saw before. It is longer and it has a different prefix, xprv. This is because after we have created the mnemonic sequence, additional steps need to be taken to produce the actual seed number. The human-readable mnemonic is only meant for us to be kept in a safe environment. The next steps are what create the actual numeric seed value.

Apply the PBKDF2 ⁴⁴algorithm by giving it the mnemonic and a “salt” value as inputs. Optionally, the salt can be expanded by entering a passphrase that adds additional randomness to the procedure. If no passphrase was entered, the value of salt would only contain the string value “mnemonic”.
PBKDF2 algorithm would then stretch the original mnemonic by applying 2048 rounds of hashing with HMAC-SHA512 algorithm. The resulting value would be 512-bit long. This is the actual seed.

This seed is then used to create the master private (xprv) and public (xpub) keys as shown in Figure 2-22.

../images/486570_1_En_2_Chapter/486570_1_En_2_Fig22_HTML.jpg — Figure 2-22
Generating master private and public keys

These keys can later be used to create “normal” private and public keys. The seed also contains the chain code (the right 256 bits of the 512-bit seed), which is used to create different key chains within the tree structure. This way we can split up the wallet into several divisions, each with its own master private key and child-keys.

The optional passphrase should not be understood as “password” but moreover as a way of creating different wallets based on the same word mnemonic. This strategy can be applied to create different “wallet variants” that could serve different purposes. In extreme cases, for example, extortions, a person could “give away” the seed without giving the accompanying passphrase. The wallet based on seed alone would, for example, contain only a small amount of funds. The “correct” wallet, the one with larger amounts, could only be recreated with the passphrase. The passphrase is actually a pointer to a variant of a wallet, as there can be many of them based on the same seed but with different passphrases.

To experiment with the technology behind HD wallets a web site located at https://iancoleman.io/bip39/ is recommended. The various options offered there allow for testing of different scenarios. It is also possible to run this page locally, without a network connection, which should always be done when creating wallets for real-world usage.

Summary

In this chapter we have learned how Bitcoin works and what its most important parts are. We have looked into the Blockchain, the peer-to-peer network, with its various node types and the Bitcoin’s scripting engine. We have learned to understand scripts and have written and executed one that’s sending funds from one address to another. We have learned that Bitcoin only sees transactions and that everything else revolves around this concept. Without transactions Bitcoin wouldn’t be possible. Although Bitcoin knows nothing about “coins”, we have learned that various applications can help us abstract away many complexities from the underlying architecture. However, we have also learned that Bitcoin maintains a specific structure called the UTXO set, which transitions from one state to another each time a new block gets created. To send funds in Bitcoin actually means to change the UTXO set, which is the number of unspent transaction outputs. Ultimately, we have learned that the ecosystem around Bitcoin offers different options on how to keep our private keys safe, from old wallet types that carry around bunches of keys to highly sophisticated industry standards like HD wallets, which help us create and maintain separate key chains. We have delved into the technology behind HD wallet creation and followed the various steps toward a complete HD wallet based on a single seed mnemonic that can be safely stored outside any software application or device.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 2. Bitcoin Technology

Create new playlist

Sign In

Sign Up

2. Bitcoin Technology

Blockchain

Peer-to-Peer Networking

Node Types

Signatures

Addresses

Transactions

Bitcoin Script

Wallets

Summary

Table of Contents for
2. Bitcoin Technology