Main dataset, available as http://www.vo.elte.hu/bitcoin/blockchain_new.zip
===========================================================================
1. blockhash.txt enumeration of all blocks in the blockchain, 397571 rows, 4 columns:
blockID integer id used in this database (continous range [0,397570] )
bhash block hash (identifier in the blockchain, 64 hex characters)
btime creation time (from the blockchain)
txs number of transactions
2. txhash.txt transaction ID and hash pairs, 108996613 rows, 2 columns:
txID integer id used in this database (continous range [0,108996612])
txhash transaction hash used in the blockchain (64 hex characters)
3. addresses.txt BitCoin address IDs, 124857865 rows, 2 columns:
addrID integer id used in this database (continuous range [1,124857865], the special value addrID == 0 is used for invalid / missing addresses)
addr string representation of the address (alphanumeric, maximum 35 characters; note that the IDs are NOT ordered by the addr)
4. tx.txt enumaration of all transactions, 108996613 rows, 4 columns:
txID transaction ID (same as in the txhash.txt file)
blockID block ID (same as in the blockhash.txt file)
n_inputs number of inputs
n_outputs number of outputs
5. txin.txt list of all transaction inputs (sums sent by the users), 287902875 rows, 4 columns:
txID transaction ID (same as in the txhash.txt file)
i input ID (i.e. an integer identifying individual inputs, a txID -- i pair will appear only once in this file)
addrID sending address (same as in the addresses.txt file)
value integer sum in Satoshis (1e-8 BTC -- note that the value can be over 2^32, use 64-bit integers when parsing)
6. txout.txt list of all transaction outputs (sums received by the users), 322855563 rows, 4 columns:
txID transaction ID (same as in the txhash.txt file)
i output ID (i.e. an integer identifying individual outputs, a txID -- i pair will appear only once)
addrID receiving address (same as in the addresses.txt file)
value integer sum in Satoshis (1e-8 BTC -- note that the value can be over 2^32, use 64-bit integers when parsing)
7. nonstandard.txt a list of transactions with nonstandard outputs, 962274 rows, 1 column; these refer to transactions where the receiving address cannot be decoded; in these cases the addrID in in the txout.txt and txin.txt files is 0; corresponding outputs and inputs can be linked together with the txlinks.txt file:
txID transaction ID (same as in the txhash.txt, txout.txt, txlinks.txt, txin.txt files)
8. multiple.txt a list of transaction outputs, where multiple addresses receive the sum together; for these, only the first address is present in the txin.txt and txout.txt files (the rest can be identified from this file); 1033384 rows, 3 columns:
txID transaction ID
i output ID
addrID receiving addresses (a txID -- i pair will appear with multiple addresses)
Extra data (optional):
======================
http://www.vo.elte.hu/bitcoin/txlinks.zip
txlinks.txt links connecting transaction outputs with previous transcation inputs, 287902875 rows, 4 columns:
txID transaction ID (from the txhash.txt file)
prev_txID ID of previous transaction (whose output is being spent)
i input ID (note: one txID -- i pair appears only once)
prev_i output ID in the previous transcatoin (note: one prev_txID -- prev_i pair appears only once)
http://www.vo.elte.hu/bitcoin/contraction.zip
contraction.txt list of addresses possibly belonging to the same user, 124857864 rows, 2 columns:
addrID address ID (from the addresses.txt file)
userID ID of identified user (not continuos, each two addrID which belong to the same "user" appear as inputs in the same transaction at least once)