blockchain, netsync: implement a complete headers-first download during ibd #2428

kcalvinalvin · 2025-09-22T16:18:14Z

Change Description

Right now the headers-first download is only based off of the checkpoints and is thus limited to the last checkpoint.
The newly implemented headers-first download will always download headers-first and will validate them to see if the headers connect and have proper proof of work.

Then the block download will be based off of the verified headers. This now eliminates any potential downloading of txs or orphan blocks during ibd. It also makes future parallel block download much better as a parallel block download can only happen for blocks we already have headers for.

~~It's not yet put into the code yet but this allows the node to also receive block headers instead of invs during block propagation.~~ I'll do this in a follow up later.

Steps to Test

Perform ibd on any of the networks and see that the node syncs up fine.
Check that the newly added function tests are correct.

Pull Request Checklist

Testing

Your PR passes all CI checks.
Tests covering the positive and negative (error paths) are included.
Bug fixes contain tests triggering the bug to prevent regressions.

Code Style and Documentation

The change is not insubstantial. Typo fixes are not accepted to fight bot spam.
The change obeys the Code Documentation and Commenting guidelines, and lines wrap at 80.
Commits follow the Ideal Git Commit Structure.
Any new logging statements use an appropriate subsystem and logging level.

📝 Please see our Contribution Guidelines for further guidance.

We add a chainview of bestHeaders so that we'll be able to keep track of headers separately from the bestChain. This is needed as we're getting headers for new block annoucements instead of invs.

Since we may now have blockNodes with just the block header stored without the data, we add a new status to account for this.

coveralls · 2025-09-22T16:22:33Z

Pull Request Test Coverage Report for Build 19373443826

Details

139 of 254 (54.72%) changed or added relevant lines in 6 files are covered.
65 unchanged lines in 6 files lost coverage.
Overall coverage increased (+0.5%) to 55.433%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
blockchain/chainio.go	1	3	33.33%
blockchain/chain.go	11	22	50.0%
netsync/manager.go	40	142	28.17%

Files with Coverage Reduction	New Missed Lines	%
mempool/mempool.go	1	66.56%
btcutil/gcs/gcs.go	3	81.25%
database/ffldb/blockio.go	4	88.81%
rpcclient/infrastructure.go	4	47.8%
netsync/manager.go	18	7.05%
blockchain/chainio.go	35	69.75%

Totals
Change from base Build 19121431262:	0.5%
Covered Lines:	31425
Relevant Lines:	56690

💛 - Coveralls

maybeAcceptHeader performs checks to accept block headers into the header chain. This function allows for a true headers-first download where we only accpet block headers for downloading new blocks.

saubyk · 2025-10-21T16:51:07Z

cc: @gijswijs for review

saubyk · 2025-10-21T16:51:50Z

cc: @mohamedawnallah for review

Roasbeef

Great set of changes! I found the commit structure very easy to follow as well.

Have performed an initial review pass, but will start to run some active tests on one of my nodes to get a better feel for the changes.

Roasbeef · 2025-10-31T23:26:59Z

blockchain/blockindex.go

 }

+// HaveHeader returns whether the header data is stored in the database.
+func (status blockStatus) HaveHeader() bool {


Isn't this just the inverse of HaveData?

Then you can't communicate not having headers for a given blockhash so I think having this is better.

You can use !HaveData() to check if you have the headers but then what do you do when you want to check if you have a block header.

blockchain/process_test.go

netsync/manager.go

Roasbeef · 2025-10-31T23:50:41Z

netsync/manager.go

+	// If we're in the initial block download mode, check if we have
+	// peers that we can download headers from.
+	_, bestHeaderHeight := sm.chain.BestHeader()
+	higherHeaderPeers := sm.fetchHigherPeers(bestHeaderHeight)


Even though many years have passed at this point, we should retain the logic that we removed to filter out non-segwit peers once segwit is active.

In sm.fetchHigherPeers(), we filter out peers that aren't sync candidates. The peers can only be sync candidates if they're a segwit activated peer.

This check is done in isSyncCandidate()

netsync/manager.go

ProcessBlockHeader performs chain selection and context-free & contextual validation for the given block header. The function allows a header-first downloading of blocks even without checkpoints.

On flushes to the database, we check that the blockNodes we have for the download block headers are not flushed to the disk unless the block data is stored as well for backwards compatibility. With older btcd clients, they rely on the fact that the blockNode is present to check if the block data is also present. Since we now store blockNodes for just the block headers, this is no longer true. Because of this, we don't flush the blockNodes if there's no accompanying block data for it. This results in the downloading and verifying the headers again if the node were to restart but since the header data is small and the verification is quick, it's not a big downside.

In block processing and block downloading, HaveBlock is used to check if the block data already exists. It was ok to just check for the existence of the blockNode but we now we also need to check if the data exists as the blockNode may be present for just the block header.

The added exported methods on BlockChain provide access to the block header tip like fetching block hashes and heights from the headers chain.

checkHeadersList takes in a blockhash and returns if it's a checkpointed block and the correct behavior flags for the verification of the block.

fetchHigherPeers provides a convenient function to get peers that are sync candidates and are at a higher advertised height than the passed in height.

isInIBDMode returns if the SyncManager needs to download blocks and sync to the latest chain tip. It determines if it's in ibd mode by checking if the blockchain thinks we're current and if we don't have peers that are at higher advertised blocks.

fetchHeaders picks a random peer at a higher advertised block and requests headers from them.

Instead of the old headerList based header processing, we make use of the new ProcessBlockHeader function.

headers Since we now utilize ProcessBlockHeader, we change the fetchHeaderBlocks to be based off of the block index instead of the headerList in SyncManager.

ince we now utilize ProcessBlockHeaders, we change the startSync function to utilize the block index for downloading blocks/headers instead of using the headerList in SyncManager.

handleBlockMsg is updated to not be based off of the headerList and sm.nextCheckpoint when doing operations related to checkpoints and headers.

Since we no longer utilize the headerList for doing headers-first download and checkpoint tracking, we remove related code.

ibdMode is a more fitting name than headersFirstMode since all blocks are downloaded headers-first during the initial block download.

kcalvinalvin · 2025-11-14T18:08:04Z

Addressed all the review comments by roasbeef

Roasbeef · 2025-12-18T02:37:57Z

Tested in the wild, and does what it says on the tin:

Details

``` 2025-12-17 18:32:10.885 [INF] SYNC: Downloading headers for blocks 136001 to 2401568 from peer PEER_A:18333 2025-12-17 18:32:10.885 [INF] SYNC: Lost peer PEER_B:18333 (outbound) 2025-12-17 18:32:16.309 [INF] SYNC: New valid peer PEER_C:18333 (outbound) (/Satoshi:27.1.0/) 2025-12-17 18:32:40.885 [INF] SYNC: Downloading headers for blocks 136001 to 4807558 from peer PEER_D:18333 2025-12-17 18:32:40.885 [INF] SYNC: Lost peer PEER_A:18333 (outbound) 2025-12-17 18:32:41.044 [INF] SYNC: New valid peer PEER_E:18333 (outbound) (/Satoshi:28.1.0/) 2025-12-17 18:32:41.135 [INF] SYNC: New valid peer PEER_F:18333 (outbound) (/Satoshi:27.0.0/) 2025-12-17 18:32:46.438 [INF] CHAN: Verified checkpoint at height 200000/block 0000000000287bffd321963ef05feab753ebe274e1d78b2fd4e2bfe9ad3aa6f2 2025-12-17 18:32:51.245 [INF] SYNC: New valid peer PEER_G:18333 (outbound) (/Satoshi:28.1.0/) 2025-12-17 18:32:55.171 [INF] CHAN: Verified checkpoint at height 300001/block 0000000000004829474748f3d1bc8fcf893c88be255e6d7f571c548aff57abf4 2025-12-17 18:33:03.103 [INF] CHAN: Verified checkpoint at height 400002/block 0000000005e2c73b8ecb82ae2dbc2e8274614ebad7172b53528aba7501f5a089 2025-12-17 18:33:10.885 [INF] SYNC: Downloading headers for blocks 490001 to 4807558 from peer PEER_G:18333 2025-12-17 18:33:10.885 [INF] SYNC: Lost peer PEER_D:18333 (outbound) 2025-12-17 18:33:13.135 [INF] CHAN: Verified checkpoint at height 500011/block 00000000000929f63977fbac92ff570a9bd9e7715401ee96f2848f7b07750b02 2025-12-17 18:33:25.990 [INF] CHAN: Verified checkpoint at height 600002/block 000000000001f471389afd6ee94dcace5ccc44adc18e8bff402443f034b07240 2025-12-17 18:33:38.331 [INF] CHAN: Verified checkpoint at height 700000/block 000000000000406178b12a4dea3b27e13b3c4fe4510994fd667d7c1e6a3f4dc1 2025-12-17 18:33:40.884 [INF] SYNC: Downloading headers for blocks 718001 to 4807558 from peer PEER_E:18333 2025-12-17 18:33:40.888 [INF] SYNC: Lost peer PEER_G:18333 (outbound) 2025-12-17 18:33:46.254 [INF] SYNC: New valid peer PEER_H:18333 (outbound) (/Satoshi:25.1.0/) 2025-12-17 18:33:50.353 [INF] CHAN: Verified checkpoint at height 800010/block 000000000017ed35296433190b6829db01e657d80631d43f5983fa403bfdb4c1 2025-12-17 18:34:01.433 [INF] CHAN: Verified checkpoint at height 900000/block 0000000000356f8d8924556e765b7a94aaebc6b5c8685dcfa2b1ee8b41acd89b 2025-12-17 18:34:10.884 [INF] SYNC: Downloading headers for blocks 986001 to 4807558 from peer PEER_E:18333 2025-12-17 18:34:10.884 [INF] SYNC: Lost peer PEER_E:18333 (outbound) 2025-12-17 18:34:10.884 [INF] SYNC: Downloading headers for blocks 986001 to 4807558 from peer PEER_H:18333 2025-12-17 18:34:11.269 [INF] SYNC: New valid peer PEER_I:18333 (outbound) (/Satoshi:25.1.0/) 2025-12-17 18:34:11.270 [INF] SYNC: New valid peer PEER_J:18333 (outbound) (/Satoshi:0.20.1/) 2025-12-17 18:34:13.213 [INF] CHAN: Verified checkpoint at height 1000007/block 00000000001ccb893d8a1f25b70ad173ce955e5f50124261bbbc50379a612ddf 2025-12-17 18:34:16.312 [INF] SYNC: New valid peer PEER_K:18333 (outbound) (/Satoshi:27.0.0/) 2025-12-17 18:34:25.081 [INF] CHAN: Verified checkpoint at height 1100007/block 00000000000abc7b2cd18768ab3dee20857326a818d1946ed6796f42d66dd1e8 ```

One thing I noticed though is that the peers may D/C us along the way, so we rotate through a few peers to fetch all the headers. This is testnet3 , there's quite a lot of headers (over 4 million). Are we hitting something like default per-peer upload limit for bitcoind?

IIUC this is a stepping stone for multi-peer header download, so perhaps that'll be sorted out as we'll fetch blocks of headers from many peers in parallel.

Roasbeef

LGTM 🗑️

kcalvinalvin added 2 commits September 23, 2025 00:55

blockchain: add bestHeader to BlockChain

b66e2f8

We add a chainview of bestHeaders so that we'll be able to keep track of headers separately from the bestChain. This is needed as we're getting headers for new block annoucements instead of invs.

blockchain: add statusHeaderStored for blockNode status

05ac50c

Since we may now have blockNodes with just the block header stored without the data, we add a new status to account for this.

saubyk assigned kcalvinalvin Sep 23, 2025

kcalvinalvin force-pushed the 2025-09-16-new-parallel-block-downloads branch 3 times, most recently from 2308a1e to 3519720 Compare September 23, 2025 09:43

blockchain: add maybeAcceptHeader

11661ac

maybeAcceptHeader performs checks to accept block headers into the header chain. This function allows for a true headers-first download where we only accpet block headers for downloading new blocks.

kcalvinalvin force-pushed the 2025-09-16-new-parallel-block-downloads branch 5 times, most recently from 242fa3b to e6fadc9 Compare September 25, 2025 07:25

kcalvinalvin marked this pull request as ready for review September 25, 2025 13:17

Roasbeef reviewed Oct 31, 2025

View reviewed changes

kcalvinalvin added 13 commits November 14, 2025 18:47

blockchain: add ProcessBlockHeader

d875a8d

ProcessBlockHeader performs chain selection and context-free & contextual validation for the given block header. The function allows a header-first downloading of blocks even without checkpoints.

blockchain: add exports methods based on block header tip

e2b720f

The added exported methods on BlockChain provide access to the block header tip like fetching block hashes and heights from the headers chain.

netsync: add checkHeadersList

02960f6

checkHeadersList takes in a blockhash and returns if it's a checkpointed block and the correct behavior flags for the verification of the block.

netsync: add fetchHigherPeers

b1a6b59

fetchHigherPeers provides a convenient function to get peers that are sync candidates and are at a higher advertised height than the passed in height.

netsync: add isInIBDMode

c38e741

isInIBDMode returns if the SyncManager needs to download blocks and sync to the latest chain tip. It determines if it's in ibd mode by checking if the blockchain thinks we're current and if we don't have peers that are at higher advertised blocks.

netsync: add fetchHeaders

b1da380

fetchHeaders picks a random peer at a higher advertised block and requests headers from them.

netsync: use ProcessBlockHeader in handleBlockHeader

f48fb15

Instead of the old headerList based header processing, we make use of the new ProcessBlockHeader function.

netsync: change fetchHeaderBlocks to be based on the processed block

ab25495

headers Since we now utilize ProcessBlockHeader, we change the fetchHeaderBlocks to be based off of the block index instead of the headerList in SyncManager.

netsync: change startSync to be based off of processed headers

e301a55

ince we now utilize ProcessBlockHeaders, we change the startSync function to utilize the block index for downloading blocks/headers instead of using the headerList in SyncManager.

netsync: update handleBlockMsg

7d0b28e

handleBlockMsg is updated to not be based off of the headerList and sm.nextCheckpoint when doing operations related to checkpoints and headers.

netsync: remove unused headerList and checkpoint code

a239b97

Since we no longer utilize the headerList for doing headers-first download and checkpoint tracking, we remove related code.

netsync: rename headersFirstMode to ibdMode

2f4c749

ibdMode is a more fitting name than headersFirstMode since all blocks are downloaded headers-first during the initial block download.

kcalvinalvin force-pushed the 2025-09-16-new-parallel-block-downloads branch from e6fadc9 to 2f4c749 Compare November 14, 2025 18:07

saubyk requested a review from Roasbeef November 14, 2025 18:39

Roasbeef approved these changes Dec 18, 2025

View reviewed changes

blockchain, netsync: implement a complete headers-first download during ibd #2428

Are you sure you want to change the base?

blockchain, netsync: implement a complete headers-first download during ibd #2428

Conversation

kcalvinalvin commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Change Description

Steps to Test

Pull Request Checklist

Testing

Code Style and Documentation

Uh oh!

coveralls commented Sep 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 19373443826

Details

💛 - Coveralls

Uh oh!

saubyk commented Oct 21, 2025

Uh oh!

saubyk commented Oct 21, 2025

Uh oh!

Roasbeef left a comment

Choose a reason for hiding this comment

Uh oh!

Roasbeef Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Roasbeef Oct 31, 2025

Choose a reason for hiding this comment

Uh oh!

kcalvinalvin Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

kcalvinalvin commented Nov 14, 2025

Uh oh!

Roasbeef commented Dec 18, 2025

Uh oh!

Roasbeef left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kcalvinalvin commented Sep 22, 2025 •

edited

Loading

coveralls commented Sep 22, 2025 •

edited

Loading