[RFC] Use multi-arm bandits for selecting which sequence of transactions to mutate #1438
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
This branch contains some code to experiment with multi-armed bandit strategies for fuzzing smart contracts in Echidna. The code is not working yet, and it will require some changes in hbandit (which is not included in this PR yet)
The idea is to treat each sequence of transactions in the corpus as an arm, and update its reward based on whether mutations of that sequence lead to new coverage. This allows the fuzzer to adaptively prioritize sequences that are empirically more likely to yield interesting behaviors, rather than relying solely on recency-based heuristics.
Key points:
Overall, this PR aims to set the foundation for adaptive fuzzing using multi-armed bandits in Echidna, enabling systematic exploration of whether learned prioritization of sequences can outperform simple recency-based heuristics.