Hello Cardano Community,
I would like to provide an important update to my previously published findings on the potential weaknesses in the entropy distribution of BIP-39 mnemonic phrases. You can find the original post here for reference:
Original Report on Cardano Forum
Expanded Dataset and New Observations
Since the initial report, I have significantly expanded the dataset and refined the analysis process, leading to new insights that further highlight statistical anomalies worth community attention:
- Increased Sample Size: The number of valid wallets generated using 24-word mnemonic phrases has now exceeded 87,000.
(Note: All 12-word phrases have been excluded in this phase to focus purely on 24-word patterns.)
- Unexpected 13-Word Valid Phrases:
During the generation process for 24-word mnemonics, several 13-word phrases were observed to pass validation and generate valid wallets.
This is highly unusual and may point to implementation inconsistencies in some libraries or platforms. - Statistical Repetition (Word Frequency):
- First-word position: Some words now appear over 970 times.
- Middle positions: Repetitions range between 530 and 709 occurrences.
- Last-word position: Stable but still notably high, around 468 repetitions.These figures represent a sharp increase compared to the earlier findings, where no word exceeded 100 appearances across any position.
Interpretation
To clarify:
- I am not questioning the BIP-39 specification itself.
- Nor am I pointing fingers at specific implementations.
However, these new statistical outliers could hint at:
- Insufficient entropy in some mnemonic generators.
- Implementation bugs allowing invalid phrases (e.g., 13-word acceptance).
- A potential reduction in brute-force complexity under certain conditions.
Final Notes
In the near future, I will be publishing a new update containing more precise statistics and sensitive insights. While these findings may not be relevant to everyone, they are of great significance to me as a researcher deeply focused on the technical patterns and implications within this domain. The upcoming data will require heightened attention and analytical focus, given its complex nature and depth.
I would like to express my sincere gratitude to the Cardano community for providing such an open and respectful platform to share my research and engage in meaningful dialogue. This kind of academic and technical openness reflects a mature and intellectually rich ecosystem, making Cardano a truly welcoming environment for researchers and security professionals alike.
Important Note:
The purpose of this research is not to promote or encourage any form of unlawful or unethical behavior. It is purely conducted in the spirit of academic research, aimed at creating a safer Web3 environment and securing user assets to the highest degree possible.
Let’s move beyond destructive criticism and hostile replies. Whether you’re a researcher, developer, or blockchain enthusiast, we all share the same responsibility:
To embody the spirit of progress, innovation, and mutual respect.
With appreciation,
Best regards,
Okba [GUIAR OQBA]
Security Researcher
techokba@gmail.com