AlmostRandom generates random number from a few entropy sources within the Arduino. Unlike traditional methods that rely on hardware noises or pseudo-random algorithms, AlmostRandom combines multiple not-so-ideal sources to produce random numbers that are hard to predict and chaotic, suitable for casual, recreational, and non-critical applications.
This library harnesses various sources of randomness, including the parity of analogRead() values, the contents of RAM, the values of internal timers and micros() and millis().
AlmostRandom serves as a compelling alternative to the built-in random() function, offering numbers closer to true randomness and increased unpredictability, but takes a longer time to generate (1-2 milliseconds each on traditional AVR Arduino).
Please note that AlmostRandom is designed for non-critical applications and may not be suitable for tasks requiring high levels of cryptographic strength or security.
- Updates
- Disclaimer
- Compatible Hardware
- Random Numbers
- Public Functions
- Extra: Setup Photos
- Extra: Manual Setup for Ramdom and Ranclock
- License
- 0.21.0
- Ready to go.
- 0.10.0
- Finished implementing most methods.
- 0.6.0
- Implemented Ranalog methods.
This library is meant for educational and recreational use only, do not use the random numbers generated by this library for critical and high-stake use cases like encryption and important decisions.
In this library and documentations, there may be inaccurate or incomplete information regarding random numbers, like their nature, generation, usage and testing methods. I seek your forgiveness and understanding as I am not an expert in random numbers or microcontrollers.
This library primarily targets and is compatible with:
Board | MCU | Core |
---|---|---|
Arduino Uno R3 | ATmega328P | Official Arduino |
Arduino Leonardo | ATmega32u4 | Official Arduino |
Arduino Mega | ATmega2560 | Official Arduino |
My DIY Dev Board | ATtiny3224/ 3226/3227 |
megaTinyCore |
ESP32-S3-DevKitC* | ESP32-S3 | Official Arduino |
Feel free to try other MCUs and let me know if this library works.
*ESP32-S3, while supported, is not recommended to be used with this library as it has its own dedicated random number generator. It is probably a lot faster and a lot more random than my library. The two default timers chosen for Ranclock seemed not to be in use, and thus you have to turn them on manually using:
uint32_t* G0T0_CTRL = (uint32_t*)0x6001F000;
uint32_t* G1T0_CTRL = (uint32_t*)0x60020000;
*G0T0_CTRL |= (1<<31);
*G1T0_CTRL |= (1<<31);
This should be done in setup()
. Feel free to play around with the prescalers should you wish to.
MCU | ramStart |
ramEnd |
timerA CountAddress |
timerB CountAddress |
analogRead() Pin |
---|---|---|---|---|---|
ATmega328P | 0x0100 | 0x08FF | 0x46 (Timer 0) |
0x84 (Timer 1) |
A0 |
ATmega32u4 | 0x0100 | 0x0AFF | 0x46 (Timer 0) |
0x84 (Timer 1) |
A0 |
ATmega2560 | 0x0200 | 0x21FF | 0x46 (Timer 0) |
0x84 (Timer 1) |
A0 |
ATtiny3224/ 3226/3227 |
0x3400 | 0x3FFF | 0x0A20 (Timer A0) |
0x0A9A (Timer B1) |
0 |
ESP32-S3 | 0x3FC88000 | 0x3FCFFFFF | 0x6001F004 (Group 0 Timer 0, latch on 0x6001F00C) |
0x60020004 (Group 1 Timer 0, latch on 0x6002000C) |
1 |
There are generally two kinds of random numbers: true random or pseudo-random. Some examples of true random sources include radioactive decay, atmospheric electrical noises and some quantum phenomena. They are not the easiest to harvest and processed into random numbers in a typical home setting.
As such, for non-critical tasks, pseudo-random numbers are used. They are often generated from a formula using a seed as a starting condition.
An example of such formula is the Middle-Square method. If 123
is used as a seed and squared, the answer will be 15129
. We can then extract the middle three digits of 15129
to use as our random number, which will be 512
. 512
can then be used as a seed for the next random number.
One commonly used seed is the Unix Time, or the number of seconds passed since 1 January 1970, which has the advantage of being unique every second.
A major problem with pseudo-random numbers is that if you knows the seed and the formula, you can predict the outcome. Poor formula may also result in pseudo-random numbers looping in a sequence.
There are also methods that are in-between. A coin toss may seem random but if you know all the initial conditions for the toss, you will be able to predict the outcome. However due to how unpredictable and chaotic a casual coin toss is, it is practically random for a day-to-day non-critical use case. This library aims to deliver such numbers between true and pseudo-random.
Instead of generating random numbers from one source, this library will be using various sources of not-so-ideal random numbers, throw them into a pool of entropy to get a better one out of them.
Evaluating the quality of random numbers is a subject requiring vast mathematical knowledge which I do not possess. I will try to keep things within my capability by looking at the frequency distribution graph and standard deviation.
A frequency distribution graph shows how often a number is being generated, which gives us a clue on how each possible value (0-255, one byte, in our case) is being represented. Spikes will mean that value is over represented while dip means under represented.
Standard deviation measures the variation of a reading about its mean average, with lower number suggesting a smaller spread or better consistency.
I assume that the fairest random number generator will generated all possible values with equal chances as the number of readings approach infinity.
First, I looked at some results from an existing random number generator. In my case, I chose the famous random.org. I generated three sets of 10,000 random bytes:
The graph looks well distributed and the average standard deviation is 74.1. The height of the graph fits within 20 to 60 on the Y-axis (frequency). We will use this as a sign of having good quality random numbers.
On the other end, I wanted to look at how bad random numbers look like. I generated three sets of 10,000 low quality random numbers.
The first set has
- 100 preset numbers, followed by
- 100 random numbers from random.org, followed by
- The same 100 preset numbers, followed by
- Freshly generated 100 random numbers from random.org
- So on until there are 10,000 numbers
The second set has a 1,000 preset/random numbers in similar fashion as the first.
The third set as 1,000 preset numbers looping till the end of 10,000 total numbers with no random numbers.
Set 1 has obvious spikes as the preset numbers occur far more often than the random ones. Set 3 has obvious dips as some numbers are not represented in the preset and due to the looping nature, are not represented at all throughout.
Set 2 is the least obvious among the three, thanks to the repeating preset being larger in size, while "camouflaged" among true random numbers.
I noticed that the standard deviation does not tell us too much. The good random numbers from random.org netted us 74.1 while the horrible Set 1 and Set 3 is 76.4 and 74.7 respectively, not far away. I think it is still notable, as extreme standard deviation can still be a sign that something is horribly wrong with the random numbers.
Lastly, this are the results from using the built-in random()
function in Arduino:
As expected, the same seed yields the same result over three different boards. To be fair, the quality of the pseudo-random numbers appears to be decent, except for the fact they are very predictable.
The reading from analogRead has been a staple as a seed for the Arduino built-in pseudo-random number generator. It usually goes like this:
randomSeed( analogRead(A0) );
long randNumber = random(10, 10000);
Serial.println(randNumber);
The idea is that a floating (unconnected) pin will measure surround electromagnetic interference (EMI) to produce a random reading. However, calling analogRead()
on the ATtint3224 10,000 times and plotting the readings seems to tell a different story:
It seems like less than half of the possible values from 0 to 1023 are produced at least once, and there is a discernible 'U' shape with spikes at the end. This may not even make for a good seed. The standard deviations are more than 100, making them worse than the poor quality random numbers I designed in Evaluating Random Numbers.
Out of curiosity, I decided to look at the parity of the readings. It means seeing how many of the readings are odd and how many are even:
Throughout three sets of 10,000 readings, the ratios of odd to even number come close to 50:50. This effectively makes the parity of analogRead()
on the ATtiny3224 a coin toss. A bit can be a one or zero, so if the microcontroller unit (MCU) "tosses" this "coin" eight times, it will be able to produce one random byte (eight bits in one byte). I dub this the "Ranalog" method.
I then repeat the analogRead experiment with the Arduino Uno R3 and Arduino Leonardo:
Arduino Uno R3 | Arduino Leonardo | |
---|---|---|
analogRead Frequency | ||
Parity Distribution |
Those look worse than the readings from the ATtiny3224 and there are way too many odd numbers and zeros for the case of Uno. The parity distribution of Leonardo looks decent. Regardless, my aim is to combine various sources of not-so-ideal random numbers, hence I implemented the Ranalog method to take a look:
ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|
ATtiny3224 has the best results, while surprisingly Leonardo has the worst, despite having good parity ratio.
I suppose that extending the pin with a wire will make it into an antenna, and hopefully this antenna will be able to pick up EMI more readily, as such, I attached a 6.5 and 10.0cm antenna to the development boards. Since we can get each bit of the byte from a different analog pin,
I also tested out multiple antennae on as many analog ports as the development board allow. When there are not enough readily available analog pins, I will cycle them from the first one.
This is what I get:
Antenna(e) | ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|---|
6.5cm | |||
10.0cm | |||
Multi |
The ATtiny3224 starts off with a good distribution but did not seem to get much better with longer or more antennae, which I suspect is due to poor grounding on my DIY development board, since the board is pretty small after all.
The Uno starts off pretty badly but gets better with longer and more antennae.
Meanwhile the Leonardo seems put out poor random numbers with too many zeros across the board, with little improvement when the there are longer and more antennae, which may suggest good grounding and shielding.
Supposedly, uninitialised random access memory (RAM) are not guaranteed to be all zeros or all ones, and an Arduino does not seems to initialise its RAM other than those parts that are being actively used (Not sure for the ESP32S3 though). Also, the values of the RAM are constantly changing as the program is running, hence, it is possible to use RAM values as a random source.
On the internet, there are some criticisms of using RAM for random numbers, mainly:
- Uninitialised or not, RAM are not that random and can produce looping sequences
- It is unsafe to meddle with contents of the RAM
I dubbed this method "Ramdom" and what happens is the MCU reads the contents of its RAM in bytes, and XOR-ing every one of them:
ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|
While the Uno and Leonardo seem to display some decent results, the ATtiny3224 seems to have lots of spikes. Again, since I am going to combine an entropy pool of not-so-ideal random numbers, I am not going to be too concerned about those spikes here.
The MCUs did not crash during any of these tests, it seems like it is safe to read (not write) unknown parts of the RAM.
When I ask, chatGPT gives me the idea that the differences between two timers from the same clock source can produce random numbers due to jitters and software-related inaccuracies. As we know, we should never trust chatGPT without verification, thus I decided to XOR two bytes, one from each timer of the Arduino and see if the results are random. I chose two timers that are normally already active so you will not need to manually enable them. I dub this "Ranclock".
ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|
ATtiny3224 seems to produce the best results of all, though it should be noted all three sets seem to produce graphs of similar shape, which might mean some kind of sequence looping. Meanwhile the results for Uno and Leonardo are filled with spikes and dips. Similar to the other methods, I am adding Ranclock to the entropy pool.
Human input can be one source of random number, since we humans are unlikely to perform an action precisely to the milliseconds or microseconds. Thus, we can use millis()
and micros()
as a source of random number tied to the moment when a user interface with the machine, usually by pressing a button.
I dub this method "Rainput". Since millis()
and micros()
return the time in four bytes unsigned long, I can XOR these eight bytes (four from each) to get a random byte.
Initially it seems like a repetition of Ranclock, since both deal with timers and clock, but the resulting frequency distribution graphs looks quit different:
ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|
These random numbers look pretty good to be added to the pool, though it is hard to tell if there are any looping sequences, especially since all three sets from Leonardo exhibit similar shapes.
Taking one random byte from each of the four methods, we can once again XOR them to give us the final random byte:
Setup | ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|---|
Minimal | |||
Maximal |
The minimal setup uses one pin with no antenna for analogRead, while the maximal one uses all possible analog pins on the development board with antennae of various length. All of them give decent results and it did not seem to matter if antennae are used or not.
Here are the overview for the other boards:
Arduino Mega | ESP32S3 Dev Kit C |
---|---|
I produced five sets of 10,000 random numbers to get the average time needed to generate one of them. All timings in milliseconds.
Method | ATtiny3224 | Arduino Uno R3 | Arduino Leonardo |
---|---|---|---|
Ranalog | 0.12440 | 0.89600 | 0.89632 |
Ramdom | 1.23216 | 1.03100 | 1.29536 |
Ranclock | 0.00064 | 0.00084 | 0.00084 |
Rainput | 0.04766 | 0.05660 | 0.05684 |
AlmostRandom | 1.41178 | 1.99264 | 2.25728 |
Tests are not done on the ESP32-S3 and Mega, but due to their large RAM, I suspect it will take much longer than these. Feel free to adjust the portion of the RAM to be used for AlmostRandom.
Obviously, more research is needed to prove the quality of these random numbers, but I would say these should be good enough for most non-critical, unimportant, causal, recreational activities requiring random numbers. I would think this would be good enough for lotteries, lucky draws or to seed pseudo-random numbers.
Note: When passing an address to a function, be sure to cast it as a pointer first.
For example, if you want to pass 0x100
as a parameter to void function(byte* address)
, write it as function((byte*)0x100)
.
Get a new random 8-bit byte, which is a number between 0 to 255. This uses all enabled methods (Ranalog, Ramdom, Ranclock or Rainput).
Get the previously generated random byte.
The run code denotes which random methods were used when getRandomByte()
was last called.
Bit | |||||||
---|---|---|---|---|---|---|---|
7 | 6 | 5 | 4 | 3 | 2 | 1 | 0 |
Not used. | Not used. | Not used. | Not used. | 1 if Ranalog was used, else 0. | 1 if Ramdom was used, else 0. | 1 if Ranclock was used, else 0. | 1 if Rainput was used, else 0. |
You may use a bitmask to check if one particular random methods were used or not.
Get a random signed 16-bit integer (-32,768 to 32,767), which is int
for most classic AVR Arduinos. This is accomplished by concatenating multiple random bytes from getRandomByte()
.
Get a random unsigned 16-bit integer (0 to 65,535), which is unsigned int
for most classic AVR Arduinos. This is accomplished by concatenating multiple random bytes from getRandomByte()
.
Get a random signed 32-bit integer (-2,147,483,648 to 2,147,483,647), which is long
for most classic AVR Arduinos. This is accomplished by concatenating multiple random bytes from getRandomByte()
.
Get a random unsigned 32-bit integer (0 to 4,294,967,295), which is unsigned long
for most classic AVR Arduinos. This is accomplished by concatenating multiple random bytes from getRandomByte()
.
Get a random float
, which is 32-bits for most classic AVR Arduinos. You can get between -32,768 to 32,767 and inf
. This works by using getRandomInt()
twice, casting the results to float
and dividing them. The value stored in float
may not be exact.
Enables the Ranalog method if myEnable
is true
and disables if it is false
. This is enabled by default.
Returns true
is the Ranalog method is enabled, else false
.
Set myAnalogPin
to be used with Ranalog. All eight bits will be generated from this pin.
Set an array of pins, myAnalogPins[]
, to be used with Ranalog. This array needs to have eight elements. Each element corresponds to one pin where one of the bits will be generated.
For example myAnalogPins[0]
could contain Pin A0, where it will be analogRead()
to determine the 0th random bit of the byte.
You can choose to repeat pins if there are not enough readily accessible pins for that supports analogRead()
.
If myEvenIsZero
is true
, an even analogRead()
result will be a 0
. Else if myEvenIsZero
is false
, it will be 1
.
Returns true
if an even analogRead()
result produces a 0
in the random byte, else false
.
Get a random byte using just the Ranalog method.
Get the last byte generated using the Ranalog method. This includes generation from getRandomByte()
and methods that relies on getRandomByte()
, if Ranalog was enabled.
Enables the Ramdom method if myEnable
is true
and disables if it is false
.
Returns true
is the Ramdom method is enabled, else false
.
Set the starting address of RAM to read using myRamStart
and then ending (inclusive) address with myRamEnd
.
Get a random byte using just the Ramdom method.
Get the last byte generated using the Ramdom method. This includes generation from getRandomByte()
and methods that relies on getRandomByte()
, if Ramdom was enabled.
Enables the Ranclock method if myEnable
is true
and disables if it is false
.
Returns true
is the Ranclock method is enabled, else false
.
Exclusive method for ESP32S3. myTimerACountAddress
sets the address to get the count from the first timer and myTimerBCountAddress
to get the count from the second timer. You need to enable these timers manually, see here.
Exclusive method for ESP32S3. myTimerALatchAddress
corresponds to the update address for myTimerACountAddress
for the readings to be latched and read while myTimerBLatchAddress
is for myTimerBCountAddress
.
myTimerACountAddress
sets the address to get the count from the first timer and myTimerBCountAddress
to get the count from the second timer. Most of the time, the timers should already be enabled on classic AVR Arduinos.
Get a random byte using just the Ranclock method.
Get the last byte generated using the Ranclock method. This includes generation from getRandomByte()
and methods that relies on getRandomByte()
, if Ranclock was enabled.
Enables the Rainput method if myEnable
is true
and disables if it is false
.
Returns true
is the Rainput method is enabled, else false
.
Get a random byte using just the Rainput method.
Get the last byte generated using the Rainput method. This includes generation from getRandomByte()
and methods that relies on getRandomByte()
, if Rainput was enabled.
This static function returns a C-string of the the binary representation of myLong
. It will be of bitCount
length and zero padded. This can be useful to visualise and inspect the random numbers.
On classic AVR Arduinos, myLong
will be 32-bits (0 to 4,294,967,295) while bitCount
will be 8-bits (0-255). Casting signed integers to unsigned long
should also work.
For example, toBin(25, 8)
will return "00011001". toBin(100, 4)
will return "0100".
Since this is a static method, you can call AlmostRandom::toBin(_unsigned long_ myLong, _byte_ bitCount)
.
A helper library is also included to help sort the generated random numbers if needed. I used to use quicksort but I realised the AVR MCUs' Serial become unstable after a while, perhaps due to memory issues. Thus I chose to use Insertion Sort since it has a relatively small memory footprint and supposedly fast for small datasets.
To use, kindly #include<InsertionSort.h>
. Also remember to include the datatype in the diamond operator when calling these static functions, for example InsertionSort<byte>::sort(myArray, 10)
.
Sort the array arr
in ascending order, you must make sure the function gets the correct size
of the array that is going to get sorted.
Each element in the array will be printed to Serial with a space in between. Note that this uses Arduino's Serial.print()
, and thus not every datatype can be printed.
Same as printArray(T arr[], unsigned int size)
, but prints a new line after the last space.
If the you are using a MCU that is not detected by this library, you need to set up Ramdom and Ranclock manually using the relevant setter functions. The start and end of the RAM as well as functionalities relating to timers are often referenced by their memory addresses, and those are what will be used for the setups.
Case in point, there is a memory address to the first byte and one for the last byte of the RAM. There are also memory addresses to the registers of the peripherals (like timers), which are special devoted memory to store their configurations and data.
Memory addresses are often written in hexadecimal, for example 0x0100
. 0x
denotes that 0100
is in hexadecimal instead of decimal. Hexadecimal numbers goes like this: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, F, 10, 11, 12 ... 19, 1A, 1B ... 1F, 20, 21 ... They are of base 16 instead of the regular base 10 decimal numbers.
If you need help with hexadecimal arithmetic, you can use the Programmer mode in the Windows Calculator program. Remember to select "HEX" when doing calculation in hexadecimals.
To know exactly which memory addresses are needed, you can refer to the datasheet of the MCU used, which can often be downloaded from the manufacturer’s website.
There are usually two ways the addresses are represented, I have no idea if there exist a terminology for them but I call them:
- Direct addresses
- Offset addresses
Here we have some examples to demonstrate.
This is the datasheet for the ATMega32u4. Under "AVR Memories", you can see the the different parts of the memory, and right there we can see the start and end address for the SRAM:
If we scroll down the table of contents, we can locate the section regarding one of the timers, which you can read for more information:
In the subsection of "Timer/Counter Register - TCNT0", we find the register that stores the count of the timer, however no address is given:
There is usually a page which shows the summary of all the registers in the MCU, and with some searching, we can find it along with address of TCNT0. In this case we use the address in the brackets, which is 0x46
, and I have no idea the difference between the two addresses:
This is the direct address of TCNT0, you can read from this address as-is. Now we look at the same timer from ATMega328PB, this time you will realise the address is written directly on the page documenting the timer register:
We can double check the register summary, which gives us the same 0x46
result:
Looking at the datasheet of ATtiny3224, we see that the address for Timer/Counter Type A is 0x20
:
However, if we look at the register summary, which is called "Peripheral Address Map" in this case, we see that TCA0 (Timer/Counter Type A Instance 0) has a base address of 0x0A00. We also note that there are two instances of Timer/Control Type B with different addresses:
In this case for peripherals in groups, the addresses are often given as an offset, where actual address = base address + offset
. The offset is usually the same for all instances of the same peripheral group while each of them has a different base address.
In this case, the count address for TCA0 will be 0x0A00
+ 0x20
= 0x0A20
.
To recap:
- RAM and timer are controlled using memory addresses
- Memory addresses can be found in datasheets
- Datasheet may provide you with the direct memory addresses
- Datasheet may also present addresses as offsets from a base address
- Check the register summary, which can be called other names, usually involving "register", "address" or "map"
The memory in the memory address can be read from or written to. One address usually corresponds to one byte of memory, so to read the byte at 0x0100
, we can write:
// If you do not cast 0x0100 to byte*,
// it will be treated as a regular integer rather than an address.
byte* memAdd = (byte*)0x0100;
byte memVal = *memAdd;
and to write to the address,
byte* memAdd = (byte*)0x0100;
*memAdd = 255; // or 11111111 in binary
Note that it is not safe to write to unknown addresses in RAM and some peripheral register addresses are read-only.
If the data to be read from or written to is less than one byte, you may use bitmasking to do so. Refer to the "Extra: Bitmasking" section under MCUVoltage.
I urge you to take the time to study the datasheet, to find out the functionalities and operations of the peripherals of the MCU.
AlmostRandom generates random number from a few entropy sources within the Arduino.
Copyright (C) 2024 cygig
This library is free software; you can redistribute it and/or modify it under the terms of the GNU Lesser General Public License as published by the Free Software Foundation; either version 2.1 of the License, or (at your option) any later version.
This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Lesser General Public License for more details.
You should have received a copy of the GNU Lesser General Public License along with this library; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA