rfc:random_extension_improvement

PHP RFC: Random Extension Improvement

Introduction

Several issues with the "Random Extension 5.x" RFC were only raised after voting started (due to lack of proper discussion):

Engine implementations are not final

The Random Extension has classes that are natively implemented as RNG engines, but they are not marked as final. This allows classes to be created that inherit from native classes, but as stated in the previous RFC, user-implemented engines are inferior to native classes in terms of execution efficiency. This is true even for inheritance without method overrides, which often leads to confusion.

The extension already provides a Random\Engine interface with a single generate(): string method. Even if the native classes are made final, it is easy for the user to create an alternative class using delegates.

This is clearly an API design error, and the native implementations of the interface should be marked as final.

Random\SerializableEngine is not useful

This interface is a remnant from when the Serializable interface was still useful. This interface is no longer needed in PHP, as serializability is now determined by the existence of a magic method.

For this reason, remove Random\SerializableEngine.

This means that an Engine that implements SerializableEngine will no longer implement it. However, serializability is determined by the implementation of the magic method in current PHP, so it has no effect.

Random\Engine\CombinedLCG is low quality

The newly added Random\Engine\CombinedLCG is only for use with PHP's lcg_value() function. However, this algorithm is very classical and the quality of the output random numbers is at the lowest level.

In order to preserve the implementation of the lcg_value() function, the internal implementation is retained, but the implementation as a class is being dropped to prevent users from unintentionally using it.

There is no equivalent of array_rand()

array_rand() uses RNG internally, but there is no alternative method in Randomizer. As per the previous RFC, this was the intent, but upon further investigation, array_rand() is used by many packages and should probably be drop-in replaceable.

So add a method Randomizer::pickArrayKeys(array $array, int $num): array.

It may look incompatible with array_rand(array $array, int $num = 1): int|string|array, but you can get completely consistent results by doing the following:

$array = ['foo', 'bar', 'baz'];
 
// Before:
mt_srand(1234, MT_RAND_PHP);
$single = array_rand($array); // (int) 0
$multiple = array_rand($array, 2); // (array) [1, 2]
 
// After:
$engine = new Random\Engine\Mt19937(1234, MT_RAND_PHP);
$randomizer = new Random\Randomizer($engine);
$single = $randomizer->pickArrayKeys($array, 1)[0]; // (int) 0
 // or: [$single] = $randomizer->pickArrayKeys($array, 1);
$multiple = $randomizer->pickArrayKeys($array, 2); // (array) [1, 2]

"string" means a binary

In PHP, “string” means a binary. This is often a problem when using multibyte characters. For example, str_shuffle() on a Japanese (UTF-8) string will give messed up results.

Therefore, it may be better to change the alternative method of str_shuffle(), Randomizer::shuffleString(), to Randomizer::shuffleBytes(). This is a more appropriate name.

Engine classnames are not precise

To clearly identify the implemented algorithm, the PCG64 and MersenneTwister engines should be renamed to their canonical upstream name:

  • Random\Engine\PCG64Random\Engine\PcgOneseq128XslRr64
  • Random\Engine\MersenneTwisterRandom\Engine\Mt19937

PCG is not so famous

PCG is a very good algorithm, boasting great randomness and performance. However, I think its name recognition is in some ways inferior to Vigna's RNG, which started with Xorshift.

Therefore, I reimplement Xoshiro256**, which was previously mentioned as a candidate, and create the class Random\Engine\Xoshiro256StarStar. This will avoid the problem of using the old MT19937 since it is not familiar with PCG.

Proposal

For each of these issues, we will create a ballot option and make a decision.

Engine implementations are not final

Make all implemented engines final?
Real name Yes No
asgrim (asgrim)  
bwoebi (bwoebi)  
colinodell (colinodell)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
levim (levim)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
pierrick (pierrick)  
santiagolizardo (santiagolizardo)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
timwolla (timwolla)  
weierophinney (weierophinney)  
Final result: 18 1
This poll has been closed.

Random\SerializableEngine is not useful

Remove the SerializableEngine interface?
Real name Yes No
asgrim (asgrim)  
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
levim (levim)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
pierrick (pierrick)  
santiagolizardo (santiagolizardo)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
twosee (twosee)  
weierophinney (weierophinney)  
Final result: 18 0
This poll has been closed.

Random\Engine\CombinedLCG is low quality

Remove the CombinedLCG class?
Real name Yes No
asgrim (asgrim)  
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
levim (levim)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
pierrick (pierrick)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
timwolla (timwolla)  
twosee (twosee)  
weierophinney (weierophinney)  
Final result: 18 0
This poll has been closed.

There is no equivalent of array_rand()

Add the pickArrayKeys() method to the Randomizer?
Real name Yes No
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
twosee (twosee)  
weierophinney (weierophinney)  
Final result: 12 2
This poll has been closed.

"string" means a binary

Rename Randomizer::shuffleString() to Randomizer::shuffleBytes()?
Real name Yes No
asgrim (asgrim)  
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
levim (levim)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
pierrick (pierrick)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
twosee (twosee)  
weierophinney (weierophinney)  
Final result: 16 1
This poll has been closed.

Engine classnames are not precise

Rename PCG64 and MersenneTwister?
Real name Yes No
asgrim (asgrim)  
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
nicolasgrekas (nicolasgrekas)  
ocramius (ocramius)  
pierrick (pierrick)  
sergey (sergey)  
svpernova09 (svpernova09)  
theodorejb (theodorejb)  
timwolla (timwolla)  
weierophinney (weierophinney)  
Final result: 14 1
This poll has been closed.

PCG is not so famous

Add the Xoshiro256StarStar engine and class?
Real name Yes No
bwoebi (bwoebi)  
crell (crell)  
galvao (galvao)  
ilutov (ilutov)  
kalle (kalle)  
kguest (kguest)  
nicolasgrekas (nicolasgrekas)  
nikic (nikic)  
ocramius (ocramius)  
sergey (sergey)  
svpernova09 (svpernova09)  
timwolla (timwolla)  
twosee (twosee)  
weierophinney (weierophinney)  
Final result: 13 1
This poll has been closed.

Backward Incompatible Changes

The following names have been reserved and will no longer be available:

  • Random\Engine\Mt19937
  • Random\Engine\PcgOneseq128XslRr64
  • Random\Engine\Xoshiro256StarStar

The following class names will be made available again:

  • Random\Engine\CombinedLCG
  • Random\Engine\MersenneTwister
  • Random\Engine\PCG64
  • Random\SerializableEngine

Proposed PHP Version(s)

8.2

RFC Impact

To SAPIs

none

To Existing Extensions

none

To Opcache

none

New Constants

none

php.ini Defaults

none

Patches and Tests

References

rfc/random_extension_improvement.txt · Last modified: 2022/07/22 09:55 by guilliamxavier