rfc:precise_float_value

This is an old revision of the document!


PHP RFC: More precise float value handling

Introduction

This RFC is based on the discussion about displaying float values in json_encode and proposes more precise float value handling overall.

JSON is used to exchange data between systems. Although JSON RFC "6 Numbers" does not require specific implementation for float/int type, float value should be handled as precise as possible by default.

Currently, json_encode() uses EG(precision) which is set to 14. That means that 14 digits max are used for displaying (printing) the number. IEEE 754 double supports higher precision and serialize()/var_export() uses PG(serialize_precision)=17 to be more precise. Since json_encode() uses EG(precision), json_encode() removes lower digits of fraction parts and destroys original value even if PHP's float could hold more precise float value.

<?php
$j = '{ "v": 0.1234567890123456789 }';
var_dump(json_decode($j));
var_dump(json_encode(json_decode($j)));
ini_set('precision', 20);
var_dump(json_decode($j));
var_dump(json_encode(json_decode($j)));
var_dump(0.1234567890123456789);
?>
object(stdClass)#1 (1) {
  ["v"]=>
  float(0.12345678901235)
}
string(22) "{"v":0.12345678901235}"
object(stdClass)#1 (1) {
  ["v"]=>
  float(0.12345678901234567737)
}
string(28) "{"v":0.12345678901234567737}"
float(0.12345678901234567737)

PHP's float type stores “raw” IEEE 754 double and could display accurate fraction value up to 17 digits.

Current PHP outputs meaningless values for oversized EG(precision)/PG(serialize_precision).

<?php
$v = 0.12345678901234567890;
var_dump($v);
ini_set('precision', 100);
var_dump($v);
?>
float(0.12345678901235)
float(0.12345678901234567736988623209981597028672695159912109375)

That is caused by used mode for double to string conversion.

Proposal

This RFC proposes to introduce EG(precision)=-1 and PG(serialize_precision)=-1 that uses zend_dtoa()'s mode 0 which uses better algorigthm for rounding float numbers (-1 is used to indicate 0 mode).

The RFC also proposes changing ini for JSON precision to PG(serialize_precision).

Followings are sample codes and outputs of the proposed patch.

NEW behavior

<?php
$v = 10.0000000000001;
 
ini_set('precision', -1);
ini_set('serialize_precision', -1);
 
var_dump($v);
echo var_export($v, true), PHP_EOL;
echo json_encode($v), PHP_EOL;
echo $v, PHP_EOL;
?>
float(10.0000000000001)
10.0000000000001
10.0000000000001
10.0000000000001

OLD behavior

<?php
$v = 10.00000000000001;
 
ini_set('precision', 14);
ini_set('serialize_precision', 17);
 
var_dump($v);
echo var_export($v, true), PHP_EOL;
ini_set('serialize_precision', 14);
echo json_encode($v), PHP_EOL;
ini_set('serialize_precision', 17);
echo $v, PHP_EOL;
?>
float(10)
10.000000000000011
10
10

Please note that IEEE float cannot store exactly precise values. e.g. Result of “10/3” - see phpt of the patch. Even with this proposal, there will be rounding errors, but the behavior becomes similar to other languages and values are more precise in many cases.

Backward Incompatible Changes

Setting mode 0 as default can mean that the rounding will be more precise which also means that the rounding might be different in var_export()/serialize().

The BC break could happen only if someone would rely on exact output but that shouldn't be the case. All our existing tests passes when 0 mode is used.

None when old INI value is used.

Proposed PHP Version(s)

  • PHP 7.1

RFC Impact

To SAPIs

None.

To Existing Extensions

PHP overall

  • 0 mode (EG(precision)= -1) float outputs values rounded to nearest.

Standard module and JSON

  • serialize(), var_export(), json_encode - Use PG(serialize_precision) and 0 mode by default.

To Opcache

Not affected.

New Constants

None.

php.ini Defaults

precision

  • hardcoded default values : 14 Unmodified
  • php.ini-development values : 14 Unmodified
  • php.ini-production values : 14 Unmodified

serialize_precision

  • hardcoded default values : -1
  • php.ini-development values : -1
  • php.ini-production values : -1

Open Issues

None.

Unaffected PHP Functionality

PHP uses “raw” IEEE 754 value internally regardless of precision settings. Therefore, this RFC does not affect internal computation.

Future Scope

WDDX

  • wddx_serialize_vars/value() - Use PG(serialize_precision) and 0 mode. It uses EG(precision) currently.

XML_RPC

  • xmlrpc_encode() - Use PG(serialize_precision) and 0 mode. It uses EG(precision) currently.

Proposed Voting Choices

Requires a 50%+1 majority

There will be two votings

  • whether to introduce mode 0 and use it as default for serialize_precision
  • should PG(serialize_precision) be used instead of EG(precision) in json_encode.

Patches and Tests

The initial PR can be found here:

Note that the PR is currently outdate but it will be updated if the RFC is accepted.

Implementation

After the project is implemented, this section should contain

  1. the version(s) it was merged to
  2. a link to the git commit(s)
  3. a link to the PHP manual entry for the feature

References

Rejected Features

Keep this updated with features that were discussed on the mail lists.

rfc/precise_float_value.1465153045.txt.gz · Last modified: 2017/09/22 13:28 (external edit)