rfc:mixed_vs_untyped_properties

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
rfc:mixed_vs_untyped_properties [2023/11/16 18:36] imsoprfc:mixed_vs_untyped_properties [2023/11/20 23:02] (current) imsop
Line 1: Line 1:
 ====== PHP RFC: Harmonise "untyped" and "typed" properties ====== ====== PHP RFC: Harmonise "untyped" and "typed" properties ======
-  * Version: 0.2 +  * Version: 0.5 
-  * Date: 2023-11-16+  * Date: 2023-11-20
   * Author: Rowan Tommins <imsop@php.net>   * Author: Rowan Tommins <imsop@php.net>
   * Status: Draft   * Status: Draft
Line 8: Line 8:
 ===== Introduction ===== ===== Introduction =====
  
-This RFC proposes to remove the distinction between "typed" and "untyped" properties, by treating any property with no type information as though it was declared ''mixed''. This is primarily aimed to reduce confusion around different states, error messages, and behaviours; to do so, it makes the language stricter in some circumstances, and changes the error message in others.+This RFC proposes to remove or at least minimise the distinction between "typed" and "untyped" properties, by treating any property with no type information as though it was declared ''mixed''. This is primarily aimed to reduce confusion around different states, error messages, and behaviours.
  
 PHP currently has three primary ways of adding properties to an object: PHP currently has three primary ways of adding properties to an object:
Line 16: Line 16:
   - Declared with a type. In addition to being allocated on every instance, the property is covered by extra guards on assignment to guarantee its type. If it is never assigned a value, or passed to ''unset'', it is assigned a special "uninitialized" state.   - Declared with a type. In addition to being allocated on every instance, the property is covered by extra guards on assignment to guarantee its type. If it is never assigned a value, or passed to ''unset'', it is assigned a special "uninitialized" state.
  
-The different behaviours of these properties are largely a result of the history of the language, rather than a consistent design. In particular, with [[rfc:mixed_type_v2|the addition of the ''mixed'' type]], it would seem logical for <php>private $foo;</php> to be short-hand for <php>private mixed $foo;</php>, since no type-guards are needed; but this is not currently the case, due to the different handling of initial states and ''unset''.+The different behaviours of these properties are largely a result of the history of the language, rather than a consistent design. In particular, with [[rfc:mixed_type_v2|the addition of the ''mixed'' type]], it would seem logical for <php>private $foo;</php> to be short-hand for <php>private mixed $foo;</php>, since no type-guards are needed; but this is not currently the case.
  
 ===== Current Behaviour ===== ===== Current Behaviour =====
 +
 +==== Initial State and Unset ====
  
 The three types of property vary in their //initial// state, and in their state //after calling ''unset''//, as can be seen in these three demos, which run the same code with the three types of property: [[https://3v4l.org/ln7Or|dynamic property]], [[https://3v4l.org/U2DFZ|untyped property]], [[https://3v4l.org/3igb4|typed property]]. The three types of property vary in their //initial// state, and in their state //after calling ''unset''//, as can be seen in these three demos, which run the same code with the three types of property: [[https://3v4l.org/ln7Or|dynamic property]], [[https://3v4l.org/U2DFZ|untyped property]], [[https://3v4l.org/3igb4|typed property]].
Line 24: Line 26:
 The states can be summarised in this table: The states can be summarised in this table:
  
-^ Property Declaration          ^ Initial state ^ After assignment ^ After ''unset'' ^ After re-assignment ^+^ Property Declaration                 ^ Initial state ^ After assignment ^ After ''unset'' ^ After re-assignment ^
 | <php>#[AllowDynamicProperties]</php> | Undefined     | Defined, public  | Undefined       | Defined, public     | | <php>#[AllowDynamicProperties]</php> | Undefined     | Defined, public  | Undefined       | Defined, public     |
 | <php>private $foo;</php>             | ''null''      | Defined, private | ?               | Defined, private    | | <php>private $foo;</php>             | ''null''      | Defined, private | ?               | Defined, private    |
Line 45: Line 47:
  
 ^                          ^ ''var_dump'' output      ^ Error on read                                ^ Error on write ^ ^                          ^ ''var_dump'' output      ^ Error on read                                ^ Error on write ^
-True undefined           | Not shown                | "Undefined property"                         | "Creation of dynamic property" \\ (unless on ''stdClass'' or with ''#[AllowDynamicProperties]'') |+Not declared on class    | Not shown                | "Undefined property"                         | "Creation of dynamic property" \\ (unless on ''stdClass'' or with ''#[AllowDynamicProperties]'') |
 ^ Declared then ''unset''  | Not shown                | "Undefined property"                         | None | ^ Declared then ''unset''  | Not shown                | "Undefined property"                         | None |
 ^ Typed and uninitialized  | ''uninitialized(mixed)'' | "must not be accessed before initialization" | None | ^ Typed and uninitialized  | ''uninitialized(mixed)'' | "must not be accessed before initialization" | None |
  
 +==== Variance under inheritance ====
 +
 +In a "gradual typing" system such as PHP's, any type that is unspecified is usually analysed as though it has the widest possible type for that position. The language itself makes such an analysis for enforcing correct variance rules in inheritance:
 +
 +  * A method parameter with no type specified is equivalent to ''mixed''. As input is contravariant, sub-classes cannot change to any narrower type, but can freely add or omit the ''mixed'' keyword.
 +  * A method which specifies no //return type// may return any type, or the pseudo-types ''void'' and ''never''; conceptually, it returns ''mixed|void|never'' (although that union cannot be specified explicitly). As return types are covariant, sub-classes can change to any narrower type, including ''mixed''; but omitting the type if the parent specified it would be widening the type to ''mixed|void|never'', so is not allowed.
 +
 +Properties are //invariant// (as they can be both written to and read from), so sub-classes must declare them with an //exactly equivalent// type. As with parameters, the widest possible type is ''mixed'', but this is //not// currently considered equivalent.
 +
 +<code php>
 +class A {
 +    /** @var mixed $untyped */
 +    public $untyped;
 +    
 +    public mixed $mixed;
 +    
 +    /** @param mixed $a */
 +    public function acceptsUntyped($a) {}
 +    
 +    public function acceptsMixed(mixed $a) {}
 +    
 +    public function returnsMixed(): mixed {}
 +    
 +    /** @return mixed|void|never */
 +    public function returnsUntyped() {}
 +}
 +
 +class B extends A {
 +    # Not Allowed: considers "mixed" to be distinct from "untyped"
 +    public mixed $untyped;
 +    # Also Not Allowed
 +    public $mixed;
 +    
 +    # Allowed: unspecified parameter type is implicitly "mixed", so no variance occurs
 +    public function acceptsUntyped(mixed $a) {}
 +    # Also Allowed
 +    public function acceptsMixed($a) {}
 +    
 +    # Not Allowed: widens return type from "mixed" to implicit "mixed|void|never"
 +    public function returnsMixed() {}
 +    
 +    # Allowed: narrows return type from implicit "mixed|void|never" to explicit "mixed"
 +    # explicit return types of "void" and "never" can also be used here
 +    public function returnsUntyped(): mixed {}
 +}
 +</code>
  
 ===== Proposal ===== ===== Proposal =====
Line 69: Line 117:
 > Property %s::%s must not be accessed before initialization > Property %s::%s must not be accessed before initialization
  
-==== Declared properties will no longer default to null ====+==== Properties with no declared type will be analysed as ''mixed'' ====
  
-The remaining difference is the //initial// value of the property. We could in principle resolve this in two ways:+If no type is specified for a property, its type will be analysed as ''mixed'', as is the case with parameters.
  
-  - Initialize any property to ''null'' if that is a valid valueand it has no other initializer +Consequentlythe following code will be valid:
-  - Never initialize a property to ''null'' unless that initial value is explicitly specified+
  
-Option 1 has the advantage of not causing errors in any currently valid code; but it goes against the general trend of making the language stricter and more explicit.+<code php> 
 +class Parent { 
 +    public $foo; 
 +    public mixed $bar; 
 +
 +class Child extends Parent { 
 +    public mixed $foo; 
 +    public $bar; 
 +
 +</code>
  
-Option 2 is a non-trivial breaking change, for which a deprecation period will be needed. The proposal is therefore to **add the following deprecation notice in PHP 8.next**:+==== Properties with no declared type will continue to default to ''null'' ====
  
-> Accessing property %s::%s before initialization is deprecated+Whereas we are already committed to introducing an error for accessing an ''unset'' property, accessing a property without explicitly initialising it is probably very common. As such, the proposal is to keep the difference in initial value: **if a property has neither a type nor an initializer, treat it as though it had a type of ''mixed'' and an initializer of ''null''**. 
  
-The calling code will continue to see the value of the property as ''null''. Assigning a value, either in the declaration (<php>private $foo = null;</php>) or in procedural code (<php>$this->foo = null;</php>) will suppress this notice for all further accesses. +In other wordsgiven the following class:
- +
-(**TODO**: How can this be implemented efficiently?+
- +
-In PHP 9.0untyped properties will start in the uninitialized state unless an initial value is provided, and attempting to access them will give the error shown in the previous section. +
- +
-===== Backward Incompatible Changes ===== +
- +
-Un-typed properties will no longer have an implicit initial value of ''null''. To retain the existing behaviour, an explicit initializer must be added:+
  
 <code php> <code php>
-// Before +class {
-class Example {+
     public $foo;     public $foo;
 +    private $bar;
 +    protected $other = 42;
 } }
 +</code>
  
-// After +Act as though this was specified: 
-class Example + 
-    public $foo = null;+<code php> 
 +class 
 +    public mixed $foo = null
 +    private mixed $bar = null; 
 +    protected mixed $other = 42;
 } }
 </code> </code>
  
-===== Proposed PHP Version(s) =====+Note that this means adding the keyword ''mixed'' to a declaration may still change the behaviour of a program, since it will change the initial state to "uninitialized"; but it retains the behaviour of all //existing// code without any action from users.
  
-In PHP 8.next:+===== Backward Incompatible Changes =====
  
-  * Add deprecation notice for untyped properties accessed without initialization+It is possible to write code relying directly on the current behaviour of ''unset''. However, note that [[rfc:undefined_property_error_promotion]] already commits us to changing direct access to such a property from a Warning to an Error.
  
-In PHP 9.0:+===== Proposed PHP Version(s) =====
  
-  * All declared properties will start "uninitialized", rather than implicitly initialized to ''null'' +Since the new ''unset'' behaviour produces a fatal error for previously valid code, the change will be in the next major version, i.e. **PHP 9.0**
-  * All declared properties will become "uninitialized" when passed to ''unset'' +
-  * The message for accessing uninitialized properties will be changed to remove reference to "typed properties"+
  
 ===== RFC Impact ===== ===== RFC Impact =====
Line 120: Line 172:
 ==== To Opcache ==== ==== To Opcache ====
  
-The only behaviour which is not already present in the language is the deprecation of implicit ''null'' initializers.+No new functionality is added in the final proposed state. Indeed, some edge cases that currently need to be handled may be removed. 
 + 
 +==== To Reflection ====
  
-The other +**TODO** Reflection currently shows the implicit ''= null'' on properties, but distinguishes between "no type" and ''mixed'' in parameters, even though they are analysed as equivalent in variance checks. Should we change this? Was this discussed when ''mixed'' was introduced?
  
 ===== Open Issues ===== ===== Open Issues =====
  
-  * [[rfc:readonly_properties_v2|Readonly properties were restricted]] to typed properties, citing the default ''null'' initializer as rationale; should this restriction be explicitly relaxed as part of this RFC? 
   * Are there other differences between typed and untyped properties to address?   * Are there other differences between typed and untyped properties to address?
-  * Can uninitialized untyped properties be implemented efficiently during the deprecation period?+  * Is there a better compromise than keeping the implicit ''= null'' forever?
  
 ===== Unaffected PHP Functionality ===== ===== Unaffected PHP Functionality =====
Line 134: Line 187:
   * The behaviour of //dynamic// properties, that is those not defined at all in the class definition, is not changed by this RFC.   * The behaviour of //dynamic// properties, that is those not defined at all in the class definition, is not changed by this RFC.
   * The interaction of ''unset'' and magic ''%%__get%%'' is the same for untyped and typed properties, so will not be affected by harmonising them. Specifically, following <php>unset($foo->bar);</php> a subsequent read of <php>$foo->bar</php> will call <php>$foo->__get('bar')</php> if available: https://3v4l.org/D16Tv   * The interaction of ''unset'' and magic ''%%__get%%'' is the same for untyped and typed properties, so will not be affected by harmonising them. Specifically, following <php>unset($foo->bar);</php> a subsequent read of <php>$foo->bar</php> will call <php>$foo->__get('bar')</php> if available: https://3v4l.org/D16Tv
 +  * Unlike `var_dump($object)`, both `(array)$object` and `serialize($object)` already skip uninitialized properties, so the change in `unset()` behaviour will not affect code using these.
 +
 +===== Rejected Features =====
 +
 +==== Changing initial value of properties ====
 +
 +When considering the //initial// value of the property, to be fully consistent, we should do one of two things:
 +
 +  - Initialize any property to ''null'' if that is a valid value, and it has no other initializer
 +  - Never initialize a property to ''null'' unless that initial value is explicitly specified
 +
 +Option 1 has the advantage of not causing errors in any currently valid code; but it goes against the general trend of making the language stricter and more explicit.
 +
 +Option 2 is a non-trivial breaking change; although the edits to be made can be trivially automated (changing code of the form <php>public $foo;</php> to <php>public $foo = null;</php>), they will be very widespread. Users may rightly question the value of requiring such an edit.
 +
 +==== Allowing ''readonly'' on untyped properties ====
 +
 +As currently defined, [[rfc:readonly_properties_v2|''readonly'' properties]] cannot be written to once they have been initialized, so it does not make sense to allow one with an inline initializer. This also means they are currently required to have a specified type, since without one they are implicitly initialized to ''null''.
 +
 +Unfortunately, this restriction must remain if we are keeping the implicit ''null'' initializer.
  
 ===== Future Scope ===== ===== Future Scope =====
Line 156: Line 229:
   * [[rfc:deprecate_dynamic_properties|Deprecating "dynamic" properties]]   * [[rfc:deprecate_dynamic_properties|Deprecating "dynamic" properties]]
  
-===== Rejected Features =====+Previous mailing list threads touching on this issue: 
 + 
 +  * [[https://externals.io/message/117487#117487|Discussion thread for "Undefined Property Error Promotion" RFC]] 
 +  * [[https://externals.io/message/117644|Vote thread for that RFC]] 
 + 
 +===== Significant revisions =====
  
-None yet.+  * [[https://wiki.php.net/rfc/mixed_vs_untyped_properties?rev=1700166842|Version 0.3]], 2023-11-16: originally posted for discussion 
 +  * [[https://wiki.php.net/rfc/mixed_vs_untyped_properties?rev=1700521288|Version 0.5]], 2023-11-20: changed to keep null initializer as special case; added discussion of equivalence for type variance checks
rfc/mixed_vs_untyped_properties.1700159794.txt.gz · Last modified: 2023/11/16 18:36 by imsop