rfc:compact-object-property-assignment

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revisionPrevious revision
Next revision
Previous revision
Next revisionBoth sides next revision
rfc:compact-object-property-assignment [2020/03/12 01:41] jgivonirfc:compact-object-property-assignment [2020/03/22 03:17] jgivoni
Line 1: Line 1:
 ====== PHP RFC: Compact Object Property Assignment ====== ====== PHP RFC: Compact Object Property Assignment ======
-  * Version: 0.9 + 
-  * Date: 2020-03-10+**COPA: A //pragmatic// approach to object literals** 
 + 
 +  * Version: 1.3 
 +  * Date: 2020-03-17
   * Author: Jakob Givoni <jakob@givoni.dk>   * Author: Jakob Givoni <jakob@givoni.dk>
-  * Status: Draft+  * Status: Under [[https://externals.io/message/109055|Discussion]]
  
 ===== Introduction ===== ===== Introduction =====
  
 ==== Summary ==== ==== Summary ====
-**A pragmatic approach to mimicking object literals.** 
  
 This RFC proposes a new, compact syntax to assign values to multiple properties on an object in a single expression. This RFC proposes a new, compact syntax to assign values to multiple properties on an object in a single expression.
 +
 +This //**pseudo object literal**// notation, (though not limited to such use) is intended to enable the developer to //**create an object and populating it inline**//, f.ex. directly as an argument in a function call.
 +
 +As an alternative to writing a data structure as an associative array, COPA gives the data a **//documented signature//** so that you know what parameters are expected and their values.
 +
 +You may find that COPA is best suited when you don’t mind public properties, don’t require a constructor and is willing to compromise on magic enforcement of mandatory properties, since as [[https://wiki.php.net/rfc/write_once_properties#read-only_semantics|the saying goes]]: //Object construction is a fuzzy concept in PHP and lazy initialisation is a feature!//
  
 ==== Example ==== ==== Example ====
 +
 +Let’s start with an example that demonstrates the essence of COPA.
  
 === Instead of doing this... === === Instead of doing this... ===
 +
 <code php> <code php>
-$myObj->prop_a 1; +$myObj = new Foo(); // 1. Create
-$myObj->prop_b = 2; +
-$myObj->prop_c = 3; +
-</code>+
  
 +$myObj->a = 1; // 2. Populate
 +$myObj->b = 2;
 +$myObj->c = 3;
 +
 +doTheFoo($myObj); // 3. Send
 +</code>
 === You will be able to do this: === === You will be able to do this: ===
 +
 <code php> <code php>
-$myObj->[ +doTheFoo((new Foo)->[ 
-    prop_a = 1, +    = 1, 
-    prop_b = 2, +    = 2, 
-    prop_c = 3, +    = 3, 
-];+]);
 </code> </code>
 +COPA is everything that comes after the object expression. You can use COPA on any expression that evaluates to an object. COPA is not tied to object construction, but can be used anytime, anywhere in the objects life, as many times as you want.
 +
 +//See more use cases below - and keep an open mind about the syntax; the options are limited these days :-)//
  
 ==== Motivation ==== ==== Motivation ====
-The purpose of this feature is to lighten the effort of populating data structures, especially medium to large ones. 
  
-Ideally the solution should meets the following criteria:+The purpose of this feature is to lighten the effort of creating, populating and sending data structures, from small to large ones. 
 + 
 +The goal was to find a solution that meets the following criteria:
  
   * **Brief** - only mention the object once (less repetition)   * **Brief** - only mention the object once (less repetition)
Line 39: Line 58:
   * **Typo-proof** - property names can be autocompleted easily by IDE (faster typing, fewer errors)   * **Typo-proof** - property names can be autocompleted easily by IDE (faster typing, fewer errors)
   * **Type-checking** - IDE can verify correct type for typed properties and annotated virtual properties   * **Type-checking** - IDE can verify correct type for typed properties and annotated virtual properties
-  * **Order-agnostic** - properties can be specified in any order (this doesn't mean that the result is necessarily the same for any ordering, as that will depend on the object implementation+  * **Order-agnostic** - properties can be specified in any order (though note that the order //may// change the result!
-  * **Sparcity** - there's no obligation to specify any particular property (non-specified properties may be given a default value)+  * **Sparcity** - any property can be “skipped” (“skipped” properties may acquire a default value)
   * **Simple** - does what you would expect without introducing any new concepts into the language   * **Simple** - does what you would expect without introducing any new concepts into the language
 +
 +//My focus is to find a **pragmatic** solution that can be implemented soon, but won’t trip up the continuing, inspiring development of the awesome language that is PHP.//
 +
 +//If you have ever wanted to create, populate and send an object inside a function call, COPA is your chance!//
  
 ===== Proposal ===== ===== Proposal =====
  
 ==== Syntax ==== ==== Syntax ====
-The proposed syntax following an object expression ''$myObj'' | ''(new MyClass())'' is the object arrow operator ''->'' followed by a set of square brackets ''[…]'' containing a comma-separated list of property name equal ''='' expression. + 
-A trailing comma is permitted for the same reasons it's permitted in array literals and function calls (as of PHP 7.3). +The proposed syntax consists of the object arrow operator followed by a set of square brackets containing a comma-separated list of assignments in the form of property name equals expression: 
-The whole block is considered an expression that returns the object we started with.+ 
 +<code> 
 +<object-expression> -> [ 
 +    `<property-name> <expression>`, 
 +    [repeat as needed], 
 +
 +</code> 
 +A trailing comma is permitted for the same reasons it's permitted in array literals and [[https://wiki.php.net/rfc/trailing-comma-function-calls|function calls (as of PHP 7.3)]]. 
 + 
 +The whole block is considered an expression that returns the object before the arrow. 
 + 
 +This syntax was chosen for its availability in the language. If we land on another syntax, I’m not married to this one. The only criteria are that it doesn’t conflict with anything else, that it is brief and feels good.
  
 ==== Interpretation ==== ==== Interpretation ====
-Each comma-separated assignment inside the curly brackets is executed as an assignment of the named property on the object preceding the block. If the property is defined and publicly accessible, it will simply be set, or possible throw a TypeError. If there's no property with that name, or if it's protected or private, the magic method '' _ _ set'' will be called just like you would expect. 
-When used in an expression, COPA simply returns the object itself. 
  
 +Each comma-separated assignment inside the brackets is executed as an assignment of the named property on the object preceding the block. If the property is defined and publicly accessible, it will simply be set, or possible throw a ''%%TypeError%%''. If there's no property with that name, or if it's protected or private, the magic method ''%%__set%%'' will be called just like you would expect. When used in an expression, **COPA** simply returns the object itself.
 +
 +If you replace COPA with single line assignments, you will always get the same result, f.ex.:
 +
 +<code php>
 +$foo->[
 +    a = 1,
 +    b = myfunc(),
 +    c = $foo->bar(),
 +];
 +
 +// The COPA above is identical to
 +$foo->a = 1;
 +$foo->b = myfunc();
 +$foo->c = $foo->bar();
 +</code>
 ==== Use cases ==== ==== Use cases ====
 +
 +=== Alternative to passive associative arrays ===
 +
 +<code php>
 +// Instead of this:
 +
 +doSomething([
 +    'a' => 1, // Array syntax doesn't provide any help on parameter names
 +    'b' => 2, // or types
 +]);
 +
 +// Use COPA:
 +
 +class Options {
 +    public $a;
 +    public $b;
 +}
 +
 +doSomething((new Options)->[
 +    a = 1, // Parameter name and type checking
 +    b = 2,
 +]);
 +</code>
 +//If you often create, populate and send the same families of data structure, declaring those structures and using COPA makes it a breeze.//
  
 === DTOs - data transfer objects === === DTOs - data transfer objects ===
-  * many properties 
-  * some properties may be optional or have default values 
-  * properties may be public so we may avoid writing a lot of boilerplate code when defining them 
  
-With COPA we can create the whole object inline right inside the function call:+Typical characteristics of DTOs: 
 + 
 +  * many properties 
 +  * properties may be optional, with default values 
 +  * public visibility on properties, i.e. no desire for boilerplate code to create setters and getters for each one 
 +  * desirability to create, populate and send in one go
  
 == With current syntax == == With current syntax ==
 +
 <code php> <code php>
-class Dto +class FooDto 
-    public string $foo+    public ?string $mane = null
-    public int $bar = 1; // Optional, with default +    public int $padme = 1; // Optional, with default 
-    public string $baz;+    public ?FooDto $hum = null;
 } }
  
-$myDto = new Dto(); // Instantiating the object first +$foo = new FooDto(); // Instantiating the object first 
-$myDto->foo = 'get'; // Setting the properties +$foo->mane = 'get'; // Setting the properties 
-$myDto->baz = 'life';+// Skipping the $padme property which has a default value 
 +$foo->hum = new FooDto(); // Creating a nested DTO 
 +$foo->hum->mane = 'life'; // Populating the nested DTO
  
-myFunc($myDto); // Passing to a function+doTheFoo($foo); // Passing it to a function
 </code> </code>
 +== With new COPA syntax ==
  
-== With new syntax == 
 <code php> <code php>
-myFync((new Dto())->[ // Constructing and populating inline +doTheFoo((new FooDto)->[ // Constructing and populating inline 
-    foo = 'get', +    mane = 'get', 
-    baz = 'life',+    hum = (new FooDto)->[ // Even nested structures 
 +        mane = 'life'
 +    ],
 ]); ]);
 </code> </code>
 +//Though the example is not a typical DTO, it represents the characteristics.//
  
 === Argument bags === === Argument bags ===
-  * many arguments needs to be passed to function+ 
 +Argument bags are typically used when: 
 + 
 +  * many arguments needs to be passed to function
   * some arguments are optional   * some arguments are optional
   * order of arguments is not important   * order of arguments is not important
-  + 
-With COPA we can avoid using simple arrays and instead get autocomplete and type-checking in the IDE with a syntax that smells of named parameters:+With the proposed new syntax we can **avoid using simple arrays** and instead get **autocomplete** and **type-checking** in the IDE with a syntax that smells of **named parameters**:
  
 == With current syntax == == With current syntax ==
 +
 <code php> <code php>
 class Foo { class Foo {
-    protected string $foo+    protected string $mane
-    protected int $bar+    protected int $padme
-    protected string $baz;+    protected string $hum;
  
     public function __construct(array $options) {     public function __construct(array $options) {
-        $this->foo = $options['foo']; +        $this->mane = $options['mane']; 
-        $this->bar = $options['bar'] ?? 1; +        $this->padme = $options['padme'] ?? 1; 
-        $this->baz = $options['baz'];+        $this->hum = $options['hum'];
     }     }
 } }
  
 $myFoo = new Foo([ $myFoo = new Foo([
-    'foo' => 'get', // Array syntax argument bag doesn't provide any help on parameter names  +    'mane' => 'get', // Array syntax doesn't provide any help on parameter names 
-    'baz' => 'life', // or types+    'hum' => 'life', // or types
 ]); ]);
 </code> </code>
 +== With new COPA syntax ==
  
-== Alternatively, with current syntax == 
-<code php> 
-class Foo { 
-    protected string $foo; 
-    protected int $bar; 
-    protected string $baz; 
- 
-    public function __construct(string $foo, int $bar, string $baz) { 
-        $this->foo = $foo; 
-        $this->bar = $bar; 
-        $this->baz = $baz; 
-    } 
-} 
- 
-$myFoo = new Foo('get', 1, 'life'); // Dealing with optional parameters and default values is not straight forward 
-</code> 
- 
-== With new syntax == 
 <code php> <code php>
 class FooOptions { // Separate concerns into an options class that handles optional and default values... class FooOptions { // Separate concerns into an options class that handles optional and default values...
-    public string $foo+    public ?string $mane = null
-    public int $bar = 1; // Optional, with default +    public int $padme = 1; // Optional, with default 
-    public string $baz;+    public ?string $hum = null;
 } }
  
Line 142: Line 209:
  
     public function __construct(FooOptions $options) {     public function __construct(FooOptions $options) {
 +        // Do some validate here if you must, f.ex. checking for mandatory parameters
         $this->options = $options;         $this->options = $options;
     }     }
 } }
  
-$myFoo = new Foo((new FooOptions())->[ +$myFoo = new Foo((new FooOptions)->// Objects as argument bags (pseudo named parameters?) 
-    foo = 'get', // Parameter name and type checking +    mane = 'get', // Parameter name and type checking 
-    baz = 'life',+    hum = 'life',
 ]); ]);
 </code> </code>
 +//The other alternative to an argument bag is usually a constructor with many arguments, which is something that has been attempted to solve with RFCs arguing for automatic promotion of arguments to properties (f.ex. [[https://wiki.php.net/rfc/automatic_property_initialization|RFC: Automatic Property Initialization]], but which is probably also better left to the COPA argument bag example above.//
  
 ==== Special cases ==== ==== Special cases ====
 +
 +Clarification of edge-case behavior.
  
 === Execution order === === Execution order ===
-The fact that the assignments are executed in the order they are listedjust as if they had been specified on separate lines, has the following consequence:+ 
 +The fact that the assignments are executed in the order they are listed (just as if they had been specified on separate lines), has the following consequence: 
 <code php> <code php>
 $myObj->[ $myObj->[
Line 164: Line 237:
 var_dump($myObj->bar); // int(30) var_dump($myObj->bar); // int(30)
 </code> </code>
 +//As the assignments are carried out in order on the object, you can use the new value of a previous assigment in a following one.//
  
 //There may be arguments equally for and against this behavior, but ultimately the simplicity of the design and implementation wins, in my opinion.// //There may be arguments equally for and against this behavior, but ultimately the simplicity of the design and implementation wins, in my opinion.//
  
-=== Expressions in property names === +=== Exceptions === 
-Property names must be expressed literallySome examples of what's possible in a regular assignmentbut won't be allowed inside COPA:+ 
 +If an expression inside a COPA block throws an exception, the result is the same as if the assignments had been done the old way, f.ex. if we have: 
 + 
 +<code php> 
 +class Foo { 
 +    public $a; 
 +    public $b; 
 +    public $c; 
 +
 + 
 +$foo = new Foo(); 
 + 
 +function iThrow() { 
 +    throw new \Exception(); 
 +
 +</code> 
 +Then the following examples behave identically: 
 + 
 +<code php> 
 +// With COPA 
 +try { 
 +    $foo->
 +        a = 'a', 
 +        b = iThrow(), 
 +        c = 'c', 
 +    ]; 
 +} catch (\Throwable $e) { 
 +    var_dump($foo); 
 +
 + 
 +</code> 
 +<code php> 
 +// Without COPA 
 +try { 
 +    $foo->setA('a'
 +        ->setB(iThrow()) 
 +        ->setC('c'); 
 +} catch (\Throwable $e) { 
 +    var_dump($foo); 
 +
 + 
 +// OR 
 + 
 +try { 
 +    $foo->a = 'a'; 
 +    $foo->b = iThrow(); 
 +    $foo->c = 'c'; 
 +} catch (\Throwable $e) { 
 +    var_dump($foo); 
 +
 +</code> 
 +The result in all cases is that ''%%a%%'' will be setwhile ''%%b%%'' and ''%%c%%'' will not: 
 + 
 +<code php> 
 +object(Foo)#1 (3) { 
 +  ["a"]=> 
 +  string(1) "a" 
 +  ["b"]=> 
 +  NULL 
 +  ["c"]=> 
 +  NULL 
 +
 +</code> 
 +//I.e. COPA is not an atomic operation in the same way method chaining isn’t.// 
 + 
 +==== Out of scope / future scope ==== 
 + 
 +This section contains a tentative list of features that may not be implemented. 
 + 
 +=== Can you do that? === 
 + 
 +The following examples show some things that is now possible using regular property accessor, but which will not also be supported with COPA:
  
 <code php> <code php>
 $p = 'foo'; $p = 'foo';
 $myObj->$p = 'bar'; // Variable property name $myObj->$p = 'bar'; // Variable property name
-$a->{"foo"} = 'baz'; // Property name generated from expression +$a->{"fo" . "o"} = 'baz'; // Property name generated from expression 
-    +$a->b->c = 'hum'; // Creating default object from empty value 
 +$a->d['e'] = 'dear'; // Setting array element inside property 
 +$a->f++; // Increment/decrement of property value 
 $myObj->[ $myObj->[
     $p = 'bar', // Syntax error     $p = 'bar', // Syntax error
     {"foo"} = 'bar', // Syntax error     {"foo"} = 'bar', // Syntax error
 +    b->c = 'hum', // Syntax error - but see Nested COPA below...
 +    d['e'] = 'dear', // Syntax
 +    f++, // Syntax error
 ]; ];
 </code> </code>
 +//If anyone can show that any these features would be significantly desirable and simultaneously rather trivial to implement, let’s discuss.//
  
-//Supporting these expressions would probably be tricky to implement because the property names are not quoted, so they are not represented as literal strings, which makes them look like constants and complicates parsing in general, I believe. It also defeats the purpose of name and type checking, and is not a feature you'd miss in object literals either.//+=== Nested COPA ===
  
 +It might be nice to be able to populate an existing nested object in the same block:
 +
 +<code php>
 +// This example, using current syntax...
 +$foo->a = 1;
 +$foo->b->c = 2;
 +
 +// Could be written with COPA like this:
 +$foo->[
 +    a = 1,
 +    b->[
 +        c = 2,
 +    ],
 +];
 +
 +// But for now you'll have to do this:
 +$foo->[
 +    a = 1,
 +    b = $foo->b->[
 +        c = 2,
 +    ],
 +];
 +
 +</code>
 ===== Backward Incompatible Changes ===== ===== Backward Incompatible Changes =====
-None. Array followed by square bracket causes syntax error in PHP 7.4. + 
-This new syntax is optional. If you don't use it, your code will continue to run.+None. Array followed by square bracket causes syntax error in PHP 7.4. This new syntax is optional. If you don't use it, your code will continue to run.
  
 ===== Proposed PHP Version(s) ===== ===== Proposed PHP Version(s) =====
-Next PHP 8.x+ 
 +PHP 8.0
  
 ===== Open Issues ===== ===== Open Issues =====
 +
 +==== Alternative syntaxes ====
 +
 +I’m going to suggest some alternative syntaxes, that we can vote on, provided their feasibility has been vetted by an experienced internals developer:
 +
 +=== Syntax A ===
 +
 +This is the originally proposed one:
 +
 +<code php>
 +$foo->[
 +    a = 1,
 +    b = 2,
 +    c = (new Foo)->[
 +        a = 3,
 +        b = 4,
 +    ],
 +];
 +</code>
 +=== Syntax B ===
 +
 +Since the [[https://wiki.php.net/rfc/deprecate_curly_braces_array_access|deprecation of curly brackets as array access in PHP 7.4]], that notation could be used to assign properties:
 +
 +<code php>
 +$foo{
 +    a = 1,
 +    b = 2,
 +    c = (new Foo){
 +        a = 3,
 +        b = 4,
 +    },
 +};
 +</code>
 +=== Syntax C ===
 +
 +No wrapper:
 +
 +<code php>
 +$foo->
 +    a = 1,
 +    b = 2,
 +    c = (new Foo)->
 +        a = 3,
 +        b = 4,
 +    ;,
 +;
 +</code>
 +Nesting becomes awkward - how do we jump out again?
 +
 +=== Syntax D ===
 +
 +Repeating the array for familiarity:
 +
 +<code php>
 +$foo
 +    ->a = 1,
 +    ->b = 2,
 +    ->c = (new Foo)
 +        ->a = 3,
 +        ->b = 4,
 +    ;,
 +;
 +</code>
 +Same issue with nested as previous. We need to find a way to express end of COPA block.
 +
 +=== Syntax E ===
 +
 +Like the original but with normal brackets instead of square ones:
 +
 +<code php>
 +$foo->(
 +    a = 1,
 +    b = 2,
 +    c = (new Foo)->(
 +        a = 3,
 +        b = 4,
 +    ),
 +);
 +</code>
 +==== Mandatory properties ====
 +
 +Some criticism has been that COPA is of little use without also enforcing mandatory properties to be set.
 +
 +Rowan Tommins:
 +
 +> It seems pretty rare that an object would have no mandatory properties, so saying “if you have a mandatory property, COPA is not for you” is ruling out a lot of uses.
 +
 +Michał Brzuchalski:
 +
 +> This helps to avoid bugs where a property is added to the class but forgot to be assigned it a value in all cases where the class is instantiated and initialized
 +
 +Mandatory properties are typed properties without a default value. They are in the uninitialized state until they are assigned a value. It has been suggested that an exception should be thrown at the end of the constructor if any property is still uninitialized, but this idea has not yet caught on. COPA doesn’t have any obvious way of enforcing mandatory properties.
 +
 +For now you must continue to write your own validation code to be carried out at the appropriate “point of no return”.
 +
 +==== Atomic operations ====
 +
 +It’s also been suggested that assigning multiple values using COPA should be an atomic operation that either succeeds or fails in its entirety.
 +
 +That does sound cool as well, and may seem like the expected behavior for some.
 +
 +Still, I’m not convinved it’s worth the extra hassle, since what were you planning to do with the incomplete object anyway?
 +
 +Chaining method calls are not an atomic operation and if an exception is thrown in the middle I doubt you would raise an eyebrow about the previous call having altered the state of the object.
  
 ===== Proposed Voting Choices ===== ===== Proposed Voting Choices =====
 +
 +The primary vote of whether or not to accept this RFC requires a 2/3 majority.
 +
 +A secondary “vote” directed at no-voters, will ask you the primary reason for voting “No”.
 +
 +The options will be:
 +
 +  - I voted yes!
 +  - I don’t find the feature useful
 +  - I don’t like the syntax
 +  - I prefer a more comprehensive solution to this problem
 +  - I prefer a narrower solution to this problem
 +  - This breaks backwards compatibility
 +  - This will negatively limit future changes
 +  - This will be a nightmare to implement and maintain
 +  - I prefer not to say
 +
 +This will help understand what the obstacles are, when studying this RFC in the future, should anyone be tempted to have another shot at object literals et. al.
  
 ===== Patches and Tests ===== ===== Patches and Tests =====
 +
 There are yet no patches nor tests. The question of who will be developing this will be addressed if the RFC passes. There are yet no patches nor tests. The question of who will be developing this will be addressed if the RFC passes.
  
 ===== Implementation ===== ===== Implementation =====
-After the project is implemented, this section should contain + 
 +After the project is implemented, this section should contain 
   - the version(s) it was merged into   - the version(s) it was merged into
   - a link to the git commit(s)   - a link to the git commit(s)
Line 205: Line 507:
  
 ===== References ===== ===== References =====
 +
 +Related RFCs:
 +
   * https://wiki.php.net/rfc/object-initializer   * https://wiki.php.net/rfc/object-initializer
   * https://wiki.php.net/rfc/objectarrayliterals   * https://wiki.php.net/rfc/objectarrayliterals
   * https://wiki.php.net/rfc/simplified_named_params   * https://wiki.php.net/rfc/simplified_named_params
   * https://wiki.php.net/rfc/named_params   * https://wiki.php.net/rfc/named_params
- +  * https://wiki.php.net/rfc/code_free_constructor 
-===== Rejected Features =====+  * https://wiki.php.net/rfc/constructor-promotion 
 +  * https://wiki.php.net/rfc/automatic_property_initialization 
 +  * https://wiki.php.net/rfc/skipparams
  
rfc/compact-object-property-assignment.txt · Last modified: 2020/04/14 06:30 by jgivoni