rfc:compact-object-property-assignment

This is an old revision of the document!


PHP RFC: Compact Object Property Assignment

COPA: A pragmatic approach to object literals

Introduction

Summary

This RFC proposes a new, compact syntax to assign values to multiple properties on an object in a single expression.

This pseudo object literal notation, (though not limited to such use) is intended to enable the developer to create an object and populating it inline, f.ex. directly as an argument in a function call.

As an alternative to writing a data structure as an associative array, COPA gives the data a documented signature so that you know what parameters are expected and their values.

You may find that COPA is best suited when you don’t mind public properties, don’t require a constructor and is willing to compromise on magic enforcement of mandatory properties, since as the saying goes: Object construction is a fuzzy concept in PHP and lazy initialisation is a feature!

Example

Let’s start with an example that demonstrates the essence of COPA.

Instead of doing this...

$myObj = new Foo(); // 1. Create
 
$myObj->a = 1; // 2. Populate
$myObj->b = 2;
$myObj->c = 3;
 
doTheFoo($myObj); // 3. Send

You will be able to do this:

doTheFoo((new Foo)->[
    a = 1,
    b = 2,
    c = 3,
]);

COPA is everything that comes after the object expression. You can use COPA on any expression that evaluates to an object. COPA is not tied to object construction, but can be used anytime, anywhere in the objects life, as many times as you want.

See more use cases below - and keep an open mind about the syntax; the options are limited these days :-)

Motivation

The purpose of this feature is to lighten the effort of creating, populating and sending data structures, from small to large ones.

The goal was to find a solution that meets the following criteria:

  • Brief - only mention the object once (less repetition)
  • Inline - object can be created and populated in a single expression (pseudo object literals, nested objects)
  • Typo-proof - property names can be autocompleted easily by IDE (faster typing, fewer errors)
  • Type-checking - IDE can verify correct type for typed properties and annotated virtual properties
  • Order-agnostic - properties can be specified in any order (though note that the order may change the result!)
  • Sparcity - any property can be “skipped” (“skipped” properties may acquire a default value)
  • Simple - does what you would expect without introducing any new concepts into the language

My focus is to find a pragmatic solution that can be implemented soon, but won’t trip up the continuing, inspiring development of the awesome language that is PHP.

If you have ever wanted to create, populate and send an object inside a function call, COPA is your chance!

Proposal

Syntax

The proposed syntax consists of the object arrow operator followed by a set of square brackets containing a comma-separated list of assignments in the form of property name equals expression:

<object-expression> -> [
    `<property-name> = <expression>`,
    [repeat as needed],
]

A trailing comma is permitted for the same reasons it's permitted in array literals and function calls (as of PHP 7.3).

The whole block is considered an expression that returns the object before the arrow.

This syntax was chosen for its availability in the language. If we land on another syntax, I’m not married to this one. The only criteria are that it doesn’t conflict with anything else, that it is brief and feels good.

Interpretation

Each comma-separated assignment inside the brackets is executed as an assignment of the named property on the object preceding the block. If the property is defined and publicly accessible, it will simply be set, or possible throw a TypeError. If there's no property with that name, or if it's protected or private, the magic method __set will be called just like you would expect. When used in an expression, COPA simply returns the object itself.

If you replace COPA with single line assignments, you will always get the same result, f.ex.:

$foo->[
    a = 1,
    b = myfunc(),
    c = $foo->bar(),
];
 
// The COPA above is identical to
$foo->a = 1;
$foo->b = myfunc();
$foo->c = $foo->bar();

Use cases

Alternative to passive associative arrays

// Instead of this:
 
doSomething([
    'a' => 1, // Array syntax doesn't provide any help on parameter names
    'b' => 2, // or types
]);
 
// Use COPA:
 
class Options {
    public $a;
    public $b;
}
 
doSomething((new Options)->[
    a = 1, // Parameter name and type checking
    b = 2,
]);

If you often create, populate and send the same families of data structure, declaring those structures and using COPA makes it a breeze.

DTOs - data transfer objects

Typical characteristics of DTOs:

  • many properties
  • properties may be optional, with default values
  • public visibility on properties, i.e. no desire for boilerplate code to create setters and getters for each one
  • desirability to create, populate and send in one go
With current syntax
class FooDto {
    public ?string $mane = null;
    public int $padme = 1; // Optional, with default
    public ?FooDto $hum = null;
}
 
$foo = new FooDto(); // Instantiating the object first
$foo->mane = 'get'; // Setting the properties
// Skipping the $padme property which has a default value
$foo->hum = new FooDto(); // Creating a nested DTO
$foo->hum->mane = 'life'; // Populating the nested DTO
 
doTheFoo($foo); // Passing it to a function
With new COPA syntax
doTheFoo((new FooDto)->[ // Constructing and populating inline
    mane = 'get',
    hum = (new FooDto)->[ // Even nested structures
        mane = 'life',
    ],
]);

Though the example is not a typical DTO, it represents the characteristics.

Argument bags

Argument bags are typically used when:

  • many arguments needs to be passed to a function
  • some arguments are optional
  • order of arguments is not important

With the proposed new syntax we can avoid using simple arrays and instead get autocomplete and type-checking in the IDE with a syntax that smells of named parameters:

With current syntax
class Foo {
    protected string $mane;
    protected int $padme;
    protected string $hum;
 
    public function __construct(array $options) {
        $this->mane = $options['mane'];
        $this->padme = $options['padme'] ?? 1;
        $this->hum = $options['hum'];
    }
}
 
$myFoo = new Foo([
    'mane' => 'get', // Array syntax doesn't provide any help on parameter names
    'hum' => 'life', // or types
]);
With new COPA syntax
class FooOptions { // Separate concerns into an options class that handles optional and default values...
    public ?string $mane = null;
    public int $padme = 1; // Optional, with default
    public ?string $hum = null;
}
 
class Foo { // And the main class that receives the options and handles some feature
    protected FooOptions $options;
 
    public function __construct(FooOptions $options) {
        // Do some validate here if you must, f.ex. checking for mandatory parameters
        $this->options = $options;
    }
}
 
$myFoo = new Foo((new FooOptions)->[ // Objects as argument bags (pseudo named parameters?)
    mane = 'get', // Parameter name and type checking
    hum = 'life',
]);

The other alternative to an argument bag is usually a constructor with many arguments, which is something that has been attempted to solve with RFCs arguing for automatic promotion of arguments to properties (f.ex. RFC: Automatic Property Initialization, but which is probably also better left to the COPA argument bag example above.

Special cases

Clarification of edge-case behavior.

Execution order

The fact that the assignments are executed in the order they are listed (just as if they had been specified on separate lines), has the following consequence:

$myObj->[
    foo = 10,
    bar = $myObj->foo + 20,
];
 
var_dump($myObj->bar); // int(30)

As the assignments are carried out in order on the object, you can use the new value of a previous assigment in a following one.

There may be arguments equally for and against this behavior, but ultimately the simplicity of the design and implementation wins, in my opinion.

Exceptions

If an expression inside a COPA block throws an exception, the result is the same as if the assignments had been done the old way, f.ex. if we have:

class Foo {
    public $a;
    public $b;
    public $c;
}
 
$foo = new Foo();
 
function iThrow() {
    throw new \Exception();
}

Then the following examples behave identically:

// With COPA
try {
    $foo->[
        a = 'a',
        b = iThrow(),
        c = 'c',
    ];
} catch (\Throwable $e) {
    var_dump($foo);
}
// Without COPA
try {
    $foo->setA('a')
        ->setB(iThrow())
        ->setC('c');
} catch (\Throwable $e) {
    var_dump($foo);
}
 
// OR
 
try {
    $foo->a = 'a';
    $foo->b = iThrow();
    $foo->c = 'c';
} catch (\Throwable $e) {
    var_dump($foo);
}

The result in all cases is that a will be set, while b and c will not:

object(Foo)#1 (3) {
  ["a"]=>
  string(1) "a"
  ["b"]=>
  NULL
  ["c"]=>
  NULL
}

I.e. COPA is not an atomic operation in the same way method chaining isn’t.

Out of scope / future scope

This section contains a tentative list of features that may not be implemented.

Can you do that?

The following examples show some things that is now possible using regular property accessor, but which will not also be supported with COPA:

$p = 'foo';
$myObj->$p = 'bar'; // Variable property name
$a->{"fo" . "o"} = 'baz'; // Property name generated from expression
$a->b->c = 'hum'; // Creating default object from empty value
$a->d['e'] = 'dear'; // Setting array element inside property
$a->f++; // Increment/decrement of property value
 
$myObj->[
    $p = 'bar', // Syntax error
    {"foo"} = 'bar', // Syntax error
    b->c = 'hum', // Syntax error - but see Nested COPA below...
    d['e'] = 'dear', // Syntax
    f++, // Syntax error
];

If anyone can show that any these features would be significantly desirable and simultaneously rather trivial to implement, let’s discuss.

Nested COPA

It might be nice to be able to populate an existing nested object in the same block:

// This example, using current syntax...
$foo->a = 1;
$foo->b->c = 2;
 
// Could be written with COPA like this:
$foo->[
    a = 1,
    b->[
        c = 2,
    ],
];
 
// But for now you'll have to do this:
$foo->[
    a = 1,
    b = $foo->b->[
        c = 2,
    ],
];

Backward Incompatible Changes

None. Array followed by square bracket causes syntax error in PHP 7.4. This new syntax is optional. If you don't use it, your code will continue to run.

Proposed PHP Version(s)

PHP 8.0

Open Issues

Alternative syntaxes

I’m going to suggest some alternative syntaxes, that we can vote on, provided their feasibility has been vetted by an experienced internals developer:

Syntax A

This is the originally proposed one:

$foo->[
    a = 1,
    b = 2,
    c = (new Foo)->[
        a = 3,
        b = 4,
    ],
];

Syntax B

Since the deprecation of curly brackets as array access in PHP 7.4, that notation could be used to assign properties:

$foo{
    a = 1,
    b = 2,
    c = (new Foo){
        a = 3,
        b = 4,
    },
};

Syntax C

No wrapper:

$foo->
    a = 1,
    b = 2,
    c = (new Foo)->
        a = 3,
        b = 4,
    ;,
;

Nesting becomes awkward - how do we jump out again?

Syntax D

Repeating the array for familiarity:

$foo
    ->a = 1,
    ->b = 2,
    ->c = (new Foo)
        ->a = 3,
        ->b = 4,
    ;,
;

Same issue with nested as previous. We need to find a way to express end of COPA block.

Syntax E

Like the original but with normal brackets instead of square ones:

$foo->(
    a = 1,
    b = 2,
    c = (new Foo)->(
        a = 3,
        b = 4,
    ),
);

Mandatory properties

Some criticism has been that COPA is of little use without also enforcing mandatory properties to be set.

Rowan Tommins:

It seems pretty rare that an object would have no mandatory properties, so saying “if you have a mandatory property, COPA is not for you” is ruling out a lot of uses.

Michał Brzuchalski:

This helps to avoid bugs where a property is added to the class but forgot to be assigned it a value in all cases where the class is instantiated and initialized

Mandatory properties are typed properties without a default value. They are in the uninitialized state until they are assigned a value. It has been suggested that an exception should be thrown at the end of the constructor if any property is still uninitialized, but this idea has not yet caught on. COPA doesn’t have any obvious way of enforcing mandatory properties.

For now you must continue to write your own validation code to be carried out at the appropriate “point of no return”.

Atomic operations

It’s also been suggested that assigning multiple values using COPA should be an atomic operation that either succeeds or fails in its entirety.

That does sound cool as well, and may seem like the expected behavior for some.

Still, I’m not convinved it’s worth the extra hassle, since what were you planning to do with the incomplete object anyway?

Chaining method calls are not an atomic operation and if an exception is thrown in the middle I doubt you would raise an eyebrow about the previous call having altered the state of the object.

Proposed Voting Choices

The primary vote of whether or not to accept this RFC requires a 2/3 majority.

A secondary “vote” directed at no-voters, will ask you the primary reason for voting “No”.

The options will be:

  1. I voted yes!
  2. I don’t find the feature useful
  3. I don’t like the syntax
  4. I prefer a more comprehensive solution to this problem
  5. I prefer a narrower solution to this problem
  6. This breaks backwards compatibility
  7. This will negatively limit future changes
  8. This will be a nightmare to implement and maintain
  9. I prefer not to say

This will help understand what the obstacles are, when studying this RFC in the future, should anyone be tempted to have another shot at object literals et. al.

Patches and Tests

There are yet no patches nor tests. The question of who will be developing this will be addressed if the RFC passes.

Implementation

After the project is implemented, this section should contain

  1. the version(s) it was merged into
  2. a link to the git commit(s)
  3. a link to the PHP manual entry for the feature
  4. a link to the language specification section (if any)

References

rfc/compact-object-property-assignment.1584847043.txt.gz · Last modified: 2020/03/22 03:17 by jgivoni