rfc:native_regular_expressions
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
rfc:native_regular_expressions [2014/08/15 01:13] – Add wiki formatting bishop | rfc:native_regular_expressions [2025/04/03 13:08] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
====== PHP RFC: Native Regular Expression ====== | ====== PHP RFC: Native Regular Expression ====== | ||
- | * Version: 0.1 | ||
* Date: 2014-08-13 | * Date: 2014-08-13 | ||
* Author: Bishop Bettini, bishop@php.net | * Author: Bishop Bettini, bishop@php.net | ||
* Status: Draft | * Status: Draft | ||
- | * First Published at: http:// | ||
FIXME FIXME FIXME | FIXME FIXME FIXME | ||
Jotting my ideas down here. Move along. Maybe called " | Jotting my ideas down here. Move along. Maybe called " | ||
+ | Consider emulating structure of https:// | ||
+ | https:// | ||
+ | |||
+ | |||
+ | ===== Introduction ===== | ||
+ | |||
+ | Regular expressions provide powerful string matching capabilities and play a critical role in most software written in PHP. For example, Github reports [[https:// | ||
+ | |||
+ | In the current engine, regular expressions are plain old strings: | ||
+ | |||
+ | <code php> | ||
+ | while (preg_match('/ | ||
+ | </ | ||
+ | |||
+ | The primary disadvantage with string representation comes when the regular expression itself needs to contain a single quote, double quote, or the delimiters bracketing the regular expression. | ||
+ | |||
+ | <code php> | ||
+ | // match foo in examples: =" | ||
+ | preg_match_all(' | ||
+ | </ | ||
+ | |||
+ | In some other languages, regular expressions are part of the language itself. | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | Another problem with regular expressions buried in plain old strings is that syntax highlighting becomes much more difficult. | ||
+ | |||
+ | |||
+ | <code php> | ||
+ | function getLinesFromFile($fileName) { | ||
+ | if (!$fileHandle = fopen($fileName, | ||
+ | return; | ||
+ | } | ||
+ | | ||
+ | while (false !== $line = fgets($fileHandle)) { | ||
+ | yield $line; | ||
+ | } | ||
+ | | ||
+ | fclose($fileHandle); | ||
+ | } | ||
+ | |||
+ | $lines = getLinesFromFile($fileName); | ||
+ | foreach ($lines as $line) { | ||
+ | // do something with $line | ||
+ | } | ||
+ | </ | ||
+ | |||
+ | The code looks very similar to the array-based implementation. The main difference is that instead of pushing | ||
+ | values into an array the values are '' | ||
+ | |||
+ | Generators work by passing control back and forth between the generator and the calling code: | ||
+ | |||
+ | When you first call the generator function ('' | ||
+ | but nothing of the code is actually executed. Instead the function directly returns a '' | ||
+ | '' | ||
+ | loop: | ||
+ | |||
+ | Whenever the '' | ||
+ | hits a '' | ||
+ | |||
+ | Generator methods, together with the '' | ||
+ | classes too: | ||
+ | |||
+ | <code php> | ||
+ | class Test implements IteratorAggregate { | ||
+ | protected $data; | ||
+ | | ||
+ | public function __construct(array $data) { | ||
+ | $this-> | ||
+ | } | ||
+ | | ||
+ | public function getIterator() { | ||
+ | foreach ($this-> | ||
+ | yield $key => $value; | ||
+ | } | ||
+ | // or whatever other traversation logic the class has | ||
+ | } | ||
+ | } | ||
+ | |||
+ | $test = new Test([' | ||
+ | foreach ($test as $k => $v) { | ||
+ | echo $k, ' => ', $v, " | ||
+ | } | ||
+ | </ | ||
+ | |||
+ | Generators can also be used the other way around, i.e. instead of producing values they can also consume them. When | ||
+ | used in this way they are often referred to as enhanced generators, reverse generators or coroutines. | ||
+ | |||
+ | Coroutines are a rather advanced concept, so it very hard to come up with not too contrived an short examples. | ||
+ | For an introduction see an example [[https:// | ||
+ | If you want to know more, I highly recommend checking out [[http:// | ||
+ | on this subject]]. | ||
+ | |||
+ | |||
+ | |||
+ | |||
+ | |||
New built-in " | New built-in " | ||
Line 45: | Line 141: | ||
====== Discussions ====== | ====== Discussions ====== | ||
* https:// | * https:// | ||
+ | * http:// | ||
rfc/native_regular_expressions.1408065182.txt.gz · Last modified: 2025/04/03 13:08 (external edit)