Dec
22
Функции для работы с регулярными выражениями (Perl-совместимые)
Функции для работы с регулярными выражениями (Perl-совместимые)
Содержание
- preg_grep — Возвращает массив вхождений, которые соответствуют шаблону
- preg_last_error — Returns the error code of the last PCRE regex execution
- preg_match_all — Выполняет глобальный поиск шаблона в строке
- preg_match — Выполняет проверку на соответствие регулярному выражению
- preg_quote — Экранирует символы в регулярных выражениях
- preg_replace_callback — Выполняет поиск по регулярному выражению и замену с использованием функции обратного вызова
- preg_replace — Выполняет поиск и замену по регулярному выражению
- preg_split — Разбивает строку по регулярному выражению
Коментарии
Something to bear in mind is that regex is actually a declarative programming language like prolog : your regex is a set of rules which the regex interpreter tries to match against a string. During this matching, the interpreter will assume certain things, and continue assuming them until it comes up against a failure to match, which then causes it to backtrack. Regex assumes "greedy matching" unless explicitly told not to, which can cause a lot of backtracking. A general rule of thumb is that the more backtracking, the slower the matching process.
It is therefore vital, if you are trying to optimise your program to run quickly (and if you can't do without regex), to optimise your regexes to match quickly.
I recommend the use of a tool such as "The Regex Coach" to debug your regex strings.
http://weitz.de/files/regex-coach.exe (Windows installer) http://weitz.de/files/regex-coach.tgz (Linux tar archive)
One comment about 5.2.x and the pcre.backtrack_limit:
Note that this setting wasn't present under previous PHP releases and the behaviour (or limit) under those releases was, in practise, higher so all these PCRE functions were able to "capture" longer strings.
With the arrival of the setting, defaulting to 100000 (less than 100K), you won't be able to match/capture strings over that size using, for example "ungreedy" modifiers.
So, in a lot of situations, you'll need to raise that (very small IMO) limit.
The worst part is that PHP simply won't match/capture those strings over pcre.backtrack_limit and will it be 100% silent about that (I think that throwing some NOTICE/WARNING if raised could help a lot to developers).
There is a lot of people suffering this changed behaviour from I've read on forums, bugs and so on).
Hope this note helps, ciao :-)
Writing regexp can be tedious and error prone. Test yours on https://regex101.com/ to make sure you haven't missed anything . It even colour codes example strings.