re Regular Expressions 16: Managing Complex Regexes

Managing Complex Regexes

Regular expressions are fine if the text pattern you need to match is simple. But matching complicated text patterns might require long, convoluted regular expressions. You can mitigate this by telling the re.compile() function to ignore whitespace and comments inside the regular expression string. This “verbose mode” can be enabled by passing the flag re.VERBOSE as the second argument to re.compile().

Now instead of a hard-to-read regular expression like this:
```
import re
phoneRegex = re.compile(r'((\d{3}|\(\d{3}\))?(\s|-|\.)?\d{3}(\s|-|\.)\d{4}(\s*(ext|x|ext.)\s*\d{2,5})?)')
```
you can spread the regular expression over multiple lines with comments like this:
```
import re
phoneRegex = re.compile(r'''(
    (\d{3}|\(\d{3}\))?            # area code
    (\s|-|\.)?                    # separator
    \d{3}                         # first 3 digits
    (\s|-|\.)                     # separator
    \d{4}                         # last 4 digits
    (\s*(ext|x|ext.)\s*\d{2,5})?  # extension
    )''', re.VERBOSE)
```
Note how the previous example uses the triple-quote syntax (''') to create a multiline string so that you can spread the regular expression definition over many lines, making it much more legible.
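To check that the two forms really behave the same, here is a minimal sketch that compiles both and runs them over a sample string. The names `compact`, `verbose`, and `sample`, and the sample text itself, are assumptions made up for illustration and are not part of the original example.

```python
import re

# One-line form of the phone number pattern.
compact = re.compile(r'((\d{3}|\(\d{3}\))?(\s|-|\.)?\d{3}(\s|-|\.)\d{4}(\s*(ext|x|ext.)\s*\d{2,5})?)')

# Same pattern in verbose mode, spread over several commented lines.
verbose = re.compile(r'''(
    (\d{3}|\(\d{3}\))?            # area code
    (\s|-|\.)?                    # separator
    \d{3}                         # first 3 digits
    (\s|-|\.)                     # separator
    \d{4}                         # last 4 digits
    (\s*(ext|x|ext.)\s*\d{2,5})?  # extension
    )''', re.VERBOSE)

# Hypothetical sample text, invented for this sketch.
sample = 'Call 415-555-1011 or (415) 555-9999 ext 123 tomorrow.'

# group(0) is the full text of each match.
compact_hits = [m.group(0) for m in compact.finditer(sample)]
verbose_hits = [m.group(0) for m in verbose.finditer(sample)]

print(compact_hits)
print(verbose_hits)
```

Both lists should contain the same two phone numbers, since verbose mode changes only how the pattern is written, not what it matches.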

The comment rules inside the regular expression string are the same as in regular Python code: the # symbol and everything after it to the end of the line are ignored. Also, the extra spaces inside the multiline string for the regular expression are not considered part of the text pattern to be matched. This lets you organize the regular expression so it’s easier to read.
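One consequence of these rules is that a space or # you actually want to match has to be protected, either by escaping it with a backslash or by putting it inside a character class. The sketch below illustrates this; the pattern names `ignored` and `literal` and the test strings are hypothetical and chosen only for the demonstration.

```python
import re

# Unescaped whitespace and comments are dropped, so this pattern is just "ab".
ignored = re.compile(r'''
    a   # first letter
    b   # second letter
    ''', re.VERBOSE)

# A space inside a character class and an escaped '#' are kept as literals.
literal = re.compile(r'''
    a[ ]b     # a, one space, b
    \#\d+     # a hash sign followed by digits
    ''', re.VERBOSE)

ignored_ok = ignored.fullmatch('ab') is not None      # True
ignored_space = ignored.fullmatch('a b') is not None  # False: the space is not in the pattern
literal_ok = literal.fullmatch('a b#42') is not None  # True

print(ignored_ok, ignored_space, literal_ok)
```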

Original article: https://www.cnblogs.com/webRobot/p/5224147.html
