大家是如何学习正则表达式的？或者有什么学习的建议？

emacs_ran · 2020 年2 月 18 日 02:42

你猜他在干嘛?

— My reply:

I am guessing you are try to find this string

lkjciidsofwopvqi-something

in this string long string

flkjciidsofwopvqi-something

But I got:

Debugger entered--Lisp error: (invalid-read-syntax ") or . in a vector")
  read(#<buffer *scratch*>)
  elisp--preceding-sexp()
  elisp--eval-last-sexp(nil)
  eval-last-sexp(nil)
  funcall-interactively(eval-last-sexp nil)
  call-interactively(eval-last-sexp nil nil)
  command-execute(eval-last-sexp)

Looks like I am missing some context here.

This is my learning:

matches two or more blank lines in sequence.

(re-search-forward "^\n\\{2,\\}")

match TODO

(re-search-forward "\\[TODO\\](\$[^\$]+\\))" nil t)

match UUID

(re-search-forward "PDFNAME::\$[a-zA-Z0-9 \.\_\,\-]+\$\.pdf - " nil t)

match page number

(re-search-forward "PAGENUM::\$[0-9]+\$::END" nil t)

match secrete

(re-search-forward (concat arg "::\$.*\$"))

match @dfn{xxx}

(re-search-forward "@dfn{\$[^}]+\$}" nil t)

match end sentence

(re-search-forward "[\.,;:!\?]" end t)

If you know python: and interested in nCoV:

github.com

BlankerL/DXY-COVID-19-Crawler/blob/master/service/crawler.py#L47-L51


      
          
          overall_information = re.search(r'(\{"id".*\})\}', str(soup.find('script', attrs={'id': 'getStatisticsService'})))
          if overall_information:
              self.overall_parser(overall_information=overall_information)

    overall_information = re.search(r'\{("id".*?)\]\}', str(soup.find('script', attrs={'id': 'getStatisticsService'})))
    province_information = re.search(r'\[(.*?)\]', str(soup.find('script', attrs={'id': 'getListByCountryTypeService1'})))
    area_information = re.search(r'\[(.*)\]', str(soup.find('script', attrs={'id': 'getAreaStat'})))
    abroad_information = re.search(r'\[(.*)\]', str(soup.find('script', attrs={'id': 'getListByCountryTypeService2'})))
    news = re.search(r'\[(.*?)\]', str(soup.find('script', attrs={'id': 'getTimelineService'})))

if you intersted in bash

find xxx from grep_xxx

$(grep -oP '(?<=aoa_)[0-9]+' tmp)

inspired by @manateelazycat

@Action

方法论: 提问还是搜索? 闲聊灌水

StackOverflow突破10k, [20200213-223033] 抛一个问题, 提问还是搜索? 答案: 只提问不搜索! 原因如下: 提问的过程, 也是梳理思路的过程, 往往问题还没写完, 答案便有了. 遇事便搜索, 有碍于良好思维习惯的养成; 搜索时, 注意力比较容易被操纵, 原本只是简单的问题, 过程中却被野花野草所吸引, 浪费大量时间; 提问能够钓到在当前知识水平与认知水平之上的解答, 更进一步可以裁判不同高手的答案; 而去搜索往往使用局限在当下的水平. 大部分问题, 并非直接挡住当前去路, 略过之后, 完全可以继续做事; 因此可以将问题抛出去; 提问更高效, 一个上午遇到5个问题, 全部提出来; 11:30的时候, 收割到4个答案, 只投入时间解决最后一个问题; 提问的心态: 本来就占用别人时间, 所以没脸没皮, 不要在乎难听的话与个人面子; 回答的心态: 作为个人的查漏补缺, 并且只回答自己感兴趣的问题, 这一条也可以反过来作为提问者的心态. 本文为项目"步步为营, 零秒精通Emacs"的Appedix A: Ask and Harves…

why github is unhappy with rep name regexp? Possible related to their reg engine?

oh my god, why regexp is everywhere…
proudly power by org-mode