MCPcopy
hub / github.com/PaddlePaddle/PaddleFormers / process_outside_codeblocks

Function process_outside_codeblocks

scripts/codestyle/check_spaces.py:29–48  ·  view source on GitHub ↗
(text)

Source from the content-addressed store, hash-verified

27
28
29def process_outside_codeblocks(text):
30 # 正则表达式用于匹配Markdown代码块
31 codeblock_pattern = r"```[\s\S]*?```"
32
33 # 找到所有的代码块并替换为占位符
34 codeblocks = re.findall(codeblock_pattern, text)
35 placeholders = []
36 for i, block in enumerate(codeblocks):
37 placeholder = f"CODEBLOCK_PLACEHOLDER_{i}"
38 placeholders.append(placeholder)
39 text = text.replace(block, placeholder, 1)
40
41 # 对非代码块文本处理中英文空格
42 processed_text = add_spaces_between_chinese_and_english(text)
43
44 # 将占位符替换回原来的代码块内容
45 for placeholder, block in zip(placeholders, codeblocks):
46 processed_text = processed_text.replace(placeholder, block, 1)
47
48 return processed_text
49
50
51def process_file(file_path):

Callers 1

process_fileFunction · 0.85

Calls 3

appendMethod · 0.45
replaceMethod · 0.45

Tested by

no test coverage detected