Code
Hub
Workspaces
Connect
Indexed graphs
Engine
MCP
copy
hub
/
github.com/bhaskatripathi/pdfGPT
/ preprocess
Function
preprocess
api.py:24–27 ·
view source on GitHub ↗
(text)
Source
from the content-addressed store, hash-verified
22
23
24
def
preprocess(text):
25
text = text.replace(
'\n'
,
' '
)
26
text = re.sub(
'\s+'
,
' '
, text)
27
return
text
28
29
30
def
pdf_to_text(path, start_page=1, end_page=None):
Callers
1
pdf_to_text
Function · 0.85
Calls
no outgoing calls
Tested by
no test coverage detected