hub / github.com/ArtifexSoftware/pdf2docx / compare_pdf

Function compare_pdf

test/test.py:175–195
(pdf1, pdf2, num_pages=None)

Source from the content-addressed store, hash-verified

def compare_pdf(pdf1, pdf2, num_pages=None):
    #print(f'Comparing {pdf1=} {pdf2=}')
    with fitz.Document(pdf1) as doc1, fitz.Document(pdf2) as doc2:
        if num_pages:
            n1 = num_pages
        else:
            n1 = len(doc1)
            n2 = len(doc2)
            if n1 != n2:
                print(f'Differing numbers of pages: {n1=} {n2=}.')
                return -1
        sidx = 0
        # Find average similarity.
        for n in range(n1):
            diff_png = f'{pdf2}.diff.{n}.png'
            sidx_n = get_page_similarity(doc1[n], doc2[n], diff_png)
            #print(f'Page {n}: {diff_png} {sidx_n=}.')
            sidx += sidx_n
        sidx /= n1
        #print(f'{sidx=}')
        return sidx
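
The control flow above can be sketched without PyMuPDF, using plain sequences in place of documents. This is an illustrative sketch and not the project's code: `average_similarity` and `exact_match` are hypothetical names, and `exact_match` is only a stand-in for `get_page_similarity`, whose real scoring and third argument (the diff PNG path) are not reproduced here.

```python
from typing import Callable, Optional, Sequence


def average_similarity(
    pages1: Sequence[object],
    pages2: Sequence[object],
    page_similarity: Callable[[object, object], float],
    num_pages: Optional[int] = None,
) -> float:
    """Mirror compare_pdf's control flow on plain sequences (illustrative only)."""
    if num_pages:
        # The caller limits the comparison; page counts are not checked.
        n = num_pages
    else:
        n = len(pages1)
        if n != len(pages2):
            # Same sentinel that compare_pdf returns for mismatched page counts.
            return -1
    total = 0.0
    for i in range(n):
        total += page_similarity(pages1[i], pages2[i])
    # Mean of the per-page scores.
    return total / n


def exact_match(a: object, b: object) -> float:
    # Hypothetical stand-in for get_page_similarity: 1.0 if equal, else 0.0.
    return 1.0 if a == b else 0.0


print(average_similarity(["a", "b", "c", "d"], ["a", "b", "x", "d"], exact_match))  # 0.75
print(average_similarity(["a", "b"], ["a"], exact_match))  # -1
print(average_similarity(["a", "b"], ["a"], exact_match, num_pages=1))  # 1.0
```

Two properties of the original carry over to the sketch: passing `num_pages` skips the page-count check entirely, and a document with zero pages would raise `ZeroDivisionError` at the final division.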

Callers 1

test_one · Function · 0.85

Calls 1

get_page_similarity · Function · 0.85

Tested by

no test coverage detected
