MCPcopy
hub / github.com/Snailclimb/JavaGuide / extractUrlsFromSitemap

Function extractUrlsFromSitemap

scripts/docsearch-index.mjs:91–99  ·  view source on GitHub ↗
(xml)

Source from the content-addressed store, hash-verified

89}
90
91function extractUrlsFromSitemap(xml) {
92 const urls = [...xml.matchAll(/<loc>(.*?)<\/loc>/g)]
93 .map((match) => match[1].trim())
94 .filter((url) => url.startsWith("https://javaguide.cn/"))
95 .filter((url) => !url.includes("/assets/"))
96 .filter((url) => !url.endsWith("/404.html"));
97
98 return maxUrls > 0 ? urls.slice(0, maxUrls) : urls;
99}
100
101function recordFor({ url, title, hierarchy, content, anchor, type, position }) {
102 const recordUrl = anchor ? `${url}#${anchor}` : url;

Callers 1

mainFunction · 0.70

Calls

no outgoing calls

Tested by

no test coverage detected