hub / github.com/ScrapeGraphAI/Scrapegraph-ai / lazy_load

Method lazy_load

scrapegraphai/docloaders/chromium.py:446–465 · view source on GitHub ↗

Lazily load text content from the provided URLs. This method yields Documents one at a time as they're scraped, instead of waiting to scrape all URLs before returning. Yields: Document: The scraped content encapsulated within a Document object.

(self)

Source from the content-addressed store, hash-verified

444	return [doc async for doc in self.alazy_load()]
445
446	def lazy_load(self) -> Iterator[Document]:
447	"""
448	Lazily load text content from the provided URLs.
449
450	This method yields Documents one at a time as they're scraped,
451	instead of waiting to scrape all URLs before returning.
452
453	Yields:
454	Document: The scraped content encapsulated within a Document object.
455	"""
456	scraping_fn = (
457	self.ascrape_with_js_support
458	if self.requires_js_support
459	else getattr(self, f"ascrape_{self.backend}")
460	)
461
462	for url in self.urls:
463	html_content = asyncio.run(scraping_fn(url))
464	metadata = {"source": url}
465	yield Document(page_content=html_content, metadata=metadata)
466
467	async def alazy_load(self) -> AsyncIterator[Document]:
468	"""

Callers 15

loadMethod · 0.95

test_lazy_load_with_js_supportFunction · 0.95

test_lazy_load_empty_urlsFunction · 0.95

test_lazy_load_invalid_backendFunction · 0.95

test_lazy_load_empty_contentFunction · 0.95

test_lazy_load_scraper_returns_noneFunction · 0.95

test_lazy_load_non_string_scraperFunction · 0.95

test_lazy_load_with_single_url_stringFunction · 0.95

test_lazy_load_with_none_urlsFunction · 0.95

test_lazy_load_sequential_timingFunction · 0.95

test_lazy_load_with_tuple_urlsFunction · 0.95

test_lazy_loadFunction · 0.45

Calls 1

runMethod · 0.45

Tested by 15

test_lazy_load_with_js_supportFunction · 0.76

test_lazy_load_empty_urlsFunction · 0.76

test_lazy_load_invalid_backendFunction · 0.76

test_lazy_load_empty_contentFunction · 0.76

test_lazy_load_scraper_returns_noneFunction · 0.76

test_lazy_load_non_string_scraperFunction · 0.76

test_lazy_load_with_single_url_stringFunction · 0.76

test_lazy_load_with_none_urlsFunction · 0.76

test_lazy_load_sequential_timingFunction · 0.76

test_lazy_load_with_tuple_urlsFunction · 0.76

test_lazy_loadFunction · 0.36

test_lazy_load_exceptionFunction · 0.36