MCPcopy
hub / github.com/codelucas/newspaper / remove_scripts_styles

Method remove_scripts_styles

newspaper/cleaners.py:102–116  ·  view source on GitHub ↗
(self, doc)

Source from the content-addressed store, hash-verified

100 return doc
101
102 def remove_scripts_styles(self, doc):
103 # remove scripts
104 scripts = self.parser.getElementsByTag(doc, tag='script')
105 for item in scripts:
106 self.parser.remove(item)
107 # remove styles
108 styles = self.parser.getElementsByTag(doc, tag='style')
109 for item in styles:
110 self.parser.remove(item)
111 # remove comments
112 comments = self.parser.getComments(doc)
113 for item in comments:
114 self.parser.remove(item)
115
116 return doc
117
118 def clean_bad_tags(self, doc):
119 # ids

Callers 1

cleanMethod · 0.95

Calls 3

getElementsByTagMethod · 0.80
removeMethod · 0.80
getCommentsMethod · 0.80

Tested by

no test coverage detected