Clean Raw Article

Converts raw HTML article content to cleaned markdown. Performs regex-based cleaning and HTML-to-markdown conversion without using an LLM.
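As a rough illustration of what regex-based cleaning followed by HTML-to-markdown conversion can look like, here is a minimal sketch using only the Python standard library. The function name mirrors the job kind, but the specific rules (which tags are dropped, how headings, links, emphasis, and lists are mapped) are assumptions made for this example and are not taken from the job's actual implementation.

```python
import html
import re


def clean_raw_article(raw_html: str) -> str:
    """Hypothetical sketch: regex cleanup, then a small HTML-to-markdown mapping."""
    text = raw_html

    # Cleaning: drop script/style blocks and HTML comments entirely.
    text = re.sub(r"(?is)<(script|style)\b.*?>.*?</\1\s*>", "", text)
    text = re.sub(r"(?s)<!--.*?-->", "", text)

    # Conversion: headings <h1>..<h6> become '#'-prefixed lines.
    def heading(match: re.Match) -> str:
        level = int(match.group(1))
        return "\n\n" + "#" * level + " " + match.group(2).strip() + "\n\n"

    text = re.sub(r"(?is)<h([1-6])\b[^>]*>(.*?)</h\1\s*>", heading, text)

    # Links (double-quoted href only), bold, and italic.
    text = re.sub(r'(?is)<a\b[^>]*\bhref="([^"]*)"[^>]*>(.*?)</a\s*>', r"[\2](\1)", text)
    text = re.sub(r"(?is)<(strong|b)\b[^>]*>(.*?)</\1\s*>", r"**\2**", text)
    text = re.sub(r"(?is)<(em|i)\b[^>]*>(.*?)</\1\s*>", r"*\2*", text)

    # List items become '- ' bullets; block-level closers become blank lines.
    text = re.sub(r"(?is)<li\b[^>]*>(.*?)</li\s*>", lambda m: "\n- " + m.group(1).strip(), text)
    text = re.sub(r"(?i)<br\s*/?>", "\n", text)
    text = re.sub(r"(?i)</(p|div|ul|ol)\s*>", "\n\n", text)

    # Strip any remaining tags, then decode entities such as &amp;.
    text = re.sub(r"<[^>]+>", "", text)
    text = html.unescape(text)

    # Normalize whitespace: collapse spaces, trim around newlines, cap blank lines.
    text = re.sub(r"[ \t]+", " ", text)
    text = re.sub(r" *\n *", "\n", text)
    text = re.sub(r"\n{3,}", "\n\n", text)
    return text.strip()


sample = (
    '<h1>Title</h1><p>Hello <b>world</b> &amp; '
    '<a href="https://example.com">link</a>.</p><script>x()</script>'
)
print(clean_raw_article(sample))
```

Regexes keep a step like this fast and deterministic, but they do not handle nested or malformed markup reliably; a production version would likely need more rules or a real HTML parser.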

Job Metadata

Job Kind: clean_raw_article
Queue: system
Type: System

Used by Workflows

benzinga_article_processing
Stage: clean_raw_article
general_article_processing
Stage: clean_raw_article