A program that extracts data from HTML, TXT, and other files and adds it to a database.
http://goldseeker.sourceforge.net/
GoldSeeker is a data extraction application. It was built to extract formatted data
from HTML files, but it can be used with all kinds of files.
Its behaviour is defined by a rule-based configuration file. It can process files on
the local server or fetch web pages directly over HTTP.
GS is still in development; it is neither complete nor stable. Nevertheless, it can
already be used for simple extractions.
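
To give a rough idea of what rule-based extraction into a database looks like, here is a minimal Python sketch. It does not use GoldSeeker or its configuration format: the RULES table, the extract and store helpers, the sample markup, and the items table are all hypothetical names invented for this illustration.

```python
import re
import sqlite3

# Hypothetical rules: field name -> regex with one capturing group.
# This is NOT GoldSeeker's configuration format, only an illustration.
RULES = {
    "title": r"<h1>(.*?)</h1>",
    "price": r'<span class="price">([\d.]+)</span>',
}


def extract(text, rules):
    """Apply each rule to the text; fields with no match become None."""
    record = {}
    for field, pattern in rules.items():
        match = re.search(pattern, text, re.DOTALL)
        record[field] = match.group(1).strip() if match else None
    return record


def store(conn, record):
    """Insert one extracted record into a simple two-column table."""
    conn.execute("CREATE TABLE IF NOT EXISTS items (title TEXT, price TEXT)")
    conn.execute(
        "INSERT INTO items (title, price) VALUES (:title, :price)", record
    )
    conn.commit()


# Made-up sample input standing in for a local file or a fetched page.
sample = '<html><h1> Gold ring </h1><span class="price">19.90</span></html>'

conn = sqlite3.connect(":memory:")
store(conn, extract(sample, RULES))
rows = conn.execute("SELECT title, price FROM items").fetchall()
```

The sample string could just as well come from reading a local file or from urllib.request.urlopen. Regular expressions are used here only because they are short; a real HTML extractor would more likely rely on a proper parser.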