Bővebb ismertető
Preface
There has been much hype in the computer press concerning AI agents that, among other things, scan the Internet and prepare daily personal newspapers. Beneath this mountain of hype exist several technically solid techniques that can be used to
partially automate scanning for interesting online information based on personal preferences
organize retrieved material for local storage
automatically format retrieved information for printing in "personal newspapers"
Textual information on the Internet is available in both plain ASCII text and in structured text, usually World Wide Web (WWW) documents formatted in Hypertext Markup Language (HTML). Text in HTML documents is inherently more valuable than plain ASCII text. The tools developed in this book retain the structure information of HTML formatting in addition to collecting and maintaining content information from plain ASCII text.
The example application developed in this book is a program to generate personal newspapers from information collected