Update README to match new DomainClassifier

This commit is contained in:
Alexandre Dulaunoy 2012-01-24 11:58:00 +01:00
parent 9f4ae469d9
commit b5bfadb368

View file

@ -4,11 +4,35 @@ DomainClassifier
DomainClassifier is a simple Python library to extract and classify Internet
domains from raw text files following their existence, localization or attributes.
How To Use It
-------------
```python
import domainclassifier
c = domainclassifier.Extract( rawtext = "this is a text with a domain called test@foo.lu another test abc.lu something a.b.c.d.e end of 1.2.3.4 foo.be www.belnet.be http://www.cert.be/ www.public.lu www.allo.lu quuxtest www.eurodns.com something-broken-www.google.com www.google.lu trailing test")
# extracting potentially valid domains from rawtext
print c.domain()
# reduce set of potentially valid domains to existing domains
# (based on SOA,A,AAAA,CNAME,MX records)
print c.validdomain(extended=True)
# reduce set of valid domains with DNS records associated to a
# specified country
print "US:"
print c.localizedomain(cc='US')
print "LU:"
print c.localizedomain(cc='LU')
print "BE:"
print c.localizedomain(cc='BE')
```
### Software Required
* Python (tested successfully on version 2.6)
* dnspython library - http://www.dnspython.org/
* IPy library
### License