Package: rdomains 0.4.0
rdomains: Get the Category of Content Hosted by a Domain
Get the category of content hosted by a domain. Use Shallalist (service discontinued), 'VirusTotal' (which provides access to lots of services) <https://www.virustotal.com/>, 'DMOZ' <https://archive.org/details/dmoz-rdf-20150327>, University Domain list <https://github.com/Hipo/university-domains-list>, 'OpenAI' 'GPT' models, 'Anthropic' 'Claude' models, or validated machine learning classifiers based on 'Shallalist' data to learn about the kind of content hosted by a domain.
Authors:
rdomains_0.4.0.tar.gz
rdomains_0.4.0.zip(r-4.7)rdomains_0.4.0.zip(r-4.6)rdomains_0.4.0.zip(r-4.5)
rdomains_0.4.0.tgz(r-4.6-any)rdomains_0.4.0.tgz(r-4.5-any)
rdomains_0.4.0.tar.gz(r-4.7-any)rdomains_0.4.0.tar.gz(r-4.6-any)
rdomains_0.4.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
rdomains/json (API)
| # Install 'rdomains' in R: |
| install.packages('rdomains', repos = c('https://soodoku.r-universe.dev', 'https://cloud.r-project.org')) |
This package does not link to any Github/Gitlab/R-forge repository. No issue tracker or development information is available.
Last updated from:e0d361ba52. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 211 | ||
| source / vignettes | OK | 200 | ||
| linux-release-x86_64 | OK | 182 | ||
| macos-release-arm64 | OK | 221 | ||
| macos-oldrel-arm64 | OK | 195 | ||
| windows-devel | OK | 120 | ||
| windows-release | OK | 124 | ||
| windows-oldrel | OK | 126 | ||
| wasm-release | OK | 117 |
Exports:adult_ml1_catclaude_catdmoz_catget_dmoz_dataget_shalla_dataget_stevenblack_dataglm_shallanot_newsopenai_catshalla_catstevenblack_catuni_catvirustotal_cat
Dependencies:askpassbackportsbase64encbitbit64checkmateclicliprcodetoolscpp11crayoncurldplyrforeachgenericsglmnetgluehmshttriteratorsjsonlitelatticelifecyclemagrittrMatrixmimeopensslpillarpkgconfigprettyunitsprogresspurrrR.methodsS3R.ooR.utilsR6RcppRcppEigenreadrrlangshapestringistringrsurvivalsystibbletidyselecttriebeardtzdburltoolsutf8vctrsvirustotalvroomwithrXMLxml2
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| rdomains: Classify Domains by their Content | rdomains-package rdomains |
| Probability that Domain Hosts Adult Content Based on features of Domain Name and Suffix alone. | adult_ml1_cat |
| Get Category from Anthropic Claude | claude_cat |
| Get Category from DMOZ | dmoz_cat |
| Get DMOZ Data | get_dmoz_data |
| Get Shalla Data | get_shalla_data |
| Get Steven Black's Host List Data | get_stevenblack_data |
| ML Model | glm_shalla |
| Classify News and Non-News Based on keywords in the URL | not_news |
| Get Category from OpenAI | openai_cat |
| Get Category from Shallalist | shalla_cat |
| Get Category from Steven Black's Host List | stevenblack_cat |
| Get Category from University Domain List | uni_cat |
| Get Category from VirusTotal | virustotal_cat |
