Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Known URLs: allow incoming to be "Neutral" (vs. default "Relevant") #5

Open
ctwardy opened this issue Apr 4, 2017 · 0 comments

Comments

@ctwardy
Copy link
Contributor

ctwardy commented Apr 4, 2017

[Discussed on Slack #pagetype ~3 weeks ago. I dad accidentally posted this to thh-classifiers.]

In "Add Known URLs", the user can supply a line-separated list of ostensibly-known URLs. Currently they come in as "Relevant". I'd like these pages to come in tagged as "Neutral", and then review.

In my use case, I am using SiteHound to find the relevant ones. I supply hundreds of likely, but unverified URLs from past crawls. Many were once relevant, but are now 404 or expired domains. Some were simply false positives. Starting "Neutral" makes it easy to find the pages not yet sorted into "Relevant" and "Irrelevant".

That will be esp. important if I later add a new batch of pages to review: coming in "Neutral" allows users easily to find and tag them.

(Similarly the option could be extended to user-defined labels, though in my case coming in unlabeled is just right.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant