No Pilots, No Demos presents...
- Koding URL: http://aphelionz.koding.io
tl;dr? watch the video here https://www.youtube.com/watch?v=QqqkmepANGg
ToSBot is an app that reads ANY Terms of Service page and alerts you to potential legal, moral, and ethical issues that may exist within. Powered by a neural network seeded by the fantastic dataset at http://tosdr.org, this app learns as it goes and improves itself to give you more accurate results.
That being said, we encourage human input by providing a "feedback" mode which allows people to give feedback on whether the privacy policy or terms of service is better or worse in regards to the concern we raise. For example, if our machine says a particular ToS doesn't allow users to delete their data, and the language is unclear or even the opposite, we make it easy for a user to tell us that.
- A corpus of text was gathered via ToSBack (thanks to EFF and ToSDr.org)
- The corpus was tokenized, or split into its consituent wordlike parts.
- For each document each token was ranked according to its frequency in the corpus at large.
- These document-specific lists of tokens were then sorted in descending order to create a signature
- The network was trained using "known" human-generated analysis of the legal pages provided by ToSDR.org
- Using the combination of the sorted token analysis, and the legap page analysis, the network was trained.
- From this point, each NEW document is analyzed and analysis points are probabilistically determined.
- Also, each new document is added to the corpus, and thus the machine learns anew.
- Short URLs are generated via RuGu Shrink URL shortening API (provided by Mashape)
- All feedback from "Feedback Mode" are captured via Sprint.ly tickets
- APITools is used to track all API calls
- nginx powers the HTML/CSS/JS in the Web folder of the Koding
- Count.ly username: [email protected]
- ODesk username: nopilotsnodemos (member since 2010)
| aphelionz (Mark) | skybondsor (Jordyn) |
|---|---|
![]() |
![]() |
Jordyn and Mark have been coding separately for a combined 26 years, and together both on and off the job for over 5 years. We initially joined the hackathon because of the opportunity to do something based on climate change. However, when the topics were announced, we both felt drawn towards the privacy policy / terms of service topic.


