Have you ever felt the pain and boredom of spending lots of time doing research?
INTRODUCING...
Bamboo Pipe is an intelligent NLP pipeline interfaced by an elegant platform, developed for the purpose of analysing large volumes of articles to find the information you need. Not only does it increase the research efficiency, you’ll enjoy yourself the entire time too! This solution has been developed for the analysts at Datarama.
Datarama is a business intelligence company specializing in regulatory compliance. Combining advanced technology and expert human analysis, analysts at Datarama conduct due diligence analysis on companies and provide clients with information about whether they conform to rules, laws, and standards in the business world.
Datarama’s reliance on NexisDiligence as a news provider creates an undesirable dependency on external sources.
NexisDiligence displays false positive results 50% of the time. Hence, analysts need to conduct additional manual checks
that take up time that could have been better spent on more value-added tasks.
Additionally, articles currently stored in their database are unstructured and analysts face a great difficulty finding past
articles to refer to. The process of entering information individually into their internal Ingestion Portal is
repetitive and time-consuming. These issues make up the bulk of the difficulty that analysts face in the daily
routine and we aim to solve these issues by streamlining the time consuming workflow.
PRESENTING OUR SOLUTION
When analysts interact with the React frontend web application,
it will send API requests to the
Bamboo Pipe is a web application that streamlines the evaluation process of textual data and eases Datarama’s information ingestion process.
Users can login using designated username and password
Users can upload a CSV of URLs, or conduct Google search, or manually key in URL. Users can then define entity-sentiment pairs and topics they are interested in
Users can view the filtering progress on the dashboard. Finished results will be shown. A "tick" means a match with users' search parameters.
Users can see the filtering process has finished. They can then proceed to view the output results.
Users can see the highlighted entities with sentiment scores. Users can view the top-3 topics per paragraph identified. Paragraphs will be highlighted accordingly when users hover on the topic results.
Users can download the processed results into CSV and download the articles uploaded in a ZIP file or individual PDF.
LASTLY, THIS IS ENABLED BY OUR ADVANCED NLP TECHNOLOGY
Two Natural Language Processing (NLP) modules were developed to ease the article evaluation process.
Our Entity-Sentiment Analysis model identifies the named entity present in an article and detects the sentiment with respect to it. This analysis is conducted on a sentence level to detect news with negative sentiment surrounding the entity and hence relevant for the analysts.
Our Topic Modelling model identifies the topics present in an article so that analysts can easily understand the gist of an article.