KNC13 OPENGOV | OPEN BS DETECTOR: Baffle-speak sensing and contextualization | Knight News Challenge

OPEN BS DETECTOR | Baffle-speak sensing and contextualization

Short URL: http://j.mp/skknc13obsd

Twitter: @OpenBSDetector

Public officials too often rely on rote phrases devoid of real meaning (baffle-speak) when answering questions of public interest. Open BS Detector spots the sound bites, logs, tracks and alerts about them, and offers useful information and context.

THE TASK
Citizens and journalists need to be able to more quickly and easily discern fact from fiction or misdirection in statements by their politicians and public officials, put those statements in context, and get accurate facts to make informed decisions and take action.

SOLUTION
Open Baffle-Speak Detector will parse those comments and answers to questions, compare them to a historical record by the individual on a topic, and return statistical information, as well as verified facts to the individual seeking a better understanding of a subject and an official’s approach to it.
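
As a rough illustration of the comparison step described above, the sketch below flags word sequences in a new statement that the same official has already used in several earlier statements, and reports the share of the statement that is recycled. It is a minimal sketch under stated assumptions, not the project's design: the function names (ngrams, rote_phrases, baffle_score), the three-word window and the repeat threshold are all hypothetical, and simple n-gram overlap stands in for whatever statistical method the project would actually adopt.

```python
import re
from collections import Counter


def ngrams(text, n=3):
    """Split text into lowercase word n-grams (hypothetical helper)."""
    words = re.findall(r"[a-z']+", text.lower())
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


def rote_phrases(statement, history, n=3, min_repeats=2):
    """Return n-grams of `statement` found in at least `min_repeats` past statements."""
    seen = Counter()
    for past in history:
        # Use a set so each past statement counts a phrase only once.
        seen.update(set(ngrams(past, n)))
    return [g for g in ngrams(statement, n) if seen[g] >= min_repeats]


def baffle_score(statement, history, n=3, min_repeats=2):
    """Fraction of the statement's n-grams that are recycled (0.0 to 1.0)."""
    grams = ngrams(statement, n)
    if not grams:
        return 0.0
    return len(rote_phrases(statement, history, n, min_repeats)) / len(grams)


# Illustrative, made-up statements; not real quotations.
history = [
    "We are committed to working hard for families.",
    "Our government is committed to working hard every day.",
]
statement = "We are committed to working together."
print(rote_phrases(statement, history))
print(baffle_score(statement, history))
```

A real tool would also need speaker attribution, topic filtering and a verified fact source, none of which this sketch attempts.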

IMPLEMENTATION
Open Baffle-Speak Detector will have three modes to enable people to identify and contextualize the accuracy, veracity and utility of officials’ statements:

  1. A browser plug-in for use on news articles, online transcripts and other text or Web pages.
  2. Voice-recognition/transcription and a Shazam-like mode for real-time assessment.
  3. A photo/facial recognition and/or augmented reality function that enables the user to easily identify the individual, their track record and overall baffle-speak score.

In preliminary discussions, people in the open government movement, journalists and technologists have met this idea with enthusiasm. The technology to do this exists or is in development in other arenas, and simply needs to be brought together for an end-to-end, easy experience rather than a patchwork that results in a laborious series of tasks, or a lack of capability in this particular sphere. We plan to collaborate with as many developers of existing projects, tools and technologies as possible.

SIMILAR PROJECTS AND COLLABORATION OPPORTUNITIES
The Truth Goggles, Lazy Truth, Super PAC App, and Churnalism projects are a few examples of similar ideas with different applications and approaches. They focus on parsing text or standardized data. None handle live, real-time input. Open Baffle-Speak Detector will stitch these approaches together for a robust, on-demand tool that works in live situations.

CURRENT STATUS
We have not commenced development on Open Baffle-Speak Detector, but have had initial contacts with people working on other and related projects. We will collaborate and build on Truth Goggles, Hyperaudio and other projects as much as possible to avoid duplicating work.

TEAM
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net.

Collaborators & Advisors:
M. Boas: OpenNews Fellow 2012. Hyperaudio leader, jPlayer HTML5 media library project coordinator, open Web developer.
L. Gridinoc: OpenNews Fellow 2012. Creative technologist specializing in computational linguistics, semantic Web, and visual analytics.
P. Hunter: Over 20 years designing productive interactions between people and technology; expertise in speech recognition, software tools, and education; Fellow in the Leading by Design program at California College of the Arts; veteran of three start-up businesses; currently at Microsoft.
K. Kaushansky: Two decades specializing in speech recognition, voice user interface design, interactive audio experiences, speaker verification, and voice biometrics at startups and global technology firms, including Nortel and Microsoft. Currently at Jawbone.
K. Khan: User experience strategist and designer consulting to governments and Global 1000 corporations, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group.
H. Leson: Director of community engagement, Ushahidi; open source community developer, library and information technician.
M. Saniga, CA: Co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.

FUNDING AND TIMELINE
We anticipate that Open Baffle-Speak Detector will gain interest and uptake among civic development and open government foundations, news organizations, and real-time intelligence companies and investors, which would continue to fund development and custom or application-specific versions.

ONE SENTENCE SUMMARY
Open Baffle-Speak Detector is a tool that empowers citizens to identify when public officials give rote vs. real answers in their statements, and adds verified factual context.

LOCATION
Toronto, Ontario, Canada

KNC12 DATA | MARK OF THE BEAST: Data literacy newsgame | Knight News Challenge 2012

One of my favourites among my entries in the Knight News Challenge: Data.

MARK OF THE BEAST | Data literacy newsgame

1. What do you propose to do? [20 words]
Mark of the Beast will tag people with temporary barcode tattoos and RFID as part of a data literacy newsgame.

2. How will your project make data more useful? [50 words]
Mark of the Beast will teach participants about good and bad data handling, hygiene and techniques by making them participants in its dissemination and management through a series of small tasks and exercises. Giving journalists and others data skills will elevate the quality of data-driven journalism, and conduct around data.

3. How is your project different from what already exists? [30 words]
Data education can be dry and tedious, especially with large data sets. Empowering people to have fun with their own and peers’ data can foster a better data journalism culture.

4. Why will it work? [100 words]
Mark of the Beast will succeed because humans are social and learn best through games. It’s how all of us start.
Participants mark themselves with a barcode tattoo and RFID tags to scan and check in themselves and each other performing various tasks. It is a form of quantified-self data collection. That data is recorded and made available as part of the game. Participants can compete for achievements and rank. Tasks and milestones include processing and handling data as a journalist does. News and data journalism are a feature of the in-game world. Participants have fun, and learn skills.

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net.

Support:
Advisors:

  • K. Khan: User experience strategist and designer, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group
  • M. Saniga, CA: President and co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.
  • K. Seto: Founder of Massive Damage game studio and Endloop Mobile. Built Please Stay Calm, a popular location based game.

6. What part of the project have you already built? [100 words]
Mark of the Beast is in the research and planning stage, consulting with domain experts and iterating the project outline.

7. How would you use News Challenge funds? [50 words]
Mark of the Beast would produce a proof-of-concept prototype and test run(s) of the game.

8. How would you sustain the project after the funding expires? [50 words]
A freemium model and traditional investment avenues appear to be the most likely options at this time. There are also several public and private funds available in Ontario and Canada focused on digital media and game development, as well as university lab resource support.

Requested amount: $180,000
Expected number of months to complete project: 12
Total Project Cost: $240,000
Name: Saleem Khan
Twitter: @saleemkhan
Email address [optional]:
Organization: Technovica
City: Toronto
Country: Canada
How did you learn about the contest? I have followed and participated in the News Challenge for years.

KNC12 DATA | DATABLE: Data based dating | Knight News Challenge 2012

One of my (crazier) entries for the Knight News Challenge: Data.

DATABLE | Data based dating

1. What do you propose to do? [20 words]
Datable is the digital dating service for data journalists and other data nerds to find data love.

2. How will your project make data more useful? [50 words]
Data professionals need compatible collaborators in work and life, and the research shows that people who are happy in their work and personal lives are more productive. Enabling people to produce great work, and balance it with a fulfilling life is a win all around.

3. How is your project different from what already exists? [30 words]
Datable will use participant-supplied data to match them for compatibility as data professionals, collaboration, and socially. There are plenty of dating sites and services, and plenty of professional networking sites and services, but none that combine both and add a data focus!

4. Why will it work? [100 words]
Datable will work because we are humans, not robots. People are increasingly disconnected due to days dominated by technology-mediated interactions, and they yearn for human contact. This is supported by the prevalence of technology and interest-related meetups and offline activity events.
Furthermore, Datable will fulfill a professional need for isolated data workers to connect and collaborate in an informal context. Anyone who has ever been to a professional conference or the like knows that the best conversations, ideas and work occur in the informal social settings.
Finally, Datable will be fun. When work is fun, we are more productive.

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net. Saleem also likes to ride his mountain bike, watch movies, eat, sleep, and has been told he’s witty, fun, has great hair, fingernails, and style.
[I would include other people here but am not sure they want to be mentioned.]
Tip of the hat to Max Shron, data strategist formerly with dating site OKCupid.

6. What part of the project have you already built? [100 words]
Datable is purely a concept that has been discussed among a handful of data enthusiasts.

7. How would you use News Challenge funds? [50 words]
Datable would use the News Challenge funds to seed a proof-of-concept prototype and test.

8. How would you sustain the project after the funding expires? [50 words]
As with many dating sites, advertising and a freemium model, along with traditional investment avenues, would apply. The data aspect may afford other opportunities, such as data services and products. This would be part of the exploration.

Requested amount: $85,000
Expected number of months to complete project: 12
Total Project Cost: $125,000
Name: Saleem Khan
Twitter: @saleemkhan twitter.com/saleemkhan
Email address [optional]:
Organization: Technovica
City: Toronto
Country: Canada
How did you learn about the contest? The street, like everyone else.

KNC12 DATA | METANEWS: Story as data to metastory | Knight News Challenge 2012

One of my entries for the Knight News Challenge: Data.

METANEWS: Story as data to metastory

1. What do you propose to do? [20 words]
Metanews is a big-data mining tool that parses existing news to discover data-points that reveal deeper stories.

2. How will your project make data more useful? [50 words]
News organizations possess gigantic troves of data that sit idle — and largely forgotten — unless topical new or breaking stories require historical context. It’s the journalism equivalent of dead inventory. Metanews turns that old news into a live asset that can generate new stories.

3. How is your project different from what already exists? [30 words]
Tools and projects to mine data, parse data, and reveal insights exist. None that I have seen since first investigating the project in 1999 solves this problem in journalism.

4. Why will it work? [100 words]
Metanews will work for three reasons:
1. For the first time, news organizations’ stories widely exist in a structured or semi-structured digital form. That eliminates a once cost-prohibitive and intensive effort.
2. Even a few years ago, the technology to try to solve this complex problem didn’t exist in an affordable form. It requires processing power that would have cost hundreds of millions of dollars. It is now available for hundreds of thousands of dollars, and falling.
3. Metanews’ project leader, advisors and technical partners have the diverse experience and skills in journalism, data, visualization, design, software and finance to solve the problem.

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net.

Support:
Advisors:

  • K. Khan: User experience strategist and designer, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group
  • M. Saniga, CA: President and co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.
  • G. Szeto: Software interface designer/strategist for financial and geopolitical risk intelligence sectors, Fellow at the Center for the Advancement of Public Action at Bennington College.

6. What part of the project have you already built? [100 words]
Metanews was newly designed and modeled as a paper prototype in 2012, following an early attempt to solve this problem over a decade ago. The project is now recursively refining the model and building a team to create a proof-of-concept prototype.

7. How would you use News Challenge funds? [50 words]
Funds would primarily be used for two functions and support of the same:
1. Gaining access to large-scale computing resources needed.
2. Compensating skilled workers to implement it.
By far, the substantive portion of funds would be applied to resources.

8. How would you sustain the project after the funding expires? [50 words]
Metanews will employ university resources to which it has access, seek sales and/or funding from news organizations, enlist assistance from the open source development community, and seek angel and/or venture capital investment and grant funding.

Requested amount: $375,000
Expected number of months to complete project: 18
Total Project Cost: $1.2 million
Name: Saleem Khan
Twitter: @saleemkhan http://twitter.com/saleemkhan
Email address [optional]:
Organization: Technovica
City: Toronto
Country: Canada
How did you learn about the contest? Following and participating in the Knight News Challenge for years.

KNC12 DATA | DATAHUB: Data metarepository | Knight News Challenge 2012

One of my entries for the Knight News Challenge: Data.

DATAHUB | Data metarepository

1. What do you propose to do? [20 words]
DataHub is a repository of data repositories, which compiles and curates available databases and datasets for discovery, sharing and collaboration.

2. How will your project make data more useful? [50 words]
The big problem with data isn’t a lack of it; it’s finding it and knowing how to use it. DataHub will help data users to collaboratively find, scrub and share data and access to it.

3. How is your project different from what already exists? [30 words]
The Ujima Project resembles DataHub. It focuses on particular public databases with specific criteria. DataHub is open, social, and would include private databases, datasets, repositories and other tools and resources.

4. Why will it work? [100 words]
There is a real need and hunger in journalism circles for collaborative data sourcing and sharing, especially in non-competitive circumstances, or after the initial news scoop has been won and the originating journalist moves on to other stories.
Conversations at every journalism conference and informal gathering of journalists who work with data always touch on this yearning.
Fulfilling this need by enabling journalists — and other data hounds — to collaborate helps everyone, and would likely lead to more and better data-driven journalism.

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net.

Support:
Advisors:

  • K. Khan: User experience strategist and designer, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group
  • M. Saniga, CA: President and co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.
  • G. Szeto: Software interface designer/strategist for financial and geopolitical risk intelligence sectors, Fellow at the Center for the Advancement of Public Action at Bennington College.

6. What part of the project have you already built? [100 words]
DataHub is in the concept phase, and is gathering professionals’ feedback (all positive) and suggestions for improvement.

7. How would you use News Challenge funds? [50 words]
DataHub would invest any funds into product development and ancillary support.

8. How would you sustain the project after the funding expires? [50 words]
A freemium pricing model, paid features for participants to highlight their data, traditional capital investment and grant funding, as well as ancillary merchandise for sale.

Requested amount: $250,000
Expected number of months to complete project: 12
Total Project Cost: $350,000
Name: Saleem Khan
Twitter: @saleemkhan twitter.com/saleemkhan
Email address [optional]:
Organization: Technovica
City: Toronto
Country: Canada
How did you learn about the contest? I have been following and participating in the Knight News Challenge for years.

KNC12 DATA | DATAFRAME: Distributed data journalism microwork | Knight News Challenge 2012

One of my entries for the Knight News Challenge: Data.

DATAFRAME | Distributed data journalism microwork

1. What do you propose to do? [20 words]
DataFrame is a platform or framework that lets people help journalists process large datasets through microwork.

2. How will your project make data more useful? [50 words]
Journalists often have to scrub, mine, highlight, cherry-pick, search for patterns and correlations, and perform other labour-intensive tasks on large data sets. DataFrame will farm out small chunks of data to the public to process, and thereby increase the speed and quality of news it generates.
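
To make the idea of farming out small chunks concrete, here is a minimal sketch of how a large dataset might be cut into microtasks. It is an illustration only: the function name make_microtasks, the chunk size and the redundancy parameter are assumptions, not part of the proposal. Issuing each chunk more than once is a design choice added here so that independent volunteers' answers could be compared for quality.

```python
from itertools import islice


def make_microtasks(rows, chunk_size=25, redundancy=2):
    """Yield (task_id, chunk) pairs from any iterable of rows.

    Each chunk is issued `redundancy` times (an assumption of this sketch)
    so that answers from different volunteers can be cross-checked.
    """
    iterator = iter(rows)
    chunk_no = 0
    while True:
        chunk = list(islice(iterator, chunk_size))
        if not chunk:
            break
        for copy in range(redundancy):
            yield (f"chunk{chunk_no}-copy{copy}", chunk)
        chunk_no += 1


# Example: 60 rows become 3 chunks, each handed to 2 volunteers.
tasks = list(make_microtasks(range(60), chunk_size=25, redundancy=2))
print(len(tasks))
print(tasks[0][0], len(tasks[0][1]))
```

Because the function reads rows lazily, it would work on a file or database cursor without loading the whole dataset into memory.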

3. How is your project different from what already exists? [30 words]
Other examples of this kind of microwork involve specialized, dedicated tasks, such as the search for extraterrestrial intelligence or protein folding. DataFrame would enable more differentiated, low-load tasks for news.

4. Why will it work? [100 words]
The distributed microwork model has been proven over and over again in multiple instances, as noted above, as well as in journalism. The Guardian’s MP expenses investigation is a recent example. People like to contribute to a public good, especially when it takes minimal effort, and especially when they can make a game of it with peers.

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]; founder, invstg8.net.

Support:
Advisors:

  • K. Khan: User experience strategist and designer, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group
  • M. Saniga, CA: President and co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.
  • G. Szeto: Software interface designer/strategist for financial and geopolitical risk intelligence sectors, Fellow at the Center for the Advancement of Public Action at Bennington College.

6. What part of the project have you already built? [100 words]
DataFrame is in the concept phase and is gathering feedback and insights on the plan from experts and ordinary citizens alike. A paper/visual prototype and Web-based mock-up are the next steps.

7. How would you use News Challenge funds? [50 words]
Funding will be applied to building a small, dedicated team to rapidly progress from prototype to product, and attendant costs.

8. How would you sustain the project after the funding expires? [50 words]
Once the product is developed, maintenance costs should be minimal. The project would follow a freemium model, with larger news organizations paying for a licence, pay-per-use or volume fee schedule. DataFrame would also pursue angel or venture investment and additional grant funding. Ancillary merchandise will also be offered for sale to users.

Requested amount: $190,000
Expected number of months to complete project: 12
Total Project Cost: $250,000
Name: Saleem Khan
Twitter: @saleemkhan twitter.com/saleemkhan
Email address [optional]:
Organization: Technovica
City: Toronto
Country: Canada
How did you learn about the contest? I have been following and participating in the Knight News Challenge for years.

KNC12 DATA | INVSTG8.NET: Deep data sourcing via simple data matching | Knight News Challenge 2012

One of my entries for the Knight News Challenge: Data.

INVSTG8.NET | Deep data sourcing via simple data matching

1. What do you propose to do? [20 words]
Invstg8.net aims to create a keyword/concept matching tool to help connect journalists to story-crucial peer-sourced data.

2. How will your project make data more useful? [50 words]
Invstg8.net enables journalists to get access to, and use, otherwise inaccessible/unused, crucial data for stories in a highly targeted manner.

3. How is your project different from what already exists? [30 words]
The invstg8.net tool uses simple data in a trusted, secured framework to focus on the human connection to obtaining deep, targeted, existent but unsurfaced data.

4. Why will it work? [100 words]
Invstg8.net eliminates structural, practical, technological and financial barriers by linking journalists anywhere, who find it impossible to get data, to peers with access.
The invstg8.net tool will filter and match journalists seeking data with those likely to have access to it, in a highly targeted manner.
Invstg8.net will accelerate reporting on stories it would take months or years to tell, or that wouldn’t be told at all for lack of data.
Invstg8.net was inspired by an African journalist unable to obtain data crucial to his foreign mining investigation. He said it would take him months to get data I could get in minutes.
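
The filtering and matching described above could, in its simplest form, look like the sketch below: a request's keywords are compared with the keywords each peer has registered, and peers are ranked by how many they share. This is a hypothetical illustration, not the invstg8.net code. The function name match_requests, the peer names and the plain keyword-overlap ranking are all assumptions, and a real engine would also need concept matching and the trusted, secured framework the proposal calls for.

```python
def match_requests(request_keywords, peers, min_overlap=1):
    """Rank peers by the number of keywords they share with a request.

    `peers` maps a peer identifier to the keywords that peer registered.
    Returns a list of (peer, shared_keywords), best match first.
    """
    wanted = {k.lower() for k in request_keywords}
    ranked = []
    for name, keywords in peers.items():
        shared = wanted & {k.lower() for k in keywords}
        if len(shared) >= min_overlap:
            ranked.append((name, sorted(shared)))
    # Most shared keywords first; ties broken alphabetically by peer name.
    ranked.sort(key=lambda item: (-len(item[1]), item[0]))
    return ranked


# Made-up peers and keywords for illustration.
peers = {
    "peer_a": ["mining", "Canada", "securities"],
    "peer_b": ["health", "Canada"],
    "peer_c": ["sports"],
}
print(match_requests(["Mining", "canada"], peers))
```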

5. Who is working on it? [100 words]
Saleem Khan: Project leader, journalist [editor and reporter, ex- CBC, Metro International, Toronto Star newspapers; chairman/director, Canadian Association of Journalists]; advisor, University of Toronto ThingTank Lab [Faculty of Information]

Support:
Advisors:

  • K. Khan: User experience strategist and designer, OCAD University sLab advisor; leader of UXI, Canada’s largest UX professionals group
  • H. Leson: Director of community engagement, Ushahidi; open source community developer, library and information technician.
  • M. Saniga, CA: President and co-founder, near-realtime business intelligence/data insight generation software firm Quant Inc.; former finance director and manager at Cara, Dell.

Code team:
Various ad hoc/pro bono.

6. What part of the project have you already built? [100 words]
A functioning development-environment prototype of the query, receipt and request-handling mechanism that accepts open SMS has been built. Code is available at http://j.mp/skinvstg8netgit [ Github.com/saleemkhan/invstg8net ].
The keyword/concept based request engine that forms the core of invstg8.net is still needed.

7. How would you use News Challenge funds? [50 words]
Funds would be applied toward development of the invstg8.net keyword/concept request-matching engine and related support. At its core, this means hiring dedicated developers to advance the project at a pace far faster than the current, incremental rate of progress.

8. How would you sustain the project after the funding expires? [50 words]
We have encountered venture capitalists, private companies, NGOs and news organizations that have expressed interest in investing in invstg8.net once a working prototype is available, or in purchasing beta or release-candidate licences.
We will also sustain the project by selling consulting services and training, a freemium pricing model, and ancillary merchandise.

Requested amount: $350,000
Expected number of months to complete project: 12
Total Project Cost: $525,000
Name: Saleem Khan
Twitter: @saleemkhan http://twitter.com/saleemkhan
Email address [optional]: saleem.khan@invstg8.net
Organization: invstg8.net / Technovica
City: Toronto
Country: Canada
How did you learn about the contest? I have been following and participating in the Knight News Challenge for years.