Bullshit Detector Prototype Goes Live

I like writing about cool applications of technology that are so pregnant with the promise of the future, that they have to be seen to be believed, and here’s another one that’s almost ready for prime time.

TruthTeller PrototypeThe Washington Post today launched an exciting new technology prototype invoking powerful new technologies for journalism and democratic accountability in politics and government. As you can see from the screenshot (left), it runs an automated fact-checking algorithm against the streaming video of politicians or other talking heads and displays in real time a “True” or “False” label as they’re speaking.

Called “Truth Teller,” the system uses technologies from Microsoft Research and Windows Azure cloud-computing services (I have included some of the technical details below).

But first, a digression on motivation. Back in the late 1970s I was living in Europe and was very taken with punk rock. Among my favorite bands were the UK’s anarcho-punk collective Crass, and in 1980 I bought their compilation LP “Bullshit Detector,” whose title certainly appealed to me because of my equally avid interest in politics :)

Today, my driving interests are in the use of novel or increasingly powerful technologies for the public good, by government agencies or in the effort to improve the performance of government functions. Because of my Jeffersonian tendencies (I did after all take a degree in Government at Mr. Jefferson’s University of Virginia), I am even more interested in improving government accountability and popular control over the political process itself, and I’ve written or spoken often about the “Government 2.0″ movement.

In an interview with GovFresh several years ago, I was asked: “What’s the killer app that will make Gov 2.0 the norm instead of the exception?”

My answer then looked to systems that might “maintain the representative aspect (the elected official, exercising his or her judgment) while incorporating real-time, structured, unfiltered but managed visualizations of popular opinion and advice… I’m also a big proponent of semantic computing – called Web 3.0 by some – and that should lead the worlds of crowdsourcing, prediction markets, and open government data movements to unfold in dramatic, previously unexpected ways. We’re working on cool stuff like that.”

The Truth Teller prototype is an attempt to construct a rudimentary automated “Political Bullshit Detector, and addresses each of those factors I mentioned in GovFresh - recognizing the importance of political leadership and its public communication, incorporating iterative aspects of public opinion and crowd wisdom, all while imbuing automated systems with semantic sense-making technology to operate at the speed of today’s real world.

Real-time politics? Real-time truth detection.  Or at least that’s the goal; this is just a budding prototype, built in three months.

Cory Haik, who is the Post’s Executive Producer for Digital News, says it “aims to fact-check speeches in as close to real time as possible” in speeches, TV ads, or interviews. Here’s how it works:

The Truth Teller prototype was built and runs with a combination of several technologies — some new, some very familiar. We’ve combined video and audio extraction with a speech-to-text technology to search a database of facts and fact checks. We are effectively taking in video, converting the audio to text (the rough transcript below the video), matching that text to our database, and then displaying, in real time, what’s true and what’s false.

We are transcribing videos using Microsoft Audio Video indexing service (MAVIS) technology. MAVIS is a Windows Azure application which uses State of the Art of Deep Neural Net (DNN) based speech recognition technology to convert audio signals into words. Using this service, we are extracting audio from videos and saving the information in our Lucene search index as a transcript. We are then looking for the facts in the transcription. Finding distinct phrases to match is difficult. That’s why we are focusing on patterns instead.

We are using approximate string matching or a fuzzy string searching algorithm. We are implementing a modified version Rabin-Karp using Levenshtein distance algorithm as our first implementation. This will be modified to recognize paraphrasing, negative connotations in the future.

What you see in the prototype is actual live fact checking — each time the video is played the fact checking starts anew.

 - Washington Post, “Debuting Truth Teller

The prototype was built with funding from a Knight Foundation’s Prototype Fund grant, and you can read more about the motivation and future plans over on the Knight Blog, and you can read TechCrunch discussing some of the political ramifications of the prototype based on the fact-checking movement in recent campaigns.

Even better, you can actually give Truth Teller a try here, in its infancy.

What other uses could be made of semantic “truth detection” or fact-checking, in other aspects of the relationship between the government and the governed?

Could the justice system use something like Truth Teller, or will human judges and  juries always have a preeminent role in determining the veracity of testimony? Will police officers and detectives be able to use cloud-based mobile services like Truth Teller in real time during criminal investigations as they’re evaluating witness accounts? Should the Intelligence Community be running intercepts of foreign terrorist suspects’ communications through a massive look-up system like Truth Teller?

Perhaps, and time will tell how valuable – or error-prone – these systems can be. But in the next couple of years we will be developing (and be able to assess the adoption of) increasingly powerful semantic systems against big-data collections, using faster and faster cloud-based computing architectures.

In the meantime, watch for further refinements and innovation from The Washington Post’s prototyping efforts; after all, we just had a big national U.S.  election but congressional elections in 2014 and the presidential race in 2016 are just around the corner. Like my fellow citizens, I will be grateful for any help in keeping candidates accountable to something resembling “the truth.”

2012 Year in Review for Microsoft Research

The year draws to a close… and while the banality and divisiveness of politics and government has been on full display around the world during the past twelve months, the past year has been rewarding for me personally when I can retreat into the world of research. Fortunately there’s a great deal of it going on among my colleagues.

2012 has been a great year for Microsoft Research, and I thought I’d link you to a quick set of year-in-review summaries of some of the exciting work that’s been performed and the advances made:

Microsoft Research 2012 Year in Review

The work ranges from our Silicon Valley lab work in “erasure code” to social-media research at the New England lab in Cambridge, MA; from “transcending the architecture of quantum computers” at our Station Q in Santa Barbara, to work on cloud data systems and analytics by the eXtreme Computing Group (XCG) in Redmond itself.

Across global boundaries we have seen “work towards a formal proof of the Feit-Thompson Theorem” at Microsoft Research Cambridge (UK), and improvements for Bing search in Arab countries made at our Advanced Technology Labs in Cairo, Egypt.

All in all, an impressive array of research advance, benefiting from an increasing amount of collaboration with academic and other researchers as well. The record is one more fitting tribute to our just-departing Chief Research and Strategy Officer Craig Mundie, who is turning over his reins including MSR oversight to Eric Rudder (see his bio here), while Craig focuses for the next two years on special work reporting to CEO Steve Ballmer. Eric’s a great guy and a savvy technologist, and has been a supporter of our Microsoft Institute’s work as well … I did say he’s savvy :)

There’s a lot of hard work already going on in projects that should pay off in 2013, and the New Year promises to be a great one for technologists and scientists everywhere – with the possible exception of any remaining Mayan-apocalypse/ancient-alien-astronaut-theorists. But even to them, and perhaps most poignantly to them, I say Happy New Year!

Tech Trip to Argentina

With Luis Ruvira, President of the Argentine American Dialogue Foundation, after my speech at the Argentine Council on International Relations

I’m traveling in Argentina this week, on a trip sponsored by the U.S. Department of State in their official Speaker’s Program. The U.S. Embassy in Buenos Aires had requested of DoS an American technology speaker “who can talk about technology innovation and bleeding edge kinds of things.  The goal is to highlight the role that innovation and technology plays in creating a better society.” I was delighted to accept the invitation when asked by my friend Lovisa Williams of the State Department’s Internet Steering Committee.

Most of the trip is being spent in Buenos Aires, second largest city in South America – so large it is constitutionally recognized as an autonomous federal entity alongside the 23 Argentinian provinces, with its own government ministers and municipal administration. I am also enjoying side visits to Rosario and La Plata, large cities and provincial capitals. I’ll write about several aspects of the trip separately.

Working together to cram in a series of whirlwind meetings have been my excellent co-hosts, the U.S. Embassy and the respected Argentine American Dialogue Foundation. Below are the highlights of the visit, plucked from my official agenda:

Monday 9/19: Meeting with the Minister of Education for Buenos Aires city and visit to the Escuela Gauchos de Guemes school which is studying the social and educational benefits of having given each child their own netbook. Tour of the Universidad Abierta Interamericana (UAI) (the Open InterAmerican University), visiting their robotics labs, meetings with engineering students, and a separate meeting with authorities from the university and national civil servants. Meeting with Pedro Janices, National Director at the National Office for Information Technologies (executive-branch component of the President’s Office; Pedro has been called “the Argentine CIO,” and has worked with the first U.S. CIO Vivek Kundra.)

Tuesday 9/20: Public speech at the Argentina Council for International Relations (CARI, one of the most important think tanks in Latin America), topic: “Governments 2.0 and the impact of new technologies.” Lecture at the American Club of Buenos Aires, with participating companies from the American Chamber of Commerce of Argentina, members from the academic sector and public servants (including the Head of International Relations of the National Ministry in Science and Technology). Tour of the Supreme Court of Argentina, meeting with Deputy Chief Justice Highton, who was the first woman appointed to the Court (under a democratic government).  Videoconference lecture on “Innovation and Government” at the National Technological University (UTN), transmitted live to 13 campuses of the University in the interior of the country.

Wednesday 9/21: Trip to Rosario, second largest city in Argentina and capital of Santa Fe Province. Visit and tour of largest tech firm in Rosario, Neoris; lunch with Neoris Latin American President Martin Mendez.  Meeting with the Secretary of Production and Local Development for the city of Rosario, subject “Creating conditions for local technology-industry growth.” Meeting with Rocio Rius of the Fundacion Nueva Generación Argentina (Argentina New Generation Foundation). Lecture at the Universidad Abierto Interamericana (UAI) campus in Rosario on new technologies and their impact on government; audience of authorities and students from UAI and other universities, faculty from the Engineering School, and also local public servants.

Thursday 9/22: Trip to La Plata, capital city of Buenos Aires Province.  Meeting with Governor Daniel Scioli (Vice President of Argentina 2003-2007) and other provincial civil servants, including Undersecretary of Institutional Relations, Director of Interministerial Relations, and Chief of Cabinet.  Public Lecture at the National University of La Plata, guest of Dean of the Informatics Faculty.

Friday 9/23: Participate in opening ceremonies in Buenos Aires of the IX Congreso Internacional en Innovación Tecnológica Informática (CIITI, Ninth International Congress on IT Innovation). Visit to Universidad Argentina de la Empresa, (UADE, Argentine University of Enterprise), meetings with faculty/students from Government, Law, and Engineering departments, and tours of laboratories. Lecture at the American Club of Buenos Aires. Meeting with Director of the Business School at Argentine Catholic University, and Dean of the Faculty of Economic Sciences. Private meeting at Embassy with U.S. Ambassador Vilma Martinez. Panel speaker on “Ciberculture Y Gobierno” (Cyber-culture and Government) at the IX Congreso CIITI with international panel.

I’ve been on several other State Department-sponsored trips before (to Mexico and, many years ago near the end of the Cold War, to the Soviet Union), but I must say that this frenetically busy jaunt through lovely Argentina may be my favorite. I’ll write more over the next few days.

Share this post on Twitter

Email this post to a friend

MSR gets wired, WIRED gets MSR

MS Research in natural-user-interaction technologies
MSR natural-user-interaction immersive technologies

WIRED Magazine’s online site ran a great long profile of Microsoft Research late yesterday, with interviews and project features: “How Microsoft Researchers Might Invent a Holodeck.”

I have written about or mentioned all of the individual projects or technologies on my blog before, but the writing at WIRED is so much better than my own – and the photographs so cool – that I thought I should post a link to the story. Continue reading

Virtual recipe stirs in Apple iPad, Microsoft Kinect

Who says Apple and Microsoft can’t work together?  They certainly do, at least when it involves the ingenuity of their users, the more inventive of whom use technologies from both companies (and others).

Here’s a neat example, “a just-for-fun experiment from the guys at Laan Labs” where they whip up a neat Augmented Reality recipe: take one iPad, one Kinect, and stir:

Some technical detail from the Brothers Laan, the engineers who did the work:

We used the String Augmented Reality SDK to display real-time 3d video+audio recorded from the Kinect. Libfreenect from http://openkinect.org/ project was used for recording the data coming from the Kinect. A textured mesh was created from the calibrated depth+rgb data for each frame and played back in real-time. A simple depth cutoff allowed us isolate the person in the video from the walls and other objects. Using the String SDK, we projected it back onto a printed image marker in the real world.” - source, Laan Labs blog.

As always, check out http://www.kinecthacks.com/ for the latest and greatest Kinect hacks – or more accurately now, the latest cool uses of the openly released free Kinect SDK, available here.

There are several quiet projects underway around the DC Beltway to make use of the SDK, testing non-commercial but government-relevant deployments – more detail and examples at the appropriate time. We will eventually release a commercial SDK with even more functionality and higher-level programming controls, which will directly benefit government early adopters.

In the meantime, I may report on some of the new advances being made by our research group on Computational User Experiences, who “apply expertise in machine learning, visualization, mobile computing, sensors and devices, and quantitative and qualitative evaluation techniques to improve the state of the art in physiological computing, healthcare, home technologies, computer-assisted creativity, and entertainment.” That’s a rich agenda, and the group is in the very forefront of defining how Natural User Interaction (NUI) will enhance our personal and professional lives….

Share this post on Twitter

Email this post to a friend

The almighty ampersand linking R and D

According to Wikipedia, the lowly ampersand or “&” is a logogram representing the conjunction word “and” using “a ligature of the letters in et,” which is of course the Latin word for “and.”

In my line of work I most frequently encounter the ampersand in the common phrase “R&D” for research and development, although I notice that with texting and short-form social media the ampersand is making something of a comeback in frequency of use anyway.

Continue reading

Kinecting Communities

On April 16 I will be speaking at the Mobile Citizen Summit in Washington DC (registration still open), which brings together “practitioners across the  government, nonprofit, advocacy, and political spaces—the kinds of  people who develop the strategy and the tools to reach, engage, educate,  and enable citizens across the country and around the world.”

But I’m going to be talking about “mobile” in a different way than others still use the term, i.e. they focus on a handheld device, while I will be focusing on the mobile citizen. As I have said before I don’t believe our future involves experiencing “augmented reality” by always holding up little 3-inch plastic screens in front of our faces. Natural user interfaces and immersive computing offer much more to how we access computational resources – and how technology will help us interact with one another. Here’s an example, in a story from the past week.

Continue reading

Air Everything

Like many people, I was very impressed by a video over the weekend of the Word Lens real-time translation app for iPhone.  It struck with a viral bang, and within a few days racked up over 2 million YouTube views. What particularly made me smile was digging backwards through the twitter stream of a key Word Lens developer whom I follow, John DeWeese, and finding this pearl of a tweet (right) from several months ago, as he was banging out the app out in my old stomping grounds of the San Francisco Bay Area. That’s a hacker mentality for you :)

But one thought I had in watching the video was, why do I need to be holding the little device in front of me, to get the benefit of its computational resources and display? I’ve seen the studies and predictions that “everything’s going mobile,” but I believe that’s taking too literally the device itself, the form-factor of a little handheld box of magic.

Continue reading

The wikileaks label ticks off Wikipedia cofounder

Note from Lewis: I have commented on the latest Wikileaks outrage elsewhere (Facebook, Twitter), making clear my thoughts for what they’re worth.  Briefly, they summarize in pointing out that the U.S. Government has now allowed a dynamic to emerge without challenge: an “acceptable” intermediary between Traitor and Public. The original insider-threat individual who ripped the 251,000 cables and all the other previously leaked Iraq war data would likely not have been able to simply provide that to the New York Times personally and have it immediately published; they might have turned him in themselves. But the miraculous creation of a self-appointed, self-sanctified group like Wikileaks has allowed motivated groups like the Times & the UK’s Guardian to proclaim that their hands are clean. I find it outrageous. But the government did not press the point after the first major release (Iraq war data) with any forceful intent, so now we’re simply going to see this continue – until an Administration gets serious with criminal charges including treason for anyone involved, right up the chain of those stealing/mediating/publishing classified information.

 An online friend, Larry Sanger, today posted some very thoughtful remarks from a unique perspective – as a cofounder of Wikipedia who obviously is offended among other things by the misleading use of “Wiki-” in the Wikileaks name.  But he makes some other profound points as well. He offered to have them reposted, which I have done below. Reader comments are welcome, either below or as always by email.

 Continue reading

Tearing the Roof off a 2-Terabyte House

I was home last night playing with the new Kinect, integrating it with Twitter, Facebook, and Zune. Particularly because of the last service, I was glad that I got the Xbox 360 model with the 250-gigabyte (gb) hard disk drive. It holds a lot more music, or photos, and of course primarily games and game data.

So we wind up with goofy scenes like my wife zooming along yesterday in Kinect Adventures’ River Rush – not only my photo (right) but in-game photos taken by the Kinect Sensor, sitting there below the TV monitor.

Later as I was waving my hands at the TV screen, swiping magically through the air to sweep through Zune’s albums and songs as if pawing through a shelf of actual LP’s, I absent-mindedly started totting up the data-storage capacity of devices and drives in my household.  Here’s a rough accounting:

  • One Zune music-player, 120gb;
  • 2 old iPods 30gb + 80gb;
  • an iPad 3G at 16gb;
  • one HP netbook 160gb;
  • an aging iMac G5 with 160gb;
  • three Windows laptops of 60gb, 150gb, and 250gb;
  • a DirecTV DVR with a 360gb disk;
  • a single Seagate 750gb external HDD;
  • a few 1gb, 2gb, and a single 32gb SD cards for cameras;
  • a handful of 2gb, 4gb, and one 16gb USB flash drives;
  • and most recently a 250gb Xbox 360, for Kinect. 

All told, I’d estimate that my household data storage capacity totals 2.5 terabytes. A terabyte, you’ll recall, is 1012 bytes, or 1,000,000,000,000 (1 trillion) bytes, or alternately a thousand gigabytes.

Continue reading

Follow

Get every new post delivered to your Inbox.

Join 6,248 other followers

%d bloggers like this: