Tag Archives: databases

Crowdsourced hate speech database could spot early signs of genocide

The use of hate speech to dehumanize people is widely recognized as one of the first steps towards genocide. From Rwanda, where Hutu radio stations blared out propaganda referring to Tutsis as “cockroaches,” to Nazi Germany, where Jews were likened to a disease that needed to be cleansed from society, hate speech has been a clear warning sign of terrible things to come.

Hatebase, a new crowdsourced database of multilingual hate speech from The Sentinel Project, is an attempt to create a repository of words and phrases that researchers can use to detect the early stages of genocide.

“How many people outside of Sri Lanka know that ‘sakkiliya’ is a Sinhala term used to refer to a Tamil person as ‘a very unhygienic or uncultured person’,” Christopher Tuckwood, executive director of The Sentinel Project, told Wired.co.uk. “Hatebase helps us to know what to look for and to make sense of what we see.”

Read 6 remaining paragraphs | Comments

Native JSON features submitted for PostgreSQL 9.2



The upcoming 9.2 release of PostgreSQL could introduce a native JSON data type and support for returning JSON-formatted query results. The patches that implement the JSON functionality were submitted by contributors Robert Haas and Andrew Dunstan in the open PostgreSQL CommittFest.

The patches, which are currently awaiting review, highlight the important role that JSON plays in Web programming. JSON is growing in popularity as a data interchange format and is increasingly favored over XML as the standard output for Web application APIs.

Haas implemented a core JSON type that does basic validation. Dunstan expanded on that by implementing query_to_json, record_to_json, and array_to_json functions that can be used in SQL queries to generate structured JSON responses.

These features could be useful for Web application developers who are implementing API endpoints that return JSON-formatted database records. The features could also be useful in cases where developers want to be able parse database output into structured objects in their code without having to introduce the overhead of full-blown object-relational mapping.

Native JSON storage is one of the features that helped popularize CouchDB, a document-oriented database that has become one of the favorites of the NoSQL crowd. The JSON features proposed for PostgreSQL obviously won’t provide fully equivalent capabilities, but certainly boost PostgreSQL’s usefulness.

Read the comments on this post