Luminoso, who turn unstructured text data into business-critical insights, announced the newest features of ConceptNet, an open data semantic network whose development is led by Luminoso Chief Science Officer Robyn Speer. ConceptNet originated from MIT Media Lab’s Open Mind Common Sense project more than two decades ago, and the semantic network is now used in AI applications around the world. ConceptNet is cited in more than 700 AI papers in Google Scholar, and its API is queried over 500,000 times per day from more than 1,000 unique IPs. Luminoso has incorporated ConceptNet into its proprietary natural language understanding technology, QuickLearn 2.0. ConceptNet 5.8 features:
Continuous deployment: ConceptNet is now set up with continuous integration using Jenkins and deployment using AWS Terraform, which will make it faster to deploy new versions of the semantic network and easier for others to set up mirrors of the API.
Additional curation of crowd-sourced data: ConceptNet’s developers have filtered entries from Wiktionary that were introducing hateful terminology to ConceptNet without its context. This is part of their ongoing effort to prevent human biases and prejudices from being built into language models. ConceptNet 5.8 has also updated its Wiktionary parser so that it can handle updated versions of the French and German-language Wiktionary projects.
HTTPS support: Developers can now reach ConceptNet’s website and API over HTTPS, improving data transfer security for applications using ConceptNet.
The best way to start the Gilbane conference is by attending one or two of the pre-conference workshops offered on Tuesday, November 27, 9:00 am to 4:00 pm:
- Insider’s Guide to Selecting WCM Technology – Tony Byrne & Irina Guseva, Real Story Group
- Implementing Systems of Engagement: Making it Work with the Team That Will Make it Work – Scott Liewehr & Rob Rose, Digital Clarity Group
- So You Want to Build a Mobile Content App? – Jonny Kaldor, Kaldor Group (creators of Pugpig)
- Content Migrations: A Field Guide – Deane Barker, Blend Interactive & David Hobbs, David Hobbs Consulting
- Social Media: Creating a Voice & Personality for Your Brand – AJ Gerritson, 451 Marketing
- Text Analytics for Semantic Applications – Tom Reamy, KAPS Group
See the schedule and full descriptions of the in-depth pre-conference workshops.
Please save the date and check http://gilbaneboston.com for further information about the main conference schedule & conference program as they become available.
Intercontinental Boston Waterfront Hotel
510 Atlantic Avenue
Boston, Massachusetts 02210
MuseGlobal announced a partnership with Specialty Systems, Inc., a company focusing on innovative information systems solutions to Federal, State and Local Government customers. Specialty Systems, Inc. is partnering with MuseGlobal to provide the systems integration expertise to engineer law enforcement and homeland security applications built on MuseGlobal’s MuseConnect, which provides federated search and harvesting technologies, with a library of more than 6,000 pre-built source connectors. The applications resulting from this partnership will incorporate unified information access allowing structured data from database sources; semi-structured data from spreadsheets, forms and XML sources; unstructured data from web sites, documents, email; and rich media such as images, video and audio information to be accessed simultaneously from internal databases and external sources. This information is gathered on the fly, and unified for immediate presentation to the requestor. http://www.specialtysystems.com, http://www.museglobal.com
Autonomy Corporation plc (LSE:AU. or AU.L) unveiled a new Expertise Location module for its iManage Universal Search (IUS) solution, the company’s pan-enterprise search solution tailored for law firms. With this new module, lawyers can connect with other subject matter experts within the law firm, enabling the firm to efficiently staff matters and match the right talent with the needs of a case. Powered by Autonomy’s Meaning Based Computing platform, IDOL, the IUS Expertise Location module supports a law firm’s complete practice support needs on a single platform. IUS leverages IDOL’s ability to automatically build conceptual profiles of users based on the understanding of information. As lawyers interact with information as part of their daily routine in the iManage WorkSite document and email management system or IUS such as create or view documents, respond to emails, create time entry systems, etc., IUS can automatically build a profile of the user in real-time. As a result, IUS can identify experts within the law firm and connect their expertise with the most relevant information from multiple content repositories. With IDOL’s meaning-based approach, a user can find an expert even if the exact search terms are not included in the users’ profile. For example, searching for an expert in ‘copyright law’ will also show experts in conceptually related areas, such as ‘trademark’. The Expertise Location module is an add-on to iManage Universal Search and is available now. http://www.autonomy.com/imanage
As the Internet continues to redefine ubiquitous, the issue of cross language search becomes more critical. It’s a pervasive challenge with extreme scalability requirements. Hard to imagine, but the Internet will be full by about 2010 according to the American Registry for Internet Numbers. ARIN’s recommendation for IPv6 demonstrates the potential breadth of information overload.
Organizations such as the European-based Cross-Language Evaluation Forum (CLEF) have moved beyond discussion and into in-depth testing on cross-language search for many years. With its “Leaping over Language Barriers” announcement, Google has moved beyond experimentation and toward productization of its cross-language search feature.
- The Wall Street Journal’s Jessica Vascellaro weighs in here, and includes commentary on rival strategies from Yahoo and Microsoft.
- Google Blogoscoped weighs in here.
- Clay Tablet’s Ryan Coleman weighs in here.
- Global by Design’s John Yunker has a review here.
- And from Google themselves, here’s the beta UI, the FAQ, and the “unveiling” at the company’s Searchology event held earlier this month.
IMO, any discussion of what the interconnected world “looks like” in the future, whether focused on fill in your label here 2.0, social networking, customer experience, global elearning, etc., (should) eventually drill-down to translation and localization issues. Once we’re at that level of conversation, there’s more challenges to discuss — the ongoing evolution of automated translation, the balance between human and machine translation, the conundrum of rich media and image translation, and as Kaija will always remind us, the quality and context of search results as opposed to merely the quantity.
As a researcher, I’ve used Google’s “translate this” functionality and Yahoo’s Babel Fish (originally AltaVista’s) numerous times to “get the gist” of a non-English article. But my reliance on the results has been more for sanity-checking trends than for factual data gathering. Inconsistencies skew the truth. I just can’t trust it. Can we trust this? Time will tell. Is it a step in the right direction for the masses? No doubt.