The open source software movement raises difficult questions for CIO’s:
- Is open source software “free”?
- If not, what are its costs and risks?
- Does using open source software save time in deploying an application?
- What uses are best suited to open source software?
The answer to all of these questions is, unfortunately, “it depends”. Using open source software effectively depends on the type of application and on the expertise of the developers. It also requires the same kinds of trade-offs that are necessitated by any choice of software: how customized does it have to be? How accurate? How scalable? How usable and for which types of users? This is particularly true in the realm of search and text analytics because both of these applications are language dependent, with all the nuances, variety and complexity that language brings.
We find widespread use of open source components by commercial software vendors. They use open source search or text analytics as a starting point. Then they add in the vocabularies, domain knowledge, tools and widgets, connectors to other applications and information stores, process knowledge and user interaction design to create usable and scalable applications that are suited to a specific purpose. We also find sophisticated enterprises with enough skilled developers, computational linguists, and interaction designers using open source software to give them the custom applications they need. There is no doubt that as open source applications have become more robust and the tools to use them have become available that they are an attractive alternative for many enterprises. But are they “free?” Not if you consider the time, labor and expertise needed to make them an integral, useful part of the enterprise stack.
I’ll be chairing a one-day program on open source search software on Nov. 6th in Chantilly, VA, near Washington, DC that will discuss these questions. We’ve invited some major open source search developers from Elastic Search, Sphinx, Lucene, Solr, as well as vendors who have embedded open source software in their products. Practitioners will discuss their experience with developing applications using open source as well. Eric Brown, Director of Research for IBM Watson, which embeds multiple open source products, will give the keynote, and Donna Harman from NIST’s TREC will discuss how to evaluate search effectiveness. Government employees can register for the event free. Others will get a discount on the registration fee by entering feldman2013 when they register.
In addition, we are collecting data on use of both commercial and open source search and text analytics and are hoping that you will fill in our survey. Results will be tabulated, and all respondents will receive a summary of what we find. You can find the survey at: https://www.surveymonkey.com/s/Synthexis
I hope to see you in November.