Opening IT Up: Using Open Source Software
Carla Schroer, Cultural Heritage Imaging, (A California Nonprofit Corp.) 20 years of open-source experience.Carla at c-h-i dot org, http://www.c-h-i.org
About me: 20 years in Silicon Valley, 13 at Sun., etc.
Open Source is a licensing model for software. It tells you nothing about the actual software, how it was developed, whether support is available, etc.
Open source licenses have generally been approved by OSI (open source initiative). REquires free distribution, available source code, and people are allowed to modify source code. There are several major types of license:
Permissive Licenses (BSD,MIT,Apache)
Copyleft (Mozilla, GGLGPL, Eurpoean Union Public License (EUPL)
Strong copyleft licenses - Gnu General Public License GPL.
What distinguishes? What you get to do with code downstram. Under a permissive code, I can take and relicense under another license, etc.
A copyleft license, by contrast, I have to release any modifications under same license. Can't go closed/into commercial thing. THere are two flavors. In lightweight, it's file based. So I could fix bugs in one file and release it. (But could release plugin under any license I chose.)
Strong copyleft licenses mean that it is viral.Anything that touches licensed code required to have same license. Strong copyleft licenses are project based and can effect code beyond the original licensed code.
Update: Carla wrote me a note clarifying her point on this:
Permissive license allows much use with few restrictions, Copyleft ensure that work based on license remains open.
These are not all compatible -- you can't combine some of them. Software freedom law center writes a lot about this. There are very few legal precedents, so some is still up in air.
Choosing software: You should ask the same questions you would ask for Open Source as for non-open source. An important thing is what is the cost of switching if it doesn't work out. Also what's the TCO? The cost of the license is one factor. But you still have all the other costs -- training, sustaining, maintenance, etc.
Probably the most controversial thing I'll say this AM. I believe that open source can help mitigate concerns about adopting new file formats and standards. The ability to adopt a new technique is mitigated because the code is open source -- because it's open, it's more likely to be available and usable in future.
CHI Ongoing Collaborations (include Worcester Art Museum, etc.) I'm going t talk about a specific project where we used leadership grant from IMLS and built software tool. Worked with team from UC Santa Cruz. Their standard license for having grad students work requires the U. to have all IP rights. Took some serious negotiation. By contrast , the Italian National Research Council probably wouldn't have been involved unless it was an open source model. An important thing is that everyone knows what the terms will be from the beginning.
Another issue is copyright ownership. So we ended up creating a joint copyright with the people who write the code, which gives us the rights to license and still lets them do stuff with it.
Open Source can be a tool to get people working together for the common good. License terms need to be agreed upon up front. And you need to make sure you choose collaborators with same goals.
Open source is a tool, not a religion. There are a range of licenses, good for different things.
Carla at c-h-i.org, http://www.c-h-i.org
Christopher J. Mackie, Associate Program Officer, REsearch in Information Technology Program, the Andrew W. Mellon Foundation.
Why Open Source? (My views don't represent foundation, if I say something egregiously stupid, it doesn't represent the foundation!)
We think we are the largest NGO funder of open source software. (We'd love to see someone bigger doing that. We have 30+ prokects with users and developers on every continent. http://rit.mellon.org. All major media sites picked up some visualization tools for historical timelines we made for historians and used them in their coverage. All is open source, most is community source.
Open Source isn't always a bunch of people working in their garage in the evenings. It can be big business.
Open source is a licensing scheme (or sometimes a religious belief)l it's not
a guarantee of freedom or success
a sustainability model
a software architecture (in itself)
A single organizational model
a (good) technology strategy in and of itself.
So why use it?
Open source is governed by and for users. Can't be bought, closed. No one can buy project and foreclose it or require upgrade. Ownership is key.
Some people want open source for cost savings. This can happen if you try, but not always primary motivation. Biggest reason for adoption is risk management. It diffuses risk of new development across many institutions. (As long as project is well-run and viable.)
Why not? The Mickey and Judy model. Local optimization/Competency trap. (My mom's got costumes, your dad has a barn, let's put on show!) Bad tech strategy is bad strategy --doesn't matter if open source.
OS tends to cause developer centrism instead of user centrism. Stakeholders want to know what it does for me, and many OS projects have trouble answering that.
OS software brought in badly can subvert an overall technology strategy. Must be smart about it.
OSS Business Model. Most projects don't have one. Some have dual licensing. There is good (dual commercial/noncommercial). If vendor is well intentioned, these can be valuable. Evil is platinum edition, where the OS version is just a crippled version of the good one which is proprietary. That's not really open source.There's the services model, where software is oss but you pay someone to support. There's the appliance model where you are sold a box. There's hosting where the vendor does everything. And then there's software as service. Like hosting, but the vendor sets up internal infrastructure differently.
Varieties of OSS. Traditional is developer driven. Can be a terrific model if customers are developers. But may fail if developers are not final customers.
Single vendor driven - many companies allow you to download, but then try to become a monopoly vendor for it.
Most of the benefits of OSS only come if you can fire the vendor without finring the vendor.
And then there's what we support, which is community source, functionally driven. Collection Space is one of our projects like that.
Community source software is designed and built by and for the functional specialists for their community. We have ways of having these people get together and design a blueprint.Community owned -- when the collection space project is done, it will turn over IP to a foundation. It's community sustained - by contributions, and/or by a healthy vendor ecosystem. And it's state of the art in governance, tech, sociology... and sustainability.
We've done XX many projects, many of which are out of funding, and none of which have died.
What does it take? Not wealth. Vendors make it affordable. Ongoing support costs are 40-60% less than commercial, and vendors allow people to buy support instead of hiring developers. Mostly it takes organizational buy in and a strategic technology plan.
Strategic Plans and critical mass -- CriticalMASS (Mission, agility, sovereignty and sustainability) No strat. tech plan is a plan, but a really bad one.
Strategic Software: Gain buy in from stakeholders and executives. Contextualize CriticalMASS values for your institution. Evaluate resources holistically. If you do all this stuff, we think you'll end up with community source. But if you don't, that's fine.
Carl Goodman, Senior Deputy Director, Museum of Moving Image
Cgoodman at movingimage dot us
PI for Collections Space project
www.collectionspace.org
Getting comfortable with OSS. You're not on your own if you use OSS - you may have more support. The palmolive principle - you're already swimming in OSS everywhere. Wordpress, firefox, drupal, programming languages, Shopping Carts (OSCommerce), course management (Moodle)
Crisis breeds will to try new approaches. People who were more conservative may be willing to try new things.
Our project is not your father's open source project. THis is not people in their free time creating` spaghetti code.
Next generation web bases collection information, management and access platform that happens to be open source. But open source is not the defining point of the software. A number of partners on the project, and funded by Mellon.
Grew out of our work on a homegrown collections system called OpenCollection. Won an award for that. Led to larger grant. It became clear that we needed to step back and reinvent the software and work to create culture and community around it. BAsed on idea that managing, acquiring, disseminating collections is a core activity, as is getting it online, but difficult.
We are developing an alternative to commercial or homegrown CMS.
We want to leverage university partners culture of research and innovations, recognition that they have museums of constitutents. Museums do have developers, but they are booked. And most museums don't.
We're also trying to put UI design up front. The fluid project, which we're working with, is working to create UI elements for incorporation into OSS. We in collections management deserve usable software!
Brought in 40 people representing 20 organizations to think about what a new collections system would look like. Transparency. This is hard for museums. We are doing this in a way so that the community can be involved at any stage and see all debates, discussions, and mistakes. Project website and wiki are very detailed. We're taking a coordinated and highly structured approach to distributed software development. We have 12 people working full time on it, on a 2.5 year project.
We're decoupling various aspects of prokect from each other so that they can innovate and still stay in touch with each other. The functional team is working on the needs requirements for first phase pof project. There is a design UX team looking at user interfaces, use cases, etc.
The technical team is working on underlying architecture and tools.
Timeframe for al this. Tech platform Dec. 08. Development begins Jan. 09. Tire kicker march 09.
Sustainability- we are working on this as well.
THere is a lot of OSS inside the system. JAva/PHP or python or rails, Fedora, etc. Imagemagick, etc. And we hope parts of our project end up inside other projects. For example project OLE at Duke, or Bamboo. Omeka, Pachyderm. Other projects are further along, and have been helpful to us even though in different domain.
Brad Westbrook, Archivist's TOolkit Project Manager, and X at UC San Diego. MLS from UCLA, and MA in English from Suny Albany.
Overview of AT and how it became OSS, and some of our lessons learned. Mary contrasted us with Collection Space, and noted that we're more mature. True to some degree. But also more immature. They are already into sustainability, etc. That's something we came to belatedly.
It's an OSS RDB collection system. Purposely a staff-side tool - not targeted at external clients. Start up funding in 2002 by Digital Library Federation, and two development cycles funded by Mellon Foundation. Three public releases to date. Dec 2006, last january, and last one was wednesday night from this hotel. We are pleased with uptake. We have 40 or so institutions that have implemented as production tool and being available for other users as a resource.
Some institutions include Getty, Museum of flight, vermont folklife center, bates college, Princetion, UCLA, etc.
Designed to address key problems in archives domain. Serialized processing tools. One task done by one tool, another by another. A lot of redundant data entry and siloing, as well as inefficiency and increased training cost.
Also resulted in data with low interoperabiliuty. The online archive of CA found this out in 2001 when they found that the EADs that had been submitted. There was tremendous variability despite being national standard.
Also thought the tool could help reduce growing archival backlogs.
Solution was to build program that would promote standardization (DACS, ISARR (authority recs)
Supports export standards like EAD, MARCXML, METS, MODS, DC)
Also promote efficiently by integrating functions, enabling repurposing of data, automating encoding and reporting, and providing customization features. In the end, we thought it could decrease the cost of processing and improve sharing across community.
So why Open Source? We wanted it to be affordable. We wanted it to be on an enterprise quality database. Oracle and SQL Server were cost barriers for organization. So chose to go with MySQL. ALso, we wanted flexibility and adaptability. One of the complaints is that a vendor will give you SW that does something, but you can't modify it -- you take what you get. So if it meets most of your needs, you take it and adapt. We wanted a tool that orgs could modify. We liked the community volunteer model. Giving the users oversight for development priorities and features request, documentation, etc.
We're released under the Educational COmmunity License 1.0. Did this belatedly. Finally after wrestling with Apache or GPL. But felt ECL was favorably received by funder (we thought) and allowed certain commercial opportunities that might not be there with GPL. USed a lot of third party OSS to build out, and we began transition to user governance. We've shifted from sharing requirements with a small group of specialists to doing it with the community at large.
We worked with SAA to establish workshops for training. THere have been 5 in 2008 so far. We'll begin usability testing using AT users in NYC area. And we petitioned SAA to create AT roundtable, which will hopefully become seat for users to take over governance.
What we've learned? Match between product type and open source strategy. We probably at the beginning thought we'd have developers climbing out of windows hoping to contribute. We've finally come to the conclusion that this is because it's targeted at a very small group of users, and contributing requires domain knowledge. So we will probably not have many developers contributing.
Another thing is to start sustainability planning and think about license much earlier in process. We ended up with a mixture of code with some GPL and some not, which limits what we can do.
Q: How do you measure sustainability and suitability? A: there are some indexes out there like business readiness index. But these only rate big projects, are subjective, etc. SO hard to use. Important to look at how many people involved, how active community is, etc. Ideally you want a community with a diversity of vendors. A project where one institution does most of the work has a single point of failure. Etc.
Q: (Ari) Is there an equivalent for Code4Lib for archives/museums, and/or could this be expanded to code for cultural heritage. To help with developer domain knowledge problem? Bill - aware of code4lib. We've had a presence there due to Mark McKenzie. We've started announcing releases there. How we can take that further.
My comment: Importance of a good plugin architecture, documentation and example code to getting community development. Much harder if you have to understand entire project architecture just to make a small change.
Q: If Firefox dies, you lose your bookmarks. If your collections system dies, you lose everything? A: Disaster planning is important. You have to think about that. One thing people say is "with commercial I can always call someone." But that assumes that they will answer the phone at 5 am on sunday, that they will have an answer quickly, etc. Ideally your disaster plan should be somewhat beyond and independent of just calling someone.
Q: Ari - we started using ATK, but were flummoxed when we realized it didn't have built in integration with Fedora. WHat kind of planning is going into this sort of integration. A: We've put some thought into this, particularly with Fedora and Dspace. Haven't done a ton of planning beyond that - but would wecome community contributions on that. A: CollectionSpace is taking a lot of pains to make sure that we will be open to this type of integration. One of our use cases is Fedora at UC Berkeley. So instead of thinking of as system we're trying to see it as granular and open to these types of cobinations.
Q: You talked about single vendor projects. Can you give an example. A: one good example is Zimbra. If you want to run alongside an exchange server (which is how most people use it) you have to buy from them. 35b went into OSS over last few years, and much of it went into these single vendor solutions.
There's another strategy where marketing material says it's open, and what they really mean is that they have a proprietary API you can write to.
Q: Adobe FLex? Q: Like PDF is open source? A: PDF is an open standard, but not open source in that Acrobat is not open. But the standard is open. That's an important distinction.
Q: I've found that OSS software is held to higher standard than commercial - should be cheaper, offer more flexibility easily, etc. How do you manage this and help with integration and acceptance? A: There tends to be an overselling to leadership. A lot of that is our own fault. People in their zeal to make case overpromise. There isn't a substitute for educating your own leadership. They have to understand what is and isn't deliverable. People go in and say things that they won't be able to follow through on. The easy answer is don't do that. But it's hard, because you're facing the uphill struggle of adopting this new model. The way to short circuit this whole dynamic is to start with a strategic technology plan. If you do that and they're making informed decisions, I expect in most cases with rational people they will understand.
Q: People talk about free as in beer or free as in speech. Well, OSS is free as in kittens. That's a good model to follow.
Q: I come from the other side - our leadership is saying "yeah we need OSS because it's free." But that's a simplistic attitude. Need more strategic planning.
Q: Why aren't strategic technology planning more common? A: Technologists aren't always trained to do this. And at a large organization you learn how to do this. But as CTO in small museum, you may stumble on this, but you're less likely to be trained to do it. And a lot is about opportunity. One reason OSS projects are powerful is that they allow small institutions that don't have resources internally to find resources at the community level and bootstrap themselves. Big orgs often come in for their own reasons -- to demonstrate need for internal developers, etc. Smaller organizations without these resources come in later, but save more because they don't have to do everything internally.
Good end note: need for strategic technology plan!
About me: 20 years in Silicon Valley, 13 at Sun., etc.
Open Source is a licensing model for software. It tells you nothing about the actual software, how it was developed, whether support is available, etc.
Open source licenses have generally been approved by OSI (open source initiative). REquires free distribution, available source code, and people are allowed to modify source code. There are several major types of license:
Permissive Licenses (BSD,MIT,Apache)
Copyleft (Mozilla, GGLGPL, Eurpoean Union Public License (EUPL)
Strong copyleft licenses - Gnu General Public License GPL.
What distinguishes? What you get to do with code downstram. Under a permissive code, I can take and relicense under another license, etc.
A copyleft license, by contrast, I have to release any modifications under same license. Can't go closed/into commercial thing. THere are two flavors. In lightweight, it's file based. So I could fix bugs in one file and release it. (But could release plugin under any license I chose.)
Strong copyleft licenses mean that it is viral.
Update: Carla wrote me a note clarifying her point on this:
There is one thing in your description of the licenses part that isn't quite right, and makes a difference to me. This is in the area of strong copyleft licenses. You say:
Strong copyleft licenses mean that it is viral. Anything that touches licensed code required to have same license.
While I think a lot of people believe this to be true, I strive really hard not to go that far in talking about the copyleft effect of the GPL licenses. My slide, and I hope my presentation, said that under some circumstances the copyleft effect can go beyond the original files. I didn't have time to get into the vagaries of this in a short overview, and I recommended some resources for folks that wanted to understand the boundaries a bit further. I think the GPL licenses are really valuable for some situations, and that folks are unduly scared away from it due to fears about the extent of the copyleft effect. It is true that it can affect files that are not part of the original GPL code, but I wouldn't go so far as to say that it affects everything that touches it. (the license talks about "linking with" and also the act of distributing code together can trigger the effect). There are different legal interpretations on what circumstances trigger the requirement, though some things are quite clear.
Permissive license allows much use with few restrictions, Copyleft ensure that work based on license remains open.
These are not all compatible -- you can't combine some of them. Software freedom law center writes a lot about this. There are very few legal precedents, so some is still up in air.
Choosing software: You should ask the same questions you would ask for Open Source as for non-open source. An important thing is what is the cost of switching if it doesn't work out. Also what's the TCO? The cost of the license is one factor. But you still have all the other costs -- training, sustaining, maintenance, etc.
Probably the most controversial thing I'll say this AM. I believe that open source can help mitigate concerns about adopting new file formats and standards. The ability to adopt a new technique is mitigated because the code is open source -- because it's open, it's more likely to be available and usable in future.
CHI Ongoing Collaborations (include Worcester Art Museum, etc.) I'm going t talk about a specific project where we used leadership grant from IMLS and built software tool. Worked with team from UC Santa Cruz. Their standard license for having grad students work requires the U. to have all IP rights. Took some serious negotiation. By contrast , the Italian National Research Council probably wouldn't have been involved unless it was an open source model. An important thing is that everyone knows what the terms will be from the beginning.
Another issue is copyright ownership. So we ended up creating a joint copyright with the people who write the code, which gives us the rights to license and still lets them do stuff with it.
Open Source can be a tool to get people working together for the common good. License terms need to be agreed upon up front. And you need to make sure you choose collaborators with same goals.
Open source is a tool, not a religion. There are a range of licenses, good for different things.
Carla at c-h-i.org, http://www.c-h-i.org
Christopher J. Mackie, Associate Program Officer, REsearch in Information Technology Program, the Andrew W. Mellon Foundation.
Why Open Source? (My views don't represent foundation, if I say something egregiously stupid, it doesn't represent the foundation!)
We think we are the largest NGO funder of open source software. (We'd love to see someone bigger doing that. We have 30+ prokects with users and developers on every continent. http://rit.mellon.org. All major media sites picked up some visualization tools for historical timelines we made for historians and used them in their coverage. All is open source, most is community source.
Open Source isn't always a bunch of people working in their garage in the evenings. It can be big business.
Open source is a licensing scheme (or sometimes a religious belief)l it's not
a guarantee of freedom or success
a sustainability model
a software architecture (in itself)
A single organizational model
a (good) technology strategy in and of itself.
So why use it?
Open source is governed by and for users. Can't be bought, closed. No one can buy project and foreclose it or require upgrade. Ownership is key.
Some people want open source for cost savings. This can happen if you try, but not always primary motivation. Biggest reason for adoption is risk management. It diffuses risk of new development across many institutions. (As long as project is well-run and viable.)
Why not? The Mickey and Judy model. Local optimization/Competency trap. (My mom's got costumes, your dad has a barn, let's put on show!) Bad tech strategy is bad strategy --doesn't matter if open source.
OS tends to cause developer centrism instead of user centrism. Stakeholders want to know what it does for me, and many OS projects have trouble answering that.
OS software brought in badly can subvert an overall technology strategy. Must be smart about it.
OSS Business Model. Most projects don't have one. Some have dual licensing. There is good (dual commercial/noncommercial). If vendor is well intentioned, these can be valuable. Evil is platinum edition, where the OS version is just a crippled version of the good one which is proprietary. That's not really open source.There's the services model, where software is oss but you pay someone to support. There's the appliance model where you are sold a box. There's hosting where the vendor does everything. And then there's software as service. Like hosting, but the vendor sets up internal infrastructure differently.
Varieties of OSS. Traditional is developer driven. Can be a terrific model if customers are developers. But may fail if developers are not final customers.
Single vendor driven - many companies allow you to download, but then try to become a monopoly vendor for it.
Most of the benefits of OSS only come if you can fire the vendor without finring the vendor.
And then there's what we support, which is community source, functionally driven. Collection Space is one of our projects like that.
Community source software is designed and built by and for the functional specialists for their community. We have ways of having these people get together and design a blueprint.Community owned -- when the collection space project is done, it will turn over IP to a foundation. It's community sustained - by contributions, and/or by a healthy vendor ecosystem. And it's state of the art in governance, tech, sociology... and sustainability.
We've done XX many projects, many of which are out of funding, and none of which have died.
What does it take? Not wealth. Vendors make it affordable. Ongoing support costs are 40-60% less than commercial, and vendors allow people to buy support instead of hiring developers. Mostly it takes organizational buy in and a strategic technology plan.
Strategic Plans and critical mass -- CriticalMASS (Mission, agility, sovereignty and sustainability) No strat. tech plan is a plan, but a really bad one.
Strategic Software: Gain buy in from stakeholders and executives. Contextualize CriticalMASS values for your institution. Evaluate resources holistically. If you do all this stuff, we think you'll end up with community source. But if you don't, that's fine.
Carl Goodman, Senior Deputy Director, Museum of Moving Image
Cgoodman at movingimage dot us
PI for Collections Space project
www.collectionspace.org
Getting comfortable with OSS. You're not on your own if you use OSS - you may have more support. The palmolive principle - you're already swimming in OSS everywhere. Wordpress, firefox, drupal, programming languages, Shopping Carts (OSCommerce), course management (Moodle)
Crisis breeds will to try new approaches. People who were more conservative may be willing to try new things.
Our project is not your father's open source project. THis is not people in their free time creating` spaghetti code.
Next generation web bases collection information, management and access platform that happens to be open source. But open source is not the defining point of the software. A number of partners on the project, and funded by Mellon.
Grew out of our work on a homegrown collections system called OpenCollection. Won an award for that. Led to larger grant. It became clear that we needed to step back and reinvent the software and work to create culture and community around it. BAsed on idea that managing, acquiring, disseminating collections is a core activity, as is getting it online, but difficult.
We are developing an alternative to commercial or homegrown CMS.
We want to leverage university partners culture of research and innovations, recognition that they have museums of constitutents. Museums do have developers, but they are booked. And most museums don't.
We're also trying to put UI design up front. The fluid project, which we're working with, is working to create UI elements for incorporation into OSS. We in collections management deserve usable software!
Brought in 40 people representing 20 organizations to think about what a new collections system would look like. Transparency. This is hard for museums. We are doing this in a way so that the community can be involved at any stage and see all debates, discussions, and mistakes. Project website and wiki are very detailed. We're taking a coordinated and highly structured approach to distributed software development. We have 12 people working full time on it, on a 2.5 year project.
We're decoupling various aspects of prokect from each other so that they can innovate and still stay in touch with each other. The functional team is working on the needs requirements for first phase pof project. There is a design UX team looking at user interfaces, use cases, etc.
The technical team is working on underlying architecture and tools.
Timeframe for al this. Tech platform Dec. 08. Development begins Jan. 09. Tire kicker march 09.
Sustainability- we are working on this as well.
THere is a lot of OSS inside the system. JAva/PHP or python or rails, Fedora, etc. Imagemagick, etc. And we hope parts of our project end up inside other projects. For example project OLE at Duke, or Bamboo. Omeka, Pachyderm. Other projects are further along, and have been helpful to us even though in different domain.
Brad Westbrook, Archivist's TOolkit Project Manager, and X at UC San Diego. MLS from UCLA, and MA in English from Suny Albany.
Overview of AT and how it became OSS, and some of our lessons learned. Mary contrasted us with Collection Space, and noted that we're more mature. True to some degree. But also more immature. They are already into sustainability, etc. That's something we came to belatedly.
It's an OSS RDB collection system. Purposely a staff-side tool - not targeted at external clients. Start up funding in 2002 by Digital Library Federation, and two development cycles funded by Mellon Foundation. Three public releases to date. Dec 2006, last january, and last one was wednesday night from this hotel. We are pleased with uptake. We have 40 or so institutions that have implemented as production tool and being available for other users as a resource.
Some institutions include Getty, Museum of flight, vermont folklife center, bates college, Princetion, UCLA, etc.
Designed to address key problems in archives domain. Serialized processing tools. One task done by one tool, another by another. A lot of redundant data entry and siloing, as well as inefficiency and increased training cost.
Also resulted in data with low interoperabiliuty. The online archive of CA found this out in 2001 when they found that the EADs that had been submitted. There was tremendous variability despite being national standard.
Also thought the tool could help reduce growing archival backlogs.
Solution was to build program that would promote standardization (DACS, ISARR (authority recs)
Supports export standards like EAD, MARCXML, METS, MODS, DC)
Also promote efficiently by integrating functions, enabling repurposing of data, automating encoding and reporting, and providing customization features. In the end, we thought it could decrease the cost of processing and improve sharing across community.
So why Open Source? We wanted it to be affordable. We wanted it to be on an enterprise quality database. Oracle and SQL Server were cost barriers for organization. So chose to go with MySQL. ALso, we wanted flexibility and adaptability. One of the complaints is that a vendor will give you SW that does something, but you can't modify it -- you take what you get. So if it meets most of your needs, you take it and adapt. We wanted a tool that orgs could modify. We liked the community volunteer model. Giving the users oversight for development priorities and features request, documentation, etc.
We're released under the Educational COmmunity License 1.0. Did this belatedly. Finally after wrestling with Apache or GPL. But felt ECL was favorably received by funder (we thought) and allowed certain commercial opportunities that might not be there with GPL. USed a lot of third party OSS to build out, and we began transition to user governance. We've shifted from sharing requirements with a small group of specialists to doing it with the community at large.
We worked with SAA to establish workshops for training. THere have been 5 in 2008 so far. We'll begin usability testing using AT users in NYC area. And we petitioned SAA to create AT roundtable, which will hopefully become seat for users to take over governance.
What we've learned? Match between product type and open source strategy. We probably at the beginning thought we'd have developers climbing out of windows hoping to contribute. We've finally come to the conclusion that this is because it's targeted at a very small group of users, and contributing requires domain knowledge. So we will probably not have many developers contributing.
Another thing is to start sustainability planning and think about license much earlier in process. We ended up with a mixture of code with some GPL and some not, which limits what we can do.
Q: How do you measure sustainability and suitability? A: there are some indexes out there like business readiness index. But these only rate big projects, are subjective, etc. SO hard to use. Important to look at how many people involved, how active community is, etc. Ideally you want a community with a diversity of vendors. A project where one institution does most of the work has a single point of failure. Etc.
Q: (Ari) Is there an equivalent for Code4Lib for archives/museums, and/or could this be expanded to code for cultural heritage. To help with developer domain knowledge problem? Bill - aware of code4lib. We've had a presence there due to Mark McKenzie. We've started announcing releases there. How we can take that further.
My comment: Importance of a good plugin architecture, documentation and example code to getting community development. Much harder if you have to understand entire project architecture just to make a small change.
Q: If Firefox dies, you lose your bookmarks. If your collections system dies, you lose everything? A: Disaster planning is important. You have to think about that. One thing people say is "with commercial I can always call someone." But that assumes that they will answer the phone at 5 am on sunday, that they will have an answer quickly, etc. Ideally your disaster plan should be somewhat beyond and independent of just calling someone.
Q: Ari - we started using ATK, but were flummoxed when we realized it didn't have built in integration with Fedora. WHat kind of planning is going into this sort of integration. A: We've put some thought into this, particularly with Fedora and Dspace. Haven't done a ton of planning beyond that - but would wecome community contributions on that. A: CollectionSpace is taking a lot of pains to make sure that we will be open to this type of integration. One of our use cases is Fedora at UC Berkeley. So instead of thinking of as system we're trying to see it as granular and open to these types of cobinations.
Q: You talked about single vendor projects. Can you give an example. A: one good example is Zimbra. If you want to run alongside an exchange server (which is how most people use it) you have to buy from them. 35b went into OSS over last few years, and much of it went into these single vendor solutions.
There's another strategy where marketing material says it's open, and what they really mean is that they have a proprietary API you can write to.
Q: Adobe FLex? Q: Like PDF is open source? A: PDF is an open standard, but not open source in that Acrobat is not open. But the standard is open. That's an important distinction.
Q: I've found that OSS software is held to higher standard than commercial - should be cheaper, offer more flexibility easily, etc. How do you manage this and help with integration and acceptance? A: There tends to be an overselling to leadership. A lot of that is our own fault. People in their zeal to make case overpromise. There isn't a substitute for educating your own leadership. They have to understand what is and isn't deliverable. People go in and say things that they won't be able to follow through on. The easy answer is don't do that. But it's hard, because you're facing the uphill struggle of adopting this new model. The way to short circuit this whole dynamic is to start with a strategic technology plan. If you do that and they're making informed decisions, I expect in most cases with rational people they will understand.
Q: People talk about free as in beer or free as in speech. Well, OSS is free as in kittens. That's a good model to follow.
Q: I come from the other side - our leadership is saying "yeah we need OSS because it's free." But that's a simplistic attitude. Need more strategic planning.
Q: Why aren't strategic technology planning more common? A: Technologists aren't always trained to do this. And at a large organization you learn how to do this. But as CTO in small museum, you may stumble on this, but you're less likely to be trained to do it. And a lot is about opportunity. One reason OSS projects are powerful is that they allow small institutions that don't have resources internally to find resources at the community level and bootstrap themselves. Big orgs often come in for their own reasons -- to demonstrate need for internal developers, etc. Smaller organizations without these resources come in later, but save more because they don't have to do everything internally.
Good end note: need for strategic technology plan!
Labels: MCN2008
