Biomedical research is becoming increasingly reliant on multi-disciplinary,
multi-institutional collaboration, and data sharing is becoming increasingly important for researchers to reuse experiments, pool expertise, and validate approaches. However, there are many hurdles to data sharing, including unwillingness to share, the lack of a flexible data model for providing context information for shared data, the difficulty of sharing syntactically and semantically consistent data across distributed institutions, and the high cost of providing tools to share the data. In our work, we develop {\em SciPort}, a Web-based collaborative platform that supports biomedical data sharing across distributed organizations. SciPort provides a generic metadata model that researchers can use to flexibly customize and organize their data. To enable convenient data sharing, SciPort provides a central-server-based data sharing architecture, in which data can be shared with one click by publishing metadata to the central server. To enable consistent data sharing, SciPort provides collaborative schema management across distributed sites. To enable semantic consistency for data sharing, SciPort provides semantic tagging through controlled vocabularies. SciPort is lightweight and can be easily deployed to build data sharing communities for biomedical research.

With increased complexity of scientific problems, biomedical
research is increasingly a collaborative effort across multiple
institutions and disciplines.  Data sharing is becoming critical for
validating approaches and ensuring that future research can build on
previous efforts and discoveries. As a result, scientific funding
agencies often require that data produced in grant projects be
shared. For example, the US National Institutes of Health (NIH)
requires data sharing for NIH-funded projects of \$500,000 or more
in direct costs in any one year.

To support large-scale collaborative biomedical research, NIH
provides large-scale collaborative project awards that allow a team of
independently funded investigators to synergize and integrate their
efforts, and the awards mandate that the research results and data be
shared (\cite{Meng}). The Network for
Translational Research (NTR): Optical Imaging in Multimodality
Platforms (\cite{Silva}) is one such collaborative project, focused on the
development, optimization, and validation of imaging methods and
protocols for rapid translation to clinical environments. It
requires not only managing complex scientific research results,
but also sharing the data across hundreds of research collaborators.
As another example, Siemens Healthcare has research collaborations
with hundreds of research sites distributed across the US, each
providing Siemens with marketing support by periodically delivering white
papers, case reports, clinical methods, clinical protocols,
state-of-the-art images, etc. In the past, research partners had no
convenient method for sharing data with Siemens, and data were mostly
delivered through media such as email, CDs, and hard
copies. This made it very difficult to organize, query, and integrate
the shared data.

