The main goal of the team is to gain users' confidence in workflow execution results by enhancing results with provenance information, trust recommendations, and levels of uncertainty. Provenance, trust, and uncertainty about the results of cyber-infrastructure-based applications, e.g., scientific workflows, are essential if scientists are to believe and, thus accept, these results. This subproject will address the complex problem of using provenance as the key enabler for integrating trust management and uncertainty management in distributed environments like the grid. The subproject will support research leading to a uniform way of representing uncertainty and trust. Uncertainty models embedded in provenance will be comprehensive enough to support the computation of several dimensions of uncertainty including error, accuracy, and reliability. Trust models embedded in provenance will be rich enough to support the computation of trust recommendations that can describe several aspects of trust including distrust, partial trust, ignorance, and inconsistencies.


Team

  • Paulo Pinheiro da Silva (Lead, CS)
  • Vladik Kreinovich (CS)
  • Ann Gates (CS)
  • Leonardo Salayandia (Research staff – Center Team)
  • Nicholas Del Rio (Ph.D. student)
  • Aida Gandara (Ph.D. student)
  • Alejandro Castaneda (M.Sc. student)

Summary of Year 1 Activities

Provenance. Activities centered on the development of a lightweight Proof Markup Language which will encode basic justifications and can be used by a restricted set of Inference Web tools for retrieval and browsing of provenance information.

Trust. The focus was the development of TrustMap, a strategy for propagating and aggregating trust about a scientific artifact. TrustMap can facilitate the acceptance of an artifact as a quality product by its users. CI-Learner, another approach developed to extract a social trust network from information available on the web, considers co-authorship as an explicit evidence of trust between agents of the network, and it limits its scope to trust relations that can be extracted from one scientific portal. An IRIS use case scenario was developed and used to extract trust network information for the Earth Science community. Activities demonstrated the potential of providing trust recommendations through trust maps and the use of data to construct a social trust network. A user study assessed completeness and usefulness of ranking as perceived by members of the network.

Workflow. Abstract workflows in the form of workflow-driven ontologies (WDOs) were used to document and design workflow artifacts. An abstract workflow can be manipulated by domain experts because abstraction hides implementation details irrelevant to the experts. As a result, domain experts can better understand and independently enhance the workflow.

The WDO approach has been formalized and WDO-specific tools have been developed to replace generic tools such as OWL editors and spreadsheets. With the introduction of an upper-level class hierarchy of workflow-related classes, WDOs facilitate the process of creating ontologies to be used on scientific domains. With the introduction of a well-defined set of relationships between classes, WDOs are used to guide scientists through the process of relating classes in a way that can later be polished into useful workflow specifications. With the introduction of indirect relationships, WDOs enable knowledge capture for underspecified processes that can be refined later.

Uncertainty. Activities focused on developing new and faster algorithms for processing uncertainty related to cyberinfrastructure where different data points come with different, e.g., probabilistic, interval, and fuzzy, information about their uncertainty.


Summary of Year 2 Activities

The team started implementing systematic semantic enhancements to geoscience and environmental science processes to improve the problem-solving capabilities of these processes. The enhancements are part of the subproject goal of working collaboratively with geoscientists and environmental scientists to accomplish the following objectives:

  1. To understand how scientists tackle scientific problems, and in particular, how they develop, execute and improve complex processes to solve those problems;
  2. To expose scientists to semantic and cyber-infrastructure technologies that have the potential of overcoming known limitations of the current scientific processes;
  3. To indentify key enhancements that can better support solving the problems identified in #1;
  4. To develop tools that can be used to create and maintain artifacts in support of enhancements identified in #2; and
  5. Using the proposed enhancements in #3 and the tools in #4, to collaboratively work with scientists to enhance their complex processes.
These objectives led to documenting selected scientific processes. A goal is to establish a new Cyber-ShARE-way of enhancing these scientific processes with semantic information, the CI-Miner approach, and to develop human resources in other scientific fields capable of replicating and disseminating the use of semantic technology. The following processes were documented with semantic enhancement:
  • Hole’s Code – Earth science/seismology
  • Reflectance Data Gathering – Environmental science
  • 2 ½ D crustal structure of the Earth – Earth science
  • National Center for Atmospheric Research (NCAR) CHIP Quick Look Process – Solar physics
More efficient algorithms were developed for processing uncertainty, especially uncertainty related to cyber-infrastructure, where different data points have different information about their uncertainty.

 

Related Links

Project Website »