Uploaded image for project: 'Metadata Aggregator'
  1. Metadata Aggregator
  2. MDA-135

add subtractive merge strategy

    XMLWordPrintable

    Details

    • Type: New Feature
    • Status: Open
    • Priority: Minor
    • Resolution: Unresolved
    • Affects Version/s: 0.8.0
    • Fix Version/s: None
    • Component/s: Pipeline
    • Labels:
      None

      Description

      We have a couple of merge strategy implementations today. The first just includes all entities from all collections, and it's probably doesn't have a lot of application when processing SAML metadata, where there is a constraint that each entityID only appears once. For that case, we have the deduplicating strategy, and that works very well for the case in federations like the UKf where imported metadata is added to an existing collection and published as a single aggregate.

      Some other federations publish imported metadata in a separate aggregate, but would still like to exclude those entities present in their main aggregate. This could be achieved using a "subtractive" merge strategy. I can think of two definitions for this.

      Option 1 would be like the deduplicating strategy except that the entities from the base collection are not included in the output. They would still act to remove "duplicates" from the collections being merged "in", however. This would allow multiple aggregates to be deduplicated with a single source of, essentially, a dynamic blacklist.

      Option 2 would be to use the collections being merged as blacklist sources. This seems a little more natural for the simple case, but means that people merging multiple metadata sources against a single blacklist source would need to perform a two-stage merge.

        Attachments

          Activity

            People

            Assignee:
            ian@iay.org.uk Ian Young
            Reporter:
            ian@iay.org.uk Ian Young
            Watchers:
            1 Start watching this issue

              Dates

              Created:
              Updated: