to-do-microsoft-front

Microsoft released To-Do

Microsoft released a test version of To-Do, a task list application for Android, iOS and Windows yesterday, Wednesday (19th April). It was developed by the same creators of Wunderlist, purchased by Microsoft in 2015, and will replace the older brother in the future. To-Do has an interface reminiscent of Wunderlist. Read more…

Office365 Roadmap

Office365 Roadmap

Office365 Roadmap A cool thing that not many people knows, is how the Office365 is dealing with requests from Community. There is actually a Roadmap that everyone that keep up with and lists updates that are currently planned for applicable subscribers. Updates are at various stages from being in development to Read more…

SharePoint Search – Content Enrichment WebService

Recently we came across the need to manipulate the values of Managed Properties. In our specific case it was required that for some contents, the File Type Managed Property should be empty.

In order to do so we used the Content Enrichment web service, which is a service that adds a custom step to content processing in order to enable users to modify managed properties values before they are indexed.

In this blog post we’ll walk you through the steps we took in order to configure the service and also talk about some of it’s limitations and how we can overcome them.

Creating the Web Service

The web service consists of a basic WCF that receives an array of managed properties and outputs an array of managed properties that are customized according to our needs. In our scenario we just needed to clean the FileType managed property of some contents, so our output will be an array containing just the FileType managed property with an empty value. Let’s begin:

  1. Create a WCF Service Application (ex: SLEnrichmentService)
  2. Add a reference to the Microsoft.Office.Server.Search.ContentProcessingEnrichment.dll (located in Installation Path\Microsoft Office Servers\15.0\Search\Applications\External\)
  3. Modify the service to implement IContentProcessingEnrichmentService the contract interface as following:
  4. Publish your service

Configure SharePoint

Publishing it is not enough, In order for the Search Service to call our service we first need to register it. To do that we need to run a simple Powershell in which we say where the Web Service is located, which Managed Properties it expects and what managed properties it will return. In this specific case we just need to clean the FileType property, so this will be both our input and output property.

Since we just need the service to be called for some types of content,  in our powershell we’ll have a trigger condition that validates that condition. In this scenario we have a custom Managed Property called SLExtChannel that holds the values of the types of contents, we will use it to check if the service should be called or not.

Here is our powershell:

After running the script all you need to do is perform a Crawl.

Conclusions

Although this was a simple example we can see how useful this web service can be, we can address issues like missing/inconsistent metadata, clean content,  merge data from other services, etc.

By using the trigger condition we can ensure that the service will be called only when that condition is verified minimizing the impact on the Crawl/Index.

Nonetheless there are still some drawbacks when it comes to using this service:

  • The content enrichment web service is a synchronous callout in the content processing pipeline, complex operations may impact the crawl durations/performance.
  • We can have only one Content enrichment service registered per Search Service Application, which means that all content sources will be enable to use the Content Enrichment Service.

If we want to have specific callouts for each Content Source we can still achieve that by registering a WCF Routing Service as our Content Enrichment Web Service and create a route table based on the content source performing the callout.

(more…)