Configure Sitecore 9.1 Content Tagging feature with Open Calais
What is Tagging?
Tagging is the practice of assigning metadata to a piece of content, either by the content creator or by the readers of the content, so that related content can be grouped into the same category.
Content tagging is a significant feature introduced in the Sitecore 9.1 release. I had been eagerly waiting for this release ever since I heard about the feature at Sitecore Symposium 2018 in Orlando.
Content tagging is a broad term. To build a proper taxonomy for our content, we may need a full-fledged intelligent system that can analyze the content and tag it automatically, based on the relevance of its keywords.
Even in the 21st century, many people still do this job manually. Companies hire dedicated content authors who read the content line by line and tag it according to the taxonomy bucket, and it gets worse when you publish lots of articles or publications per day.
With the Sitecore content tagging feature, we can automate this manual task by integrating with an intelligent tagging (metadata) provider such as Sitecore Cortex or Open Calais.
Today's digital marketers want to automate metadata tagging with the click of a single button. Since we use Sitecore to create our content pages, we wanted a button on the Sitecore ribbon that tags the content with the relevant tags.
Without further delay, let me introduce one of the leading tagging providers, Open Calais by Thomson Reuters. In Thomson Reuters' own words:
Thomson Reuters Open Calais™ offers the easiest and most accurate way to tag the people, places, companies, facts, and events in your content to increase its value, accessibility, and interoperability.
We use Natural Language Processing (NLP) and machine learning algorithms trained by hundreds of Thomson Reuters’ Editorial teams for several years to offer the industry’s best combination of company extraction and relevance. For the user, the process is pretty simple. You feed unstructured text into the extraction engine (news articles, blog postings, etc.) to examine your text and locate:
· Entities: (Companies, people, places, products, etc.)
· Relationships: (John Doe works for Acme Corp.)
· Facts: (John Doe is a 42-year old male CFO)
· Events: (Jane Doe was appointed a board member of Acme Corp.)
· Topics: (Story is about M&As in the Pharma industry)
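To make these categories a little more concrete, here is a minimal Python sketch that groups tagged items by category. It assumes the JSON response is an object whose values carry a `_typeGroup` field (such as "entities" or "topics") and a `name` field; that shape is my recollection of the Open Calais output, not something taken from this article, so please verify it against the official documentation. The sample data is made up for illustration.

```python
from collections import defaultdict


def group_by_type(calais_response):
    """Group item names from an Open Calais-style JSON response by type group."""
    groups = defaultdict(list)
    for item in calais_response.values():
        # Skip anything that is not a tagged item (for example, document metadata).
        if not isinstance(item, dict) or "_typeGroup" not in item:
            continue
        groups[item["_typeGroup"]].append(item.get("name", ""))
    return dict(groups)


# Made-up sample shaped like the ASSUMED response format (not real API output).
sample = {
    "doc": {"info": {"docId": "example"}},
    "http://example.invalid/1": {"_typeGroup": "entities", "_type": "Company", "name": "Acme Corp."},
    "http://example.invalid/2": {"_typeGroup": "entities", "_type": "Person", "name": "John Doe"},
    "http://example.invalid/3": {"_typeGroup": "topics", "name": "Business_Finance"},
}

print(group_by_type(sample))
# {'entities': ['Acme Corp.', 'John Doe'], 'topics': ['Business_Finance']}
```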
Now we understand what content tagging is, why auto-tagging is important, and what Open Calais is.
Let's get practical and configure the auto-tagging feature in Sitecore.
Please note that this feature is available starting with the Sitecore 9.1 release. If you haven't installed Sitecore 9.1 yet, please install it first.
Step 1: Create an account on Open Calais
Register at http://www.opencalais.com/opencalais-api/ and get your access token, as below:
Step 2: Configure the API in Sitecore 9.1
When you open the default Sitecore 9.1 content tree and go to the Home tab of the ribbon, you will find an option called “Tag”, which is new in Sitecore 9.1. By default, it stays disabled until you configure a content tagging provider.
Now go to the Sitecore configuration:
1. Navigate to \App_Config\Sitecore\ContentTagging\Sitecore.ContentTagging.OpenCalais.config
2. In the setting below, provide the Open Calais token that we generated in Step 1:
<setting name="Sitecore.ContentTagging.OpenCalais.CalaisAccessToken" value="" />
- value = your Open Calais token
- Leave the language as English and the endpoint as https://api.thomsonreuters.com/permid/calais
The configuration is all set. Now let's consume the API in Sitecore.
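If the Tag button stays disabled, an empty token is the first thing I would check. Here is a small Python sketch that looks up the setting by the name shown above and reports whether it has a value. The sample XML is a simplified stand-in for the real config file (the wrapper elements are an assumption), and `read_token` is just a helper name I made up for this example.

```python
import xml.etree.ElementTree as ET

TOKEN_SETTING = "Sitecore.ContentTagging.OpenCalais.CalaisAccessToken"


def read_token(root):
    """Return the token value from a parsed config, or None if the setting is missing."""
    for setting in root.iter("setting"):
        if setting.get("name") == TOKEN_SETTING:
            return setting.get("value", "").strip()
    return None


# Simplified stand-in for the config file; the real file contains more elements.
SAMPLE_CONFIG = """
<configuration>
  <sitecore>
    <settings>
      <setting name="Sitecore.ContentTagging.OpenCalais.CalaisAccessToken" value="" />
    </settings>
  </sitecore>
</configuration>
"""

token = read_token(ET.fromstring(SAMPLE_CONFIG))
if not token:
    print("Token is missing or empty; the Tag button will stay disabled.")
else:
    print("Token is configured.")
```

To run the same check against the real file, parse it with `ET.parse(path).getroot()` and pass the result to `read_token`.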
Step 3: Consume the Open Calais API in the Sitecore Content Tree
Go to the content tree and update any Sitecore content item with some data. For demo purposes, I added the paragraph below to the RTE field:
“Sitecore’s digital experience platform combines content, commerce, and personalization to help customers easily create the most relevant, individualized experiences possible. We capture interactions at every touchpoint to provide a personalized experience across every channel, continuously over the lifetime of the customer. Businesses look to Sitecore to power the most relevant experiences possible.”
As you can see, the Tag option is now enabled, after configuring the API token in the configuration file.
Now click Tag Item:
The moment I click the Tag Item button, Sitecore starts the content tagging service, which calls the API and gets the data:
Now check the tags on the item. Navigate to the Tagging section, and here are the results:
All relevant tags, such as Technology_Internet, Business_Finance, Information science, and Content Management System, have been added automatically, without any manual intervention.
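For a rough idea of what a call to Open Calais looks like outside Sitecore, here is a Python sketch that only builds the HTTP request and does not send it. The endpoint is the one from the configuration step. The header names (`X-AG-Access-Token`, `Content-Type: text/raw`, `outputformat`) are what I recall from the Open Calais API and are not taken from this article, so treat them as assumptions. This is not necessarily how Sitecore makes the call internally.

```python
import urllib.request

CALAIS_ENDPOINT = "https://api.thomsonreuters.com/permid/calais"


def build_headers(token):
    """Headers for a tagging request (header names are assumptions, see above)."""
    return {
        "X-AG-Access-Token": token,
        "Content-Type": "text/raw",
        "outputformat": "application/json",
    }


def build_request(text, token):
    """Build a POST request carrying the raw text to be tagged, without sending it."""
    return urllib.request.Request(
        CALAIS_ENDPOINT,
        data=text.encode("utf-8"),
        headers=build_headers(token),
        method="POST",
    )


# "YOUR_TOKEN" is a placeholder, not a real token.
req = build_request("Businesses look to Sitecore to power relevant experiences.", "YOUR_TOKEN")
print(req.get_method(), req.full_url)
```

Passing `req` to `urllib.request.urlopen` would actually send it, which requires a valid token and outbound internet access from the machine running the script.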
PDF Search for Content Tagging
The content tagging feature does not only read RTE, single-line, and multi-line text fields. It also supports documents such as PDFs. For example, suppose you have some Sitecore items with related documents attached to them, such as a product information PDF, and you want to tag the items based on the PDF content as well. This is possible with this integration, as Open Calais supports PDF documents.
Now I am going to attach an SIF configuration PDF in a Sitecore internal link field, as below:
The tags are blank for now:
Now add the tags from the ribbon, and here is the result:
Isn't this cool?
Some other observations:
1. This functionality requires an internet connection, as it calls the https://api.thomsonreuters.com/permid/calais API. Make sure that your production servers have internet access and that this API URL is whitelisted.
2. All the Open Calais tags are stored in Sitecore itself, as bucketable Sitecore items, in the location below:
Demo in Thomson Reuters Open Calais Website
Want to quickly check the Open Calais results without importing anything into Sitecore? No worries, there is an easy way: go to the demo on the Thomson Reuters Open Calais website.
Paste some content or upload a document, and you will find the relevant tags on the screen immediately, as below:
I hope you enjoyed this article and that it helps you configure the Sitecore 9.1 content tagging feature.