Watson Cloud is the managed service version of IBM’s AI-powered cloud software Watson. IBM currently offers a free developer trial of Watson Cloud, which you can use to build applications or automate tasks in just a few minutes.
Watson Cloud provides APIs for all of the services provided by Watson for use in your applications. This includes text-to-speech, language translation, speech recognition, and more.
Features
Watson Cloud provides a robust set of features that make interacting with computers more human. This includes, but is not limited to:
- Natural language understanding
- Conversational agents
- Computer vision
- Face detection
- Speech to text conversion
- Audio transcription
- Virtual assistant
- Chatbots
Getting Started
Getting started with Watson Cloud is extremely easy. Just log in to the console, select the `watson` sub-domain from the top-right dropdown menu, and follow the instructions.
From there, you can create your first bot for free. You don’t have to provide a name or image for your bot. Simply follow the steps.
Provide the bot with a few words of explanation and start building your first application. You can use the same **Python** API to build both your bot and application. Don’t worry, we’ve got you covered. IBM already created a fully featured starter kit for you to use. You can find it on the Downloads page.
API Overview
The APIs provided by Watson Cloud enable developers to easily build advanced applications using its platform. Each API provides developers with a specific skill set that can be used to create intelligent applications. For example, the Text-to-speech API enables developers to build voice-enabled applications like virtual assistants and automated call centers. The API documentation for Watson Cloud can be found on the Api Documentation page.
Text-to-Speech
One of the most useful and straightforward APIs provided by Watson Cloud is Text-to-speech. This API enables text input and display functions that are useful for creating virtual assistants and other applications that require text to be processed. For example, using this API you can have your chatbot ask questions like, “How can I help you?” or “What would you like to know about?” You can use this API in combination with the Conversation API to build virtual assistants that can converse with humans using natural language.
You need to use Text-to-speech in combination with the Speech API to perform speech recognition. The Text-to-speech and Speech APIs work in tandem to allow the computer to understand what you are saying. For example, if you are chatting with a bot and ask it to repeat something, the Text-to-speech API will transcribe the spoken words and the Speech API will match these words to what you said.
Conversation
The Conversation API is another useful API provided by Watson Cloud. This API provides a way for developers to have text-based conversations with users and have these conversations be stored and searchable. This means that if you build an application using the Conversation API, you can have the computer respond to user questions and remarks with contextual information.
For example, you can use this API to have the computer respond with information about an event or topic discussed in the conversation.
The Conversation API is useful for applications where you need to have a natural conversation with a human. You can use it in combination with the Text-to-speech API to have your chatbot converse with users. However, you should avoid using this API in applications where the computer’s output is critical, as the output is not always perfect and can sometimes be misinterpreted.
Image Search
The Image Search API is a way of having the computer perform image searches on a massive scale. This can be useful for building applications where you need to search for images, videos, and other content.
You can use the Image Search API to have the computer scan the web for images and display a result set. The only requirement is that the images must be stored on a publicly available web server. Image Search enables you to perform searches on hundreds of thousands of images at a time.
Language Translation
The Language Translation API is a way of having the computer translate text. This API is useful for having text read aloud in a different language. You can use this API in combination with the Text-to-speech and Conversation APIs to build an application that will translate text for you.
Some languages are more difficult to translate than others. The English language has been a source of great frustration for many people, especially those who try to use automated text-to-speech services that can’t keep up with the complexities of the English language. With the Language Translation API, you can have the computer learn your language and become an expert at translating English to your native language and back again.
Speech
The Speech API is a way of having the computer understand and produce sound. Like the Text-to-speech API, the Speech API is useful for building an application that will process audio files. However, unlike the Text-to-speech API, the Speech API provides direct access to ***recorded*** audio files. This means that you don’t have to transcribe audio files yourself.
Use the Speech API to have the computer perform general tasks like recording, editing, and identifying audio files. You don’t need to have any previous experience to build applications with this API.
Face Detection
The Face Detection API is used for identifying objects and people within images. You can use this API to detect faces in an image and perform additional actions depending on whether or not faces are detected. For example, if you want your application to be able to distinguish images of cats from images of dogs, you can use face detection to find and analyze facial features.
The Face Detection API is closely associated with the Computer Vision API, but you should not use them interchangeably. The Face Detection API provides an easy way for developers to implement facial recognition technology in their own applications. You should use the Computer Vision API to do facial recognition on your own. The Face Detection API is useful for simple tasks like identifying items in an image or determining the gender of a face.
Spell Check
The Spell Check API is used for checking the correctness of a word or phrase as you type it. This API is very similar to the Grammar Checker function in Microsoft Word. Just like the Spell Check API, Microsoft Word’s Grammar Checker can be used to detect errors in your writing as you are typing. You can use the Spell Check API to have your text automatically checked for grammar and spelling errors as you type.
As with the Grammar Checker in Microsoft Word, the Spell Check API provides helpful suggestions for your writing. This makes it easy for non-English speaking users to have their text checked for errors without needing an expert in the English language to check it for them.
Virtual Assistant
The Virtual Assistant API allows you to build intelligent applications that can interact with humans using natural language. This is similar to Robin Honey’s popular A.I. Virtual Assistant app, which you can read about on Apptopia.
The difference is that you don’t need to build a mobile app to use the Virtual Assistant API. You can use the API from any web browser.
For example, say you want to build a blog that interacts with readers using natural language. You can use the Virtual Assistant API to have your blog respond to comments with relevant information. Or, you can have your blog search for relevant products and recommend them based on what the user types in. Use the free version of the API to see how it works.
Machine Learning
The Machine Learning API provides developers with the tools they need to perform complex data analysis and train machines using large amounts of data. Just like the other APIs provided by Watson Cloud, this API is accessible from any web browser.
For developers who want to perform complex analysis on large datasets, the Machine Learning API provides the functionality they need to make that possible. This API provides developers with a robust set of tools that are useful for building intelligent applications that can learn and analyze data to determine trends and make predictions about the future. For example, you can use this API to have your blog analyze the most popular topics and determine what else should be written about.