Natural Language Processing
Technology Life Cycle
Sales growth slows as the market becomes saturated. The technology is well-established and competition peaks, leading to price drops and marginal improvements.
Technology Readiness Level (TRL)
Technology is operative and demonstrates considerable market competition among manufacturing industries.
Technology Diffusion
Skeptical and adopts technology only after it has become mainstream and the benefits are well proven.
Connecting resources from computer science, artificial intelligence and computational linguistics, natural language processing can capture natural human languages while evaluating the meaning and significance and completing tasks involving syntax, semantics, discourse, and speech. Human speech is not always precise, and often ambiguous; the linguistic structure can depend on complex variables with varying amounts of slang, dialects, and social context.
Nevertheless, technologies involving virtual assistants, automatic speech recognition, machine translation, question answering, and automatic text summarization have been improving drastically and even approaching human performance in some regards. In educational and training settings, this technology enables many professionals to gain access to these environments, for example, illiterate individuals and those with disabilities.
Deep neural networks are pushing the current boundaries of natural language processing, by encoding the semantic relationship between words into so-called word vectors. Word vector encodings can be learned automatically from large corpora of unstructured texts and have desirable properties such as being close if they have similar meaning and allowing semantic relationships between words to be expressed mathematically such as king – man + woman = queen without ever having been explicitly taught these.
By combining natural language processing and machine learning with the techniques of stylometry, the study of linguistic style rooted in the 15th century, it is possible to learn an author’s writing style and determine with increased accuracy which other texts have been written by the same person. This has already helped resolve long-standing disputes about the origin of certain historical texts and we have supposedly identified the author of the original Bitcoin white paper.
The applications of natural language processing are numerous as human language pervades almost every aspect of both online and offline life. In education, the content and cognitive complexity of a written essay could be automatically analyzed, undergo plagiarism detection, provide feedback and even give a score. Sentiment analysis is already being used for automated marketing campaigns, predictive trading, and news event classification.
Future Perspectives
The landscape of human-computer interactions is changing rapidly with natural language processing allowing data-informed design through conversational interfaces. Estimates claim unstructured data accounts for more than 90 percent of the digital universe, much of it coming in the form of text. With the exponential growth of data everywhere, humanity will depend on intelligent machines to extract, digest and present this data in meaningful ways. Applied to conversational robots and communication apps, the ability to naturally process human speech brings about a shift in technological development. Humans have been interacting with machines according to their language for decades now, but engineers and scientists are reverting this situation by teaching machines to understand human language and reproduce the same structure for more fluid and realistic interactions with users.
Automatic real-time translation could soon mean a world without language barriers. Imagine how much more productive we could be, or how many people we could learn from or talk to that we previously couldn’t. Without language barriers, the world opens up, especially to those who don’t have the privileges of first-world countries. The success of state-of-the-art neural language models which can represent meaning in the form of numerical vectors may allow us to wonder how the human brain processes language and help us understand the notion of consciousness, which is closely tied to semantic reasoning. If we can represent concepts numerically then maybe language becomes an implementation detail and we could choose to "render" a certain idea or concept to a language and writing style of our choice.
Image generated by Envisioning using Midjourney