Innodata Inc. (NASDAQ:INOD) Q1 2024 Earnings Call Transcript

In this article:

Innodata Inc. (NASDAQ:INOD) Q1 2024 Earnings Call Transcript May 7, 2024

Innodata Inc. isn't one of the 30 most popular stocks among hedge funds at the end of the third quarter (see the details here).

Operator: Greetings. Welcome to Innodata First Quarter 2024 Results Conference Call. At this time, all participants are in a listen-only mode. A question-and-answer session will follow the formal presentation. [Operator Instructions] Please note, this conference is being recorded. I will now turn the conference over to your host, Amy Agress, General Counsel at Innodata. You may begin.

Amy Agress: Thank you, Paul. Good afternoon, everyone. Thank you for joining us today. Our speakers today are Jack Abuhoff, CEO of Innodata; and Marissa Espineli, Interim CFO. Also on the call today is, Aneesh Pendharkar, Senior Vice President of Finance and Corporate Development. We’ll hear from Jack first, who will provide perspective about the business, and then Marissa will follow with a review of our results for the fourth quarter. We’ll then take your questions. Before we get started, I’d like to remind everyone that during this call, we will be making forward-looking statements, which are predictions, projections or other statements about future events. These statements are based on current expectations, assumptions and estimates, and are subject to risks and uncertainties.

Actual results could differ materially from those contemplated by these forward-looking statements. Factors that could cause these results to differ materially are set forth in today’s earnings press release in the Risk Factor section of our Form 10-K, Form 10-Q and other reports and filings with the Securities and Exchange Commission. We undertake no obligation to update forward-looking information. In addition, during this call, we may discuss certain non-GAAP financial measures. In our SEC filings, which are posted on our website, you will find additional disclosures regarding these non-GAAP financial measures, including reconciliations of these measures with comparable GAAP measures. Thank you. I will now turn the call over to Jack.

Jack Abuhoff: Good afternoon. We are very excited to be here with you today. We have lots of updates to share regarding the accelerated momentum we are experiencing across our business. First and foremost, we are pleased to announce record revenues for the quarter of $26.5 million, representing 41% year-over-year growth. Our growth in the quarter was driven by the value we are bringing to help the world’s largest tech companies build AI large language models, or LLMs. As a result of accelerated business momentum, we are raising our 2024 revenue guidance to an expected organic revenue growth of at least 40% year-over-year. This is double the growth rate we guided to last quarter. We are executing a multipronged strategy to deliver – designed to deliver extraordinary levels of growth over the next several years as we extend what we believe is our early leadership in generative AI solutions.

We’re focused on providing solutions at three levels of the Gen AI stack at the bottom layer, helping some of the world’s largest tech companies and independent software vendors, or ISVs, develop generative AI foundation models. In the middle layer, helping enterprises that prefer not to build models from scratch, but rather to leverage existing LLMs and other AI customized for them with their own data. And at the top layer, building generative AI-enabled platforms that are useful for niche industry requirements. Our primary focus this year is on that first layer of the stack, partnering with some of the world’s largest tech companies to develop generative AI foundation models. We are pleased with the success we are having thus far. We entered the year with agreements in place with five of the so-called Magnificent Seven companies, which are our group of well known, high performing Big Tech companies we believe will spend billions of dollars on generative AI data engineering over the next several years.

We announced today that we rewarded yet another program expansion from one of our Big Tech customers. We’re valuing this expansion at approximately $23.5 million of annualized run rate revenue once implemented. This is on top of the $20 million in new programs with this customer we announced less than two weeks ago on April 24. We expect that these programs will ramp up over the next two months. While our customer agreements typically contain early termination upon notice provisions, we believe this customer is committed to a significant multiyear LLM strategy from which we stand to benefit. In fact, we are in discussions with this customer regarding potential new programs and expansions beyond what we have announced so far. We also signed two additional Big Tech companies, a large, prominent generative AI company and a large prominent consumer facing ISV, investing substantially in generative AI foundation models.

As a result of these new wins, we now serve seven Big Tech customers. We believe we will continue to grow with these customer relationships in 2024, and that we may grow some of them, possibly quite substantially. For Big Tech customers, we provide a broad range of services to support their generative AI programs. This includes creating instruction datasets, which you can think of as the programming behind large language models. It also includes human preference data used in reinforcement learning and reward modeling to align models to human preferences and build guardrails against toxic biases and harmful responses. In a blog post last month that accompanied a major release, one of our large Big Tech customers stated that the quality of these instruction data sets has an outsized influence on the performance of their models, and that some of their biggest improvements in model quality come from carefully crafted instruction data sets and multiple rounds of quality assurance.

This statement crystallizes why we have become the partner of choice for such customers. We believe we are well positioned to anticipate Big Tech’s changing needs and to grow with them. It is evident that Big Tech’s aspirations extend beyond today’s predominantly text-to-text English language models. We foresee expansion in terms of multimodal models, domain and task specific models, models natively built in more than 30 different languages, and models capable of complex reasoning. All of these dimensions will require modeling with the kind of data that we create. We believe we are still in the early innings of this journey. I encourage you to read the latest quarterly earnings transcripts from the Mag Seven, generative AI is a prevailing theme, with promises of more gen AI models, more gen AI and products, and commitments to multiyear investment cycles and CapEx increases to support aggressive AI research and product development.

We believe the emerging enterprise market, which we call the middle layer, consisting of companies across verticals that seek to adopt generative AI technologies to be another important growth factor for Innodata, one that will ultimately dwarf the Big Tech market for us. In parallel with executing strategies to penetrate Big Tech, we’re taking steps to prepare for what we foresee as a likely explosion in the enterprise space. We believe, we are very well positioned due to our intimate knowledge of the gen AI roadmap of large tech companies, which has enabled us to gain exceptional domain expertise and the future product needs of the enterprise market. We believe that enterprise adoption is about to enter a Cambrian period of explosive growth as a result primarily of three technology developments now underway.

I’ll explain and illustrate each with examples. Today, enterprise users of generative AI are mostly using ChatGPT as a standalone application. We’ll call this level-zero use case. For example, if I’m an HR Director at Innodata test with revising Innodata’s employee handbook, I can prompt ChatGPT to write a first draft of the vacation policy. Companies are now shifting from this level-zero to what we think of as level-one. We think of level-one systems as those based on Retrieval-Augmented Generation, or RAG, which we believe are likely to become better performing for reasons I will explain shortly. RAG systems couple search technology and prompt engineering with such a level-one system and Innodata employee might prompt an Innodata HR chatbot with a request like please summarize for me Innodata’s vacation policy.

A search engine working behind the scenes would then retrieve Innodata’s vacation policy from a large document repository and insert it into the prompt of context, with an instruction to the LLM to answer the question, primarily based on the inserted policy. RAG-based systems are about to become more useful as the latest crop of soon to be released LLMs or for significantly expanded context windows. A context window refers to the amount of retrieved information that can be included with a prompt. By including more context, the chatbot can become more consistent, relevant, and useful. One of the Big Tech companies is about to release a new model with a context window that is eight times larger than that of OpenAI’s GPT-4 Turbo, enabling you to include, for example, 3000 pages of documents in a single prompt.

Two hands hovering over a laptop keyboard, ready to execute data transformations.
Two hands hovering over a laptop keyboard, ready to execute data transformations.

Today’s expert or advanced expert augmentation systems are, for the most part RAG-based systems that combine generative AI with humans in the loop to deliver improved productivity. In a few minutes, I’ll give you an example of such a system we started working on for a customer in the quarter. We believe a second technology development called agentic workflows will enable what we’ll call level-two systems. With an agentic system, rather than asking a question to a chatbot, you present a goal to a virtual agent. Your virtual agent then accesses multiple back end systems and LLMs talk to each other to accomplish your goal. Agentic workflows really open up the kinds of things you can ask computers to do with LLMs. With an agentic system, an Innodata employee might ask a virtual Innodata agent, please look up how many days off Innodata employees get, check how many days off I have left, and request a week off around my son’s graduation, so long as there are still available hotels in Boston.

Imagine that. Now, while the full realization of agentic workflows may be years away, we believe incremental progress is being achieved and will likely accelerate. The third development that we believe will accelerate enterprise adoption is that the cost of training and serving models is likely to go down dramatically, making it possible for enterprises to train and serve models at scale. Once this happens, we believe that companies are likely to want to fine tune their own models rather than relying on RAG-based architectures. We’ll call these level-three systems. Level-three systems will support more complex use cases and enable sensitive information to be processed in private clouds or on premises rather than being served up as context to third-party foundation models.

We intend Innodata to offer enterprise all of the services they require to navigate the journey from level-zero to level-three and beyond. This will include custom development, integration and fine tuning services, as well as managed services around data readiness and data governance and industry specific workflow platforms. We are not alone in our thinking that the enterprise market for generative AI is about to explode. In a report last year, Bloomberg estimated the market for generative AI focused IT services will grow to nearly $22 billion by 2027 and to nearly $86 billion by 2032, representing a 100% compounded annual growth rate for the 10 year period from 2022 to 2032. To position ourselves to drive enterprise growth, we are expanding our talent base, creating new accelerators, and winning new reference engagements.

This quarter, one of the largest legal information companies in the world engaged us to develop a new LLM-based workflow system for their complex operational processes spanning legal and regulatory law in multiple European countries and multiple languages. This is an example of what I referred to a few minutes ago as an advanced level-one system. Our implementation uses GPT and a combination of several techniques, including chain of density prompt engineering and a vector database with similarity matching. Our generative AI-enabled workflow system is expected to enable the customer to drive significant operational savings across high cost processes that previously relied entirely on humans with language and legal expertise. This quarter, we also delivered a generative AI powered tool that gathers on the fly insights from large-scale textual data, contextually analyzes the data for specific areas of interest, and performs language translations.

The technology aims to increase organizations efficiency by ensuring knowledge workers are equipped with the intelligence needed to make informed decisions. We built it into our agility public relations platform where we call it Intelligent Insights. We made Intelligent Insights generally available to agility customers in the quarter, and it has been well received. To build the solution, we utilize RAG-based prompt engineering. We recently demonstrated it as an accelerator to one of our large banking customers and it inspired a POC that we are now executing. Agility revenue in the quarter increased 16.5% year-over-year. We have over 1,400 direct customers and we’re generating cash. We’ve been a leader in the industry, rolling out cutting edge generative AI functionality that is bending the productivity curve for PR and communications professionals.

We started early last year with the release of PR CoPilot, the generative AI implementation that helps people write press releases and media pitches. In Q1, we announced the general availability of Intelligent Insights. We are planning another five significant generative AI feature releases over the course of the second half of this year and into the first quarter next year. As a result of what we believe is our generative AI leadership in the PR space. In the first quarter, we converted over 35% of demos to wins, up from less than 20% prior to implementation of our generative AI roadmap. Our customers told us that one of their biggest challenges was that they needed more hours in the day. With our generative AI innovations, we’re making tactical PR a less labor intensive process, giving our customers the time back they need for strategic thinking.

For both the Big Tech market as well as the enterprise market, we see additional opportunities in model safety, evaluation and responsible and ethical AI. We began working on trust and safety for one of our Big Tech customers in Q4 2023, providing model assessment and benchmarking services, which help ensure that models meet performance, risk and emerging regulatory requirements. We learned a lot from the work and development we’ve been doing on this engagement, so we decided to share our learnings, tools and innovation with the market more broadly. Just a couple of weeks ago, we announced our release of an open-source LLM Evaluation Toolkit, together with a repository of 14 semi-synthetic and human crafted evaluation data sets that enterprises can utilize for evaluating the safety of their large language models in the context of enterprise tasks.

Using the toolkit and the datasets, data scientists can automatically test the safety of underlying LLMs across multiple harm categories simultaneously, developers can understand how their AI systems respond to a variety of prompts and can identify remedial fine tuning required to align the systems to the desired outcomes. We expect to release a commercial version of the toolkit and more extensive, continually updated benchmarking datasets later this year. In Q1, we won two additional engagements for LLM safety and evaluation. One for a hyperscaler’s own foundation models and one for an enterprise customer of the hyperscaler through the white label program we have in place with the hyperscaler. In addition, in Q1 2024, we started pilots for a new customer and an existing customer around LLM Trust and Safety.

I’ll conclude with this we believe we have an incredible opportunity in front of us. We believe we have the talent, capabilities and scalability to support the world’s leading company’s efforts to build AI models and services and to help enterprises advanced AI and generative AI technologies. We believe we can drive best-in-class growth over the next several years and maintain our early leadership position in generative AI services. Moreover, we believe we can accomplish this without the need to raise equity, to incur debt, or to burn cash. This year, based on our current growth forecast, we intend to invest approximately $3.5 million in recruiting costs to scale our business and approximately another $3 million in new sales, marketing and product development talent.

The recruiting costs relate to the significant increase in revenues we expect this year and will not be incurred next year to support that revenue going forward. The investment in sales, marketing and product development are encouraged to continue our growth momentum and we anticipate that they will yield revenue and profitability benefits primarily next year and beyond. We anticipate approximately 70% of the recurring of the recruiting costs to be incurred in Q2 and most of the OpEx investment to be incurred in the second half of the year. We are making these investments while simultaneously driving year-over-year growth in adjusted EBITDA and building cash on our balance sheet. At the end of Q1, our cash balances were $19 million, up from $13.8 million at the end of Q4 2023, driven by positive cash flow from operations and tight working capital management.

I’ll now turn the call over to Marissa to go over the numbers and then we’ll open the line for some questions.

Marissa Espineli: Thank you, Jack, and good afternoon everyone. Let me briefly share with you our 2024 first quarter financial results. Revenue was $26.5 million, up 41% from $18.8 million in the same period last year. Net income was $1 million, or $0.03 per basic and diluted share, compared to a net loss of $2.1 million, or $0.08 per basic and diluted share in the same period last year. The adjusted EBITDA was $3.8 million compared to adjusted EBITDA of $0.8 million in the same period last year. Our cash and cash equivalent and short term investments were $19 million at March 31, 2024 and $13.8 million at December 31, 2023. We currently have unused line of credit of $10 million with $9.2 million as borrowing days. So thank you everyone. Paul, we’re ready for questions.

See also

11 Best Music Stocks to Invest In and

Top 20 Tech Companies in Silicon Valley.

To continue reading the Q&A session, please click here.

Advertisement