Daniel Avancini — the chief data officer at Indicium, a data services company — shared with VKTR his views and expertise on data in AI chatbot development:
Developing AI Chatbots
- Which internal departments and teams should be involved in the data selection for AI chatbots, the data refinement and development, and the implementation?
- What types of internal and customer data should companies use to train AI chatbots?
- What data sources should companies use to train their AI chatbots?
- When should data for AI chatbots be updated and by what means?
- How can data teams help ensure AI chatbot accuracy and application-specific effectiveness?
- Why does sound data management matter in AI chatbot development and implementations?
- What's your opinion of AI chatbots in the market?
- What tasks are AI chatbots best suited for?
- Should companies build their AI chatbots, buy them or outsource their development?
Which internal departments and teams should be involved in the data selection for AI chatbots, the data refinement and development, and the implementation?
In developing AI chatbots, collaboration across various internal departments is crucial for success. For data selection, the data science team plays a pivotal role in identifying and sourcing relevant data sets that will train the chatbot effectively. Business analysts work alongside them to ensure the selected data aligns with the company’s objectives and meets user needs. Subject matter experts (SMEs) provide domain-specific insights, helping determine the relevance and accuracy of the data. The legal and compliance department must also be involved to oversee issues related to data privacy, consent and regulatory compliance, ensuring that all data used adheres to necessary laws and guidelines.During the data refinement and chatbot development phase, the role of the AI engineer becomes central. AI engineers collaborate with data engineers who handle data pre-processing, cleansing and transformation to prepare the data for training. Natural language processing (NLP) specialists work on developing and fine-tuning language models to understand and generate human-like text. AI engineers integrate these models into the chatbot, optimizing algorithms for better performance. Software development teams build the chatbot’s architecture, while UX/UI designers focus on creating an intuitive user interface and seamless user experience. Quality assurance (QA) teams are responsible for testing the chatbot to ensure it functions correctly and meets quality standards.
For the implementation stage, IT operations manage the deployment, monitoring and maintenance of the chatbot within the company’s infrastructure. AI engineers continue to be involved by monitoring the chatbot’s performance, addressing any technical issues and implementing updates or improvements. DevOps teams facilitate continuous integration and delivery pipelines to ensure that updates can be rolled out smoothly. The customer support team is essential for handling any escalations the chatbot cannot resolve and gathering user feedback to inform future improvements. Training and development teams work on educating both staff and users about how to interact effectively with the chatbot. Finally, the marketing department promotes the chatbot to customers, highlighting its benefits and encouraging adoption.
What types of internal and customer data should companies use to train AI chatbots?
Companies should leverage a diverse range of internal and customer data to train their AI chatbots effectively. Customer service transcripts and chat logs provide real-world examples of customer inquiries and issues, which help the chatbot understand common questions and appropriate responses. Emails between customers and support staff offer additional context and language styles used by customers. Frequently asked questions (FAQs) and knowledge base articles supply standardized answers to common queries, ensuring the chatbot can provide accurate information consistently.Incorporating user feedback and survey responses allows the chatbot to be trained on actual user preferences and pain points, enhancing its ability to meet customer needs. Company policies, procedures and product manuals are essential for the chatbot to provide information that is compliant and up-to-date regarding the company’s offerings. Product and service updates should also be included to ensure the chatbot is informed about the latest changes, features and promotions. By utilizing this comprehensive set of data, AI engineers can train chatbots to interact effectively with users, providing accurate and helpful responses that enhance customer satisfaction.
What data sources should companies use to train their AI chatbots?
Companies should utilize a combination of internal and external data sources to create a well-rounded AI chatbot. Internal databases, such as customer relationship management (CRM) systems and enterprise resource planning (ERP) data, provide valuable information about customer interactions and product details. Knowledge management systems offer access to internal documentation, policies and procedures the chatbot needs to understand to provide accurate information.
Public data sets and open-source resources can be used to enhance the chatbot’s language models, helping it understand and generate human-like responses. In some cases, integrating data from third-party data providers can enrich the chatbot’s knowledge base with specialized information. Social media interactions, when anonymized and used responsibly, can help the chatbot stay current with language trends and public sentiment. AI engineers are responsible for selecting appropriate data sources, ensuring data quality, and integrating them effectively into the chatbot’s training regimen.
When should data for AI chatbots be updated and by what means?
Updating data for AI chatbots should be a regular and systematic process to maintain their effectiveness and relevance. Data should be refreshed at regular intervals, such as monthly or quarterly, to incorporate routine updates in information. Immediate updates are necessary after significant changes, like new product launches, policy amendments or rebranding efforts to ensure the chatbot provides current information. Implementing continuous learning mechanisms allows the chatbot to adapt in real-time, learning from ongoing interactions and new data inputs.
User feedback integration is crucial. Incorporating insights from customer interactions helps AI engineers and data teams identify areas for improvement. Utilizing automated data pipelines and machine learning operations (MLOps) enables efficient data ingestion and model retraining processes. AI engineers play a key role in setting up these systems and ensuring that updates are seamlessly integrated into the chatbot’s functioning without significant downtime or manual intervention.
How can data teams help ensure AI chatbot accuracy and application-specific effectiveness?
Data teams are instrumental in ensuring that AI chatbots perform accurately and meet specific application needs. They start by conducting data validation, regularly checking data sets for accuracy, relevance and completeness. This process helps in building a solid foundation for the chatbot’s knowledge base. To mitigate bias, data teams analyze data sets to identify and address any inherent prejudices that could affect the chatbot’s interactions, ensuring fair and unbiased responses.
Performance monitoring is another critical area where data teams contribute. By analyzing chatbot interactions and user feedback, they can identify patterns that indicate areas for improvement. This information allows AI engineers to make targeted adjustments to the chatbot’s algorithms and responses. Collaborative development between data teams, AI engineers and other stakeholders ensures that the chatbot is aligned with business goals and user expectations.
Implementing feedback mechanisms enables the collection of user insights, which are invaluable for continuous improvement. Data teams work with AI engineers to incorporate this feedback into the chatbot’s learning process. Continuous training and updates are essential for keeping the chatbot relevant, and data teams facilitate this by providing updated data sets and retraining models as needed. Through these efforts, data teams help maintain the chatbot’s accuracy and enhance its effectiveness in serving users’ needs.
Why does sound data management matter in AI chatbot development and implementations?
Sound data management is foundational to the success of AI chatbot development and implementation. It ensures the data used to train the chatbot is of high quality — accurate, relevant and free from errors — which directly impacts the chatbot’s ability to provide reliable responses. Effective data management practices help in enhancing the user experience by reducing misunderstandings and increasing the chatbot’s usefulness to customers.
Moreover, proper data management is critical for compliance and security. It ensures sensitive information is protected and the company adheres to regulations, such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA). It also helps in preventing bias in the chatbot’s responses, which is essential for maintaining the company’s reputation and ensuring fair treatment of all users.
From an operational perspective, sound data management facilitates easier maintenance and scalability. Organized and well-documented data structures enable AI engineers and data teams to update and improve the chatbot more efficiently. This ongoing process is vital for the chatbot to remain effective in a dynamic business environment.
What's your opinion of AI chatbots in the market? 
AI chatbots have become much more versatile and sophisticated, but many companies still don’t fully leverage their capabilities. While they’re commonly used for handling FAQs and basic customer support, their real value lies in addressing more complex business challenges, such as automating routine processes, providing instant and accurate responses and even predicting customer behavior. When used effectively, chatbots can greatly reduce response times, enhance customer satisfaction and lower operational costs by taking over tasks that would otherwise require human intervention.To measure the ROI of these tools, companies should look beyond basic metrics, like cost savings and query volumes. Important KPIs include customer satisfaction scores, reduced response times, engagement levels and conversion rates. Additionally, tracking revenue generated directly through chatbot-driven lead generation, upselling and sales automation provides a clearer picture of their impact. When chatbots are used strategically, they also offer valuable insights into customer behavior, enabling businesses to make data-driven decisions and maintain a competitive edge.
What tasks are AI chatbots best suited for? 
Chatbots are great at handling repetitive, structured tasks, like answering FAQs, managing bookings and processing transactions. But they aren’t a one-size-fits-all solution. They struggle in areas requiring deep empathy, complex problem solving or creativity — situations where human intuition and flexibility are crucial. As chatbots become more advanced, their role will evolve to complement human efforts, focusing on tasks that require speed, consistency and scalability. Businesses that understand both the strengths and limitations of chatbots will be better positioned to use them wisely, maximizing their potential while keeping the human touch where it’s needed most.
Should companies build their AI chatbots, buy them or outsource their development?
The decision to build, buy or outsource chatbot development depends on the company’s specific needs, budget and technical resources. Building in-house offers the most control and customization, allowing the chatbot to be tailored to exact business requirements. However, this approach demands a significant investment in technology, skilled developers and ongoing maintenance, making it resource-intensive.
Off-the-shelf solutions, on the other hand, are quicker and more affordable. They come with pre-built features and integrations but may lack the flexibility to adapt to unique or evolving business needs.
Outsourcing to third-party experts can be a balanced approach, providing customized, scalable solutions without the overhead of internal development. This option allows companies to benefit from specialized expertise while managing costs effectively. Additionally, with a trusted vendor, third-party solutions can be just as secure as in-house developments, ensuring data privacy and security.
Ultimately, the best approach aligns with a company’s goals, technical capabilities and long-term strategy.