Framework Benchmarks Round 20

February 8, 2021

Nate Brady

 

Today we announce the results of the twentieth official round of the TechEmpower Framework Benchmarks project.

Now in its eighth year, this project measures the high-water mark performance of server-side web application frameworks and platforms using predominantly community-contributed test implementations. The project has processed more than 5,200 pull requests from contributors.

Round 20 Updates from our contributors

In the months between Round 19 and Round 20, about four hundred pull requests were processed. Some highlights shared by our contributors:

(Please reach out if you are a contributor and didn’t yet get a chance to share your updates. We’ll get them added here.)

Notes

Thanks again to contributors and fans of this project! As always, we really appreciate your continued interest, feedback, and patience!

Round 20 is composed of:

Framework Benchmarks Round 19

May 28, 2020

Nate Brady

 

Round 19 of the TechEmpower Framework Benchmarks project is now available!

This project measures the high-water mark performance of server-side web application frameworks and platforms using predominantly community-contributed test implementations. Since its inception as an open source project in 2013, community contributions have been numerous and continuous. Today, at the launch of Round 19, the project has processed more than 4,600 pull requests!

We can also measure the breadth of the project using time. We continuously run the benchmark suite, and each full run now takes approximately 111 hours (4.6 days) to execute the current suite of 2,625 tests. And that number continues to grow steadily as we receive further test implementations.

Composite scores and TPR

Round 19 introduces two new features in the results web site: Composite scores and a hardware environment score we’re calling the TechEmpower Performance Rating (TPR). Both are available on the Composite scores tab for Rounds 19 and beyond.

Composite scores

Frameworks for which we have full test coverage will now have composite scores, which reflect an overall performance score across the project’s test types: JSON serialization, Single-query, Multi-query, Updates, Fortunes, and Plaintext. For each round, we normalize results for each test type and then apply subjective weights for each (e.g., we have given Fortunes a higher weight than Plaintext because Fortunes is a more realistic test type).
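
To make the mechanics concrete, here is a minimal sketch of that normalize-then-weight idea in Java. The method and weight values are illustrative only; the GitHub wiki documents the project’s actual formula.

import java.util.Map;

class CompositeScore {
  // Hypothetical inputs: requests-per-second results for one framework,
  // the round's best result per test type, and subjective weights.
  static double compute(Map<String, Double> results,
                        Map<String, Double> bestPerTestType,
                        Map<String, Double> weights) {
    double score = 0.0;
    for (Map.Entry<String, Double> e : results.entrySet()) {
      String testType = e.getKey();
      // Normalize against the best result for this test type (0.0 to 1.0),
      // then apply the test type's subjective weight.
      double normalized = e.getValue() / bestPerTestType.get(testType);
      score += weights.get(testType) * normalized;
    }
    return score;
  }
}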

When additional test types are added, frameworks will need to include implementations of these test types to be included in the composite score chart.

You can read more about composite scores at the GitHub wiki.

TechEmpower Performance Rating (TPR)

With the composite scores described above, we are now able to use web application frameworks to measure the performance of hardware environments. This is an exploration of a new use-case for this project that is unrelated to the original goal of improving software performance. We believe this could be an interesting measure of hardware environment performance because it’s a holistic test of compute and network capacity, and based on a wide spectrum of software platforms and frameworks used in the creation of real-world applications. We look forward to your feedback on this feature.

Right now, the only hardware environments being measured are our Citrine physical hardware environment and Azure D3v2 instances. However, we are implementing a means for users to contribute and visualize results from other hardware environments for comparison.

Hardware performance measurements must use the specific commit for a round (such as 801ee924 for Round 19) to be comparable, since the test implementations continue to evolve over time.

Because a hardware performance measurement shouldn’t take 4.6 days to complete, we use a subset of the project’s immense number of frameworks when measuring hardware performance. We’ve selected and flagged frameworks that represent the project’s diversity of technology platforms. Any results files that include this subset can be used for measuring hardware environment performance.

The set of TPR-flagged frameworks will evolve over time, especially if we receive further input from the community. Our goal is to constrain a run intended for hardware performance measurement to several hours of execution time rather than several days. As a result, we want to keep the total number of flagged frameworks somewhere between 15 to 25.

You can read more about TPR at the GitHub wiki.

Other Round 19 Updates

Once again, Nate Brady tracked interesting changes since the previous round at the GitHub repository for the project. In summary:

Notes

Thanks again to contributors and fans of this project! As always, we really appreciate your continued interest, feedback, and patience!

Round 19 is composed of:

Framework Benchmarks Round 18

July 9, 2019

Nate Brady

 

Round 18 of the TechEmpower Framework Benchmarks project is now available!

When we posted the previous round in late 2018, the project had processed about 3,250 pull requests. Today, with Round 18 just concluded, the project is closing in on 4,000 pull requests. We are repeatedly surprised and delighted by the contributions and interest from the community. The project is immensely fun and useful for us and we’re happy it is useful for so many others as well!

Notable for Round 18

Nate Brady tracked interesting changes since the previous round at the GitHub repository for the project. Several of these are clarifications of requirements for test implementations. In summary:

  • Thanks to An Tao (@an-tao), we clarified that the “Date” header in HTTP responses must be accurate. It is acceptable for it to be recomputed by the platform or framework once per second, and cached as a string or byte buffer for the duration of that second. (A sketch of this tactic follows this list.)
  • To keep frameworks from breaking the test environments by consuming too much memory, the toolset now limits the amount of memory provided to the containers used by test implementations.
  • The requirements for the Updates test were clarified to permit a single update. We are still considering whether to classify test implementations by whether they use this tactic.
  • The requirements were clarified to specify that caching or memoization of the output of JSON serialization is not permitted.
  • The toolset now more strictly validates that responses provide the correct JSON serialization.
  • Cloud tests in Azure are using Azure’s accelerated networking feature.
  • Postgres has been upgraded to version 11.
  • Nikolay Kim (@fafhrd91) explained the tactics used by Actix to achieve record performance on the Fortunes test.
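
As a sketch of that once-per-second Date header tactic (illustrative only; the class and field names here are not from any particular framework):

import java.time.ZoneOffset;
import java.time.ZonedDateTime;
import java.time.format.DateTimeFormatter;

final class DateHeaderCache {
  private volatile long cachedSecond = -1;
  private volatile String cachedValue = "";

  String dateHeader() {
    long nowSecond = System.currentTimeMillis() / 1000;
    if (nowSecond != cachedSecond) {
      // Recompute at most once per second; the header stays accurate.
      // (A benign race: concurrent threads may recompute the same value.)
      cachedValue = DateTimeFormatter.RFC_1123_DATE_TIME
          .format(ZonedDateTime.now(ZoneOffset.UTC));
      cachedSecond = nowSecond;
    }
    return cachedValue;
  }
}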

Other updates

  • Round 18 now includes just over two hundred test implementations (which we call “frameworks” for simplicity).
  • The results web site now includes a dark theme, because dark themes are the new hotness. Select it at the bottom right of the window when viewing results.

Notes

Thank you to all contributors and fans of this project! As always, we really appreciate your continued interest, feedback, and patience!

Round 18 is composed of:

Framework Benchmarks Round 17

October 30, 2018

Nate Brady

 

We’re happy to announce that Round 17 of the TechEmpower Framework Benchmarks project is now available. Since the adoption of Continuous Benchmarking, the creation of an official Round is a fairly simple process:

  1. Try to reduce errors in framework implementations. We want an official Round to have a solid showing by as many frameworks as feasible given limited personnel bandwidth.
  2. Select a continuous run on the physical hardware (named “Citrine”) that looks good and identify its Git commit.
  3. Run the same commit on cloud (Azure).
  4. Write a blog entry and post the Round.

For weeks, we have been stuck at step 4 waiting on me to write something, so I’m going to keep it short and sweet to get this out before Halloween.

Stratified database results

As you review Round 17 results, you’ll notice that Postgres database tests are stratified—there are groups of test implementations that seem implausibly faster than other test implementations.

The underlying cause of this is use of a Postgres protocol feature we have characterized as “query pipelining” because it is conceptually similar to HTTP pipelining. We call it pipelining, but you could also call it multiplexing. It’s a feature of the “Extended Query” protocol in Postgres. Query pipelining allows database clients to send multiple queries without needing to wait for each response before sending the next. It’s similar to batching but provided invisibly in the driver. Theoretically (and we believe in practice) short queries could be sent together in the same network packets.
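
To make the idea concrete, here is a conceptual sketch. PipelinedConnection and Row are hypothetical stand-ins, not the API of any real Postgres driver; the point is that all queries are written to the socket without waiting, and responses are matched up as they arrive.

import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CompletableFuture;
import java.util.stream.Collectors;

interface PipelinedConnection {
  // Hypothetical: writes the query immediately and returns a future result.
  CompletableFuture<Row> query(String sql, Object... params);
}

final class Row {
  final int id;
  final int randomNumber;
  Row(int id, int randomNumber) { this.id = id; this.randomNumber = randomNumber; }
}

final class WorldRepository {
  List<Row> fetchWorlds(PipelinedConnection conn, int[] ids) {
    List<CompletableFuture<Row>> pending = new ArrayList<>();
    for (int id : ids) {
      // No round-trip wait between sends: the driver packs these queries
      // together on the wire. The business logic is unchanged.
      pending.add(conn.query("SELECT id, randomNumber FROM World WHERE id = $1", id));
    }
    return pending.stream().map(CompletableFuture::join).collect(Collectors.toList());
  }
}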

Importantly (at least for us), the client application’s business logic is unchanged. This is not batching of queries implemented by the application or web frameworks, but rather an optimization provided invisibly by the database driver.

We discussed the pros and cons of permitting this optimization in our tests. While we have disallowed several performance-optimizing tactics elsewhere as violations of the rules or spirit of our tests, we ultimately arrived at permitting this optimization for two reasons. First, it can be applied “seamlessly” to application code. The application code does not need to be aware this is happening. And second, because this isn’t a trick applied at the application tier, but rather an optimization that potentially benefits all applications (once implemented by drivers), we feel this is precisely the sort of improvement this project should encourage.

In short, we think this feature in the Postgres wire protocol is great. We suspect that over time, more platforms will support the “pipelining” capability, gradually re-balancing the stratification we’re seeing.

The effect of this feature is most emphatic on the Multi-query and Updates tests, both of which execute a tremendous number of queries per second. We are also considering adding an attribute to indicate and filter test implementations using traditional database connections versus those using pipelining.

Other updates

  • Round 17 is now measuring 179 frameworks.
  • New languages such as F# are now represented.
  • The results web site is a bit wider in response to feedback.

In the works

We are presently working on a few things that we hope to share with the community soon. These include:

  • Potential changes to the physical environment’s network to allow further differentiation of ultra high-performance frameworks on the Plaintext test in particular.
  • Badges for framework maintainers to share and celebrate their performance tier, similar to popular continuous integration badges seen in GitHub repositories.

Notes

Thank you to all contributors and fans of this project. As always, we really appreciate your continued interest, feedback, and patience!

Round 17 is composed of:

Framework Benchmarks Round 16

June 6, 2018

Nate Brady

 

Now in its fifth year, the TechEmpower Framework Benchmarks project has another official round of results available. Round 16 is a real treat for anyone who likes big numbers. Not just in measured results per second (several metric crap tonne), but in number of tests measured (~1830), number of framework permutations tested (~464), number of languages included (26), and total execution time of the test suite (67 hours, or 241 billion microseconds to make that sound properly enormous). Take a look at the results and marvel in the magnitude of the numbers.

Recent months have been a very exciting time for this project. Most importantly, the community has been contributing some amazing test implementations and demonstrating the fun and utility of some good-natured performance competition. More on that later. This is a longer-than-average TFB round announcement blog entry, but there is a lot to share, so bear with me.

Dockerification… Dockerifying… Docking?

After concluding Round 15, we took on the sizeable challenge of converting the full spectrum of ~460 test implementations from our home-brew quasi-sandboxed (only mostly sandboxed) configuration to a stupefying array of Docker containers. It took some time, but The Great Dockerification has yielded great benefits.

Most importantly, thanks to Dockerizing, the reproducibility and consistency of our measurements is considerably better than previous rounds. Combined with our continuous benchmarking, we now see much lower variability between each run of the full suite.

Across the board, our sanity checking of performance metrics has indicated Docker’s overhead is immeasurably minute. It’s lost in the noise. And in any event, whatever overhead Docker incurs is uniformly applicable as all test implementations are required to be Dockered.

Truly, what we are doing with this project is a perfect fit for Docker. Or Docker is a perfect fit for this. Whichever. Our only regret is not having done this earlier (if only someone had told us about Docker!). That and not knowing what the verb form of Docker is.

Dockerificationization.

New hardware

As we mentioned in March, we have a new physical hardware environment for Round 16. Nicknamed the “Citrine” environment, it is three homogeneous Dell R440 servers, each equipped with a Xeon Gold 5120 CPU. Characterized as entry- or mid-tier servers, these are nevertheless turning out to be impressive when paired with a 10-gigabit Ethernet switch.

Being freshly minted by Dell and Cisco, this new environment is notably quicker than equipment we have used in previous rounds. We have not produced a “difference” view between Round 15 and Round 16 because there are simply too many variables—most importantly this new hardware and Docker—to make a comparison remotely relevant. But in brief, Round 16 results are higher than Round 15 by a considerable margin.

In some cases, the throughput is so high that we have a new challenge from our old friend, “network saturation.” We last were acquainted with this adversary in Round 8, in the form of a Giant Sloar, otherwise known as one-gigabit Ethernet. Now The Destructor comes to us laughing about 10-gigabit Ethernet. But we have an idea for dealing with Gozer.

(Thanks again to Server Central for the previous hardware!)

Convergence in Plaintext and JSON serialization results

In Round 16, and in the continuous benchmarking results gathered prior to finalizing the round, we observed that the results for the Plaintext and JSON serialization tests were converging on theoretical maximums for 10-gigabit networking.

Yes, that means that there are several frameworks and platforms that—when allowed to use HTTP pipelining—can completely saturate ten gigabits per second with ~140-byte response payloads using relatively cheap commodity servers. (Back of the envelope: 10 gigabits per second is 1.25 gigabytes per second, which works out to roughly nine million such responses per second, before accounting for protocol overhead.)

To remove the network bottleneck for future rounds, we are concocting a plan to cross the streams, in a manner of speaking: use lasers and the fiberoptic QSFP28 ports on our Cisco switch to bump the network capacity up a notch.

Expect to hear more about this as the plan develops during Round 17.

Continuous benchmarking

Introduced prior to Round 16, the continuous benchmarking platform really came into a fully-realized state in the past several months. Combined with the Great Dockening, we now see good (not perfect, but good) results materializing automatically every 67 hours or thereabouts.

Some quick points to make here:

  • We don’t expect to have perfection in the results. Perfection would imply stability of code and implementations and that is not at all what we have in mind for this project. Rather, we expect and want participants to frequently improve their frameworks and contribute new implementations. We also want the ever-increasing diversity of options for web development to be represented. So expecting perfection is incongruous with tolerating and wanting dynamism.
  • A full suite run takes 67 hours today. This fluctuates over time as implementation permutations are added (or deleted).
  • Total execution time will also increase when we add more test types in the future. And we are still considering increasing the duration of each individual benchmarking exercise (the duration for which we run the load generator to gather a single result datum). That is the fundamental unit of time for this project, so increasing that will approximately linearly increase the total execution time.
  • We have already seen tremendous social adoption of the continuous benchmarking results. For selfish reasons, we want to continue creating and posting official rounds such as today’s Round 16 periodically. (Mostly so that we can use the opportunity to write a blog entry and generate hype!) We ask that you humor us and treat official rounds as the super interesting and meaningful events that they are.
  • Jokes aside, the continuous results are intended for contributors to the project. The official rounds are less-frequent snapshots suitable for everyone else who may find the data interesting.

Social media presence

As hinted above, we created a Twitter account for the TechEmpower Framework Benchmarks project: @TFBenchmarks. Don’t don’t @ us.

Engaging with the community this way has been especially rewarding during Round 16 because it coincided with significant performance campaigns from framework communities. Rust has blasted onto the server-side performance scene with several ultra high-performance options that are competing alongside C, C++, Go, Java, and C#.

Speaking of C#, a mainstream C# framework from a scrappy startup named Microsoft has been taking huge leaps up the charts. ASP.NET Core is not your father’s ASP.NET.

Warming our hearts with performance

There is no single reason we created this project over five years ago. It was a bunch of things: frustration with slow web apps; a desire to quantify the strata of high-water marks across platforms; confirming or dispelling commonly-held hunches or prevailing wisdom about performance.

But most importantly, I think, we created the project with a hopeful mindset of “perhaps we can convince some people to invest in performance for the better of all web-app developers.”

With expectations set dutifully low from the start, we continue to be floored by statements that warm our heart by directly or indirectly suggesting an impact of this project.

When asked about this project, I have often said that I believe that performance improvements are best made in platforms and frameworks because they have the potential to benefit the entire space of application developers using those platforms. I argue that if you raise the framework’s performance ceiling, application developers get the headroom—which is a type of luxury—to develop their application more freely (rapidly, brute-force, carefully, carelessly, or somewhere in between). In large part, they can defer the mental burden of worrying about performance, and in some cases can defer that concern forever. Developers on slower platforms often have so thoroughly internalized the limitations of their platform that they don’t even recognize the resulting pathologies: Slow platforms yield premature architectural complexity as the weapons of “high-scale” such as message queues, caches, job queues, worker clusters, and beyond are introduced at load levels that simply should not warrant the complexity.

So when we see developers upgrade to the latest release of their favorite platform and rejoice over a performance win, we celebrate a victory. We see a developer kicking performance worry further down the road with confidence and laughter.

I hope all of the participants in this project share in this celebration. And anyone else who cares about fast software.

On to Round 17!

Technical notes

Round 16 is composed of:

Framework Benchmarks Hardware Update

March 13, 2018

Nate Brady

 

We have retired the hardware environment provided by Server Central for our Web Framework Benchmarks project. We want to sincerely thank Server Central for having provided servers from their lab environment to our project.

Their contribution allowed us to continue testing on physical hardware with 10-gigabit Ethernet. Ten-gigabit Ethernet gives the highest-performing frameworks opportunity to shine. We were particularly impressed by Server Central’s customer service and technical support, which was responsive and helpful in troubleshooting configuration issues even though we were using their servers free of charge. (And since the advent of our Continuous Benchmarking, we were essentially using the servers at full load around the clock.)

Thank you, Server Central!

New hardware for Round 16 and beyond

For Round 16 and beyond, we are happy to announce that Microsoft has provided three Dell R440 servers and a Cisco 10-gigabit switch. These three servers are homogeneous, each configured with an Intel Xeon Gold 5120 CPU (14/28 cores at 2.2/3.2 GHz), 32 GB of memory, and an enterprise SSD.

If your contributed framework or platform performs best with hand-tuning based on cores, please send us a pull request to adjust the necessary parameters.

These servers together compose a hardware environment we’ve named “Citrine” and are visible on the TFB Results Dashboard. Initial results are impressive, to say the least.

Adopting Docker for Round 16

Concurrent to the change in hardware, we are hard at work converting all test implementations and the test suite to use Docker. There are several upsides to this change, the most important being better isolation. Our past home-brew mechanisms to clean up after each framework were, at times, akin to whack-a-mole as we encountered new and fascinating ways in which software may refuse to stop after being subjected to severe levels of load.

Docker will be used uniformly—across all test implementations—so any impact will be imparted on all platforms and frameworks equally. Our measurements indicate trivial performance impact versus bare metal: on the order of less than 1%.

As you might imagine, the level of effort to convert all test implementations to Docker is not small. We are making steady progress. But we would gladly accept contributions from the community. If you would like to participate in the effort, please see GitHub issue #3296.

Framework Benchmarks Round 15

February 14, 2018

Nate Brady

As of 2018-03-13, Azure results for Round 15 have been posted. These were not available when Round 15 was originally published.

What better day than Valentine’s Day to renew one’s vow to create high-performance web applications? Respecting the time of your users is a sure way to earn their love and loyalty. And the perfect start is selecting high-performance platforms and frameworks.

Results from Round 15 of the Web Framework Benchmarks project are now available! Round 15 includes results from the physical hardware environment at Server Central and cloud results from Microsoft Azure.

We ❤️ Performance

High-performance software warms our hearts like a Super Bowl ad about water or an NBC Olympics athlete biography.

But really, who doesn’t love fast software? No one wants to wait for computers. There are more important things to do in life than wait for a server to respond. For programmers, few things are as rewarding as seeing delighted users, and respecting users’ time is a key element of achieving that happiness.

Among the many effects of this project, one of which we are especially proud is how it encourages platforms and frameworks to be fast—to elevate the high-water marks of performance potential. When frameworks and platforms lift their performance ceiling upward, application developers enjoy the freedom and peace of mind of knowing they control their applications’ performance fate. Application developers can work rapidly or methodically; they can write a quick implementation or squeeze their algorithms to economize on milliseconds; they can choose to optimize early or later. This flexibility is made possible when the framework and platform aren’t boxing out the application—preemptively consuming the performance pie—leaving only scraps for the application developer. High-performance frameworks take but a small slice and give the bulk of the pie to the application developer to do with as they please.

This Valentine’s Day, respect yourself as a developer, own your application’s performance destiny, and fall in love with a high-performance framework. Your users will love you back.

Love from the Community

Community contributions to the project continue to amaze us. As of Round 15, we have processed nearly 2,500 pull requests and the project has over 3,000 stars on GitHub. We are honored by the community’s feedback and participation.

We are routinely delighted to see the project referenced elsewhere, such as this project that monitors TCP connections, which used our project to measure overhead, or the hundreds of GitHub issues discussing the project within other repositories. We love knowing others receive value from this project!

More Immediate Results for Contributors

When you are making contributions to this project, you want to see the result of your effort so you can measure and observe performance improvements. You also need log files when things don’t go as expected. To help accelerate the process, we have made the output of our continuous benchmarking platform available as a results dashboard. Our hardware test environment is continuously running, so new results are available every few days (at this time, a full run takes approximately 90 hours). As each run completes, a raw results.json file will be posted as well as zipped log files and direct links to log files for frameworks that encountered significant testing errors. We hope this will streamline the process of troubleshooting contributions.

We used run ed713ee9 from Server Central and run a1110174 from Azure.

In Progress

We are working to update the entire suite to Ubuntu 16 LTS and aim to be able to migrate to Ubuntu 18 LTS soon after it’s available. This update will allow us to keep up with several features in both hardware and cloud environments, such as Azure’s Accelerated Networking. Watch the GitHub project for more updates on this as they arrive!

Thank You!

Thank you so much to all of the contributors! Check out Round 15 and if you are a contributor to the project or just keenly interested, keep an eye on continuous results.

Framework Benchmarks Round 14

May 10, 2017

Nate Brady

 

Results from Round 14 of the Web Framework Benchmarks project are now available! This round’s results are limited to the physical hardware environment only, but cloud results will be included again in the next round.

Recent improvements

Our efforts during Round 14 focused on improvements that help us manage the project, mostly by removing some of our manual work.

Continuous Benchmarking

When we are not running one-off tests or modifying the toolset, the dedicated physical hardware environment at Server Central is continuously running the full benchmark suite. We call this “Continuous Benchmarking.” As Round 14 was wrapping up, Continuous Benchmarking allowed us to more rapidly deploy multiple preview rounds for review by the community than we have done in previous rounds.

Going forward, we expect Continuous Benchmarking to facilitate immediate progression into community-facing previews of Round 15. We hope to have the first Round 15 preview within a few days.

Paired with the continuous benchmarker is an internally-facing dashboard that shows us how things are progressing. We plan to eventually evolve this into an externally-facing interface for project contributors.

Differences

Contributors and the project’s community will have seen several renderings of the differences between Round 13 and Round 14. The final capture of differences from Round 13 to Round 14 is an example. These help us confirm changes that are planned or expected and also identify unexpected changes or volatility.

We have, in fact, observed volatility with a small number of frameworks and aim to investigate and address each as time permits. Although the benchmarking suite includes two phases of warmup prior to gathering data for each test, we might find that some frameworks or platforms require additional warmup time to be consistent across multiple measurements.

Mention-bot

We added Facebook’s mention-bot into the project’s GitHub repository. This has helped keep past contributors in the loop if and when changes are made to their prior contributions. For example, if a contributor updates the Postgres JDBC driver for the full spectrum of JVM frameworks, the original contributors of those frameworks will be notified by mention-bot. This allows for widespread changes such as a driver update while simultaneously allowing each contributor to override changes according to their framework’s best practices.

Previously, we had to either manually notify people or do a bit of testing on our own to determine if the update made sense. In practice, this often meant not bothering to update the driver, which isn’t what we want. (Have you seen the big performance boost in the newer Postgres JDBC drivers?)

Community contributions

This project includes a large amount of community-contributed code. Community contributions are up recently and we believe that is thanks to mention-bot. We expect to pass the milestone of 2,000 Pull Requests processed within a week or two. That is amazing.

Thank you so much to all of the contributors! Check out Round 14 and then on to Round 15!

EnumSet and EnumMap

February 14, 2017

Michael Hixson

This article discusses java.util.EnumSet and java.util.EnumMap from Java’s standard libraries.

What are they?

EnumSet and EnumMap are compact, efficient implementations of the Set and Map interfaces. They have the constraint that their elements/keys come from a single enum type.

Like HashSet and HashMap, they are modifiable.
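
For example, both are created and used much like their Hash{Set,Map} counterparts:

import java.time.DayOfWeek;
import java.util.EnumMap;
import java.util.EnumSet;
import java.util.Map;
import java.util.Set;

Set<DayOfWeek> workdays = EnumSet.range(DayOfWeek.MONDAY, DayOfWeek.FRIDAY);
workdays.remove(DayOfWeek.FRIDAY); // modifiable, like HashSet

Map<DayOfWeek, String> abbreviations = new EnumMap<>(DayOfWeek.class);
abbreviations.put(DayOfWeek.SATURDAY, "Sat"); // modifiable, like HashMap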

In contrast to HashSet, EnumSet:

  • Consumes less memory, usually.
  • Is faster at all the things a Set can do, usually.
  • Iterates over elements in a predictable order (the declaration order of the element type’s enum constants).
  • Rejects null elements.

In contrast to HashMap, EnumMap:

  • Consumes less memory, usually.
  • Is faster at all the things a Map can do, usually.
  • Iterates over entries in a predictable order (the declaration order of the key type’s enum constants).
  • Rejects null keys.

If you’re wondering how this is possible, I encourage you to look at the source code:

  • EnumSet

    A bit vector of the ordinals of the elements in the Set. This is an abstract superclass of RegularEnumSet and JumboEnumSet.

  • RegularEnumSet

    An EnumSet whose bit vector is a single primitive long, which is enough to handle all enum types having 64 or fewer constants.

  • JumboEnumSet

    An EnumSet whose bit vector is a long[] array, which is allocated however many slots are necessary for the given enum type. Two slots are allocated for 65 to 128 constants, three slots for 129 to 192 constants, etc.

  • EnumMap

    A flat array of the Map‘s values indexed by the ordinals of their keys.

EnumSet and EnumMap cheat! They use privileged code like this:

/**
 * Returns all of the values comprising E.
 * The result is uncloned, cached, and shared by all callers.
 */
private static <E extends Enum<E>> E[] getUniverse(Class<E> elementType) {
    return SharedSecrets.getJavaLangAccess()
                        .getEnumConstantsShared(elementType);
}

If you want all the Month constants, you might call Month.values(), giving you a Month[] array. There is a single backing array instance of those Month constants living in memory somewhere (a private field in the Class object for Month), but it wouldn’t be safe to pass that array directly to every caller of values(). Imagine if someone modified that array! Instead, values() creates a fresh clone of the array for each caller.
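
You can see that cloning step for yourself:

Month[] a = Month.values();
Month[] b = Month.values();
System.out.println(a == b); // false: each call returns a fresh clone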

EnumSet and EnumMap get to skip that cloning step. They have direct access to the backing array.

Effectively, no third-party versions of these classes can be as efficient. Third-party libraries that provide enum-specialized collections tend to delegate to EnumSet and EnumMap. It’s not that the library authors are lazy or incapable; delegating is the correct choice for them.

When should they be used?

Historically, Enum{Set,Map} were recommended as a matter of safety, taking better advantage of Java’s type system than the alternatives.

Prefer enum types and Enum{Set,Map} over int flags.

Effective Java goes into detail about this use case for Enum{Set,Map} and enum types in general. If you write a lot of Java code, then you should read that book and follow its advice.

Before enum types existed, people would declare flags as int constants. Sometimes the flags would be powers of two and combined into sets using bitwise arithmetic:

static final int OVERLAY_STREETS  = 1 << 0;
static final int OVERLAY_ELECTRIC = 1 << 1;
static final int OVERLAY_PLUMBING = 1 << 2;
static final int OVERLAY_TERRAIN  = 1 << 3;

void drawCityMap(int overlays) { ... }

drawCityMap(OVERLAY_STREETS | OVERLAY_PLUMBING);

Other times the flags would start at zero and count up by one, and they would be used as array indexes:

static final int MONSTER_SLIME    = 0;
static final int MONSTER_GHOST    = 1;
static final int MONSTER_SKELETON = 2;
static final int MONSTER_GOLEM    = 3;

int[] kills = getMonstersSlain();

if (kills[MONSTER_SLIME] >= 10) { ... }

These approaches got the job done for many people, but they were somewhat error-prone and difficult to maintain.

When enum types were introduced to the language, Enum{Set,Map} came with them. Together they were meant to provide better tooling for problems previously solved with int flags. We would say, “Don’t use int flags, use enum constants. Don’t use bitwise arithmetic for sets of flags, use EnumSet. Don’t use arrays for mappings of flags, use EnumMap.” This was not because the enum-based solutions were faster than int flags — they were probably slower — but because the enum-based solutions were easier to understand and implement correctly.

Fast forward to today, I don’t see many people using int flags anymore (though there are notable exceptions). We’ve had enum types in the language for more than a decade. We’re all using enum types here and there, we’re all using the collections framework. At this point, while Effective Java‘s advice regarding Enum{Set,Map} is still valid, I think most people will never have a chance to put it into practice.

Today, we’re using enum types in the right places, but we’re forgetting about the collection types that came with them.

Prefer Enum{Set,Map} over Hash{Set,Map} as a performance optimization.

  • Prefer EnumSet over HashSet when the elements come from a single enum type.
  • Prefer EnumMap over HashMap when the keys come from a single enum type.

Should you refactor all of your existing code to use Enum{Set,Map} instead of Hash{Set,Map}? No.

Your code that uses Hash{Set,Map} isn’t wrong. Migrating to Enum{Set,Map} might make it faster. That’s it.

If you’ve ever used primitive collection libraries like fastutil or Trove, then it may help to think of Enum{Set,Map} like those primitive collections. The difference is that Enum{Set,Map} are specialized for enum types, not primitive types, and you can use them without depending on any third-party libraries.

Enum{Set,Map} don’t have identical semantics to Hash{Set,Map}, so please don’t make blind, blanket replacements in your existing code.

Instead, try to remember these classes for next time. If you can make your code more efficient for free, then why not go ahead and do that, right?

If you use IntelliJ IDEA, you can have it remind you to use Enum{Set,Map} with inspections:

  • Analyze – Run inspection by name – “Set replaceable with EnumSet” or “Map replaceable with EnumMap”

…or…

  • File – Settings – Editor – Inspections – Java – Performance issues – “Set replaceable with EnumSet” or “Map replaceable with EnumMap”

SonarQube can also remind you to use Enum{Set,Map}:

  • S1641: “Sets with elements that are enum values should be replaced with EnumSet”
  • S1640: “Maps with keys that are enum values should be replaced with EnumMap”

For immutable versions of Enum{Set,Map}, see the following methods from Guava:

  • Sets.immutableEnumSet(elements) and the collector Sets.toImmutableEnumSet()
  • Maps.immutableEnumMap(map) and the collector Maps.toImmutableEnumMap(keyFunction, valueFunction)

If you don’t want to use Guava, then wrap the modifiable Enum{Set,Map} instances in Collections.unmodifiableSet(set) or Collections.unmodifiableMap(map) and throw away the direct references to the modifiable collections.
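
For example:

EnumSet<Month> summer = EnumSet.of(Month.JUNE, Month.JULY, Month.AUGUST);
Set<Month> immutableView = Collections.unmodifiableSet(summer);
// Let the "summer" reference go out of scope so only the view escapes.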

The resulting collections may be less efficient when it comes to operations like containsAll and equals than their counterparts in Guava, which may in turn be less efficient than the raw modifiable collections themselves.

Could the implementations be improved?

Since they can’t be replaced by third-party libraries, Enum{Set,Map} had better be as good as possible! They’re good already, but they could be better.

Enum{Set,Map} have missed out on potential upgrades since Java 8. New methods were added in Java 8 to Set and Map (or higher-level interfaces like Collection and Iterable). While the default implementations of those methods are correct, we could do better with overrides in Enum{Set,Map}.

This issue is tracked as JDK-8170826.

Specifically, these methods should be overridden:

  • {Regular,Jumbo}EnumSet.forEach(action)
  • {Regular,Jumbo}EnumSet.iterator().forEachRemaining(action)
  • {Regular,Jumbo}EnumSet.spliterator()
  • EnumMap.forEach(action)
  • EnumMap.{keySet,values,entrySet}().forEach(action)
  • EnumMap.{keySet,values,entrySet}().iterator().forEachRemaining(action)
  • EnumMap.{keySet,values,entrySet}().spliterator()

I put sample implementations on GitHub in case you’re curious what these overrides might look like. They’re all pretty straightforward.

Rather than walk through each implementation in detail, I’ll share some high-level observations about them.

  • The optimized forEach and forEachRemaining methods are roughly 50% better than the defaults (in terms of operations per second).
  • EnumMap.forEach(action) benefits the most, becoming twice as fast as the default implementation.
  • The iterable.forEach(action) method is popular. Optimizing it tends to affect a large audience, which increases the likelihood that the optimization (even if small) is worthwhile. (I’d claim that iterable.forEach(action) is too popular, and I’d suggest that the traditional enhanced for loop should be preferred over forEach except when the argument to forEach can be written as a method reference. That’s a topic for another discussion, though.)
  • The iterator.forEachRemaining(action) method is more important than it seems. Few people use it directly, but many people use it indirectly through streams. The default spliterator() delegates to the iterator(), and the default stream() delegates to the spliterator(). In the end, stream traversal may delegate to iterator().forEachRemaining(...). Given the popularity of streams, optimizing this method is a good idea!
  • The iterable.spliterator() method is critical when it comes to stream performance, but writing a custom Spliterator from scratch is a non-trivial task. I recommend this approach:
    • Check whether the characteristics of the default spliterator are correct for your collection (often the defaults are too conservative — for example, EnumSet‘s spliterator is currently missing the ORDERED, SORTED, and NONNULL characteristics). If they’re not correct, then provide a trivial override of the spliterator that uses Spliterators.spliterator(collection, characteristics) to define the correct characteristics; see the sketch after this list.
    • Don’t go further than that until you’ve read through the implementation of that spliterator, and you understand how it works, and you’re confident that you can do better. In particular, your tryAdvance(action) and trySplit() should both be better. Write a benchmark afterwards to confirm your assumptions.
  • The map.forEach(action) method is extremely popular and is almost always worth overriding. This is especially true for maps like EnumMap that create their Entry objects on demand.
  • It’s usually possible to share code across the forEach and forEachRemaining methods. If you override one, you’re already most of the way there to overriding the others.
  • I don’t think it’s worthwhile to override collection.removeIf(filter) in any of these classes. For RegularEnumSet, where it seemed most likely to be worthwhile, I couldn’t come up with a faster implementation than the default.
  • Enum{Set,Map} could provide faster hashCode() implementations than the ones they currently inherit from AbstractSet and AbstractMap, but I don’t think that would be worthwhile. In general, I don’t think optimizing the hashCode() of collections is worthwhile unless it can somehow become a constant-time (O(1)) operation, and even then it is questionable. Collection hash codes aren’t used very often.
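
As a sketch of that first, trivial step (not the JDK’s actual patch): keep the default iterator-backed spliterator, but declare the characteristics that are actually true for EnumSet.

@Override
public Spliterator<E> spliterator() {
  // Encounter order follows enum declaration order (ORDERED, SORTED by
  // natural order), and elements are distinct and never null. SIZED and
  // SUBSIZED are added automatically by this factory.
  return Spliterators.spliterator(
      this,
      Spliterator.ORDERED | Spliterator.SORTED |
          Spliterator.DISTINCT | Spliterator.NONNULL);
}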

Could the APIs be improved?

The implementation-level changes I’ve described are purely beneficial. There is no downside other than a moderate increase in lines of code, and the new lines of code aren’t all that complicated. (Even if they were complicated, this is java.util! Bring on the micro-optimizations.)

Since the existing code is already so good, though, changes of this nature have limited impact. Cutting one third or one half of the execution time from an operation that’s already measured in nanoseconds is a good thing but not game-changing. I suspect that those changes will cause exactly zero users of the JDK to write their applications differently.

The more tantalizing, meaningful, and dangerous changes are the realm of the APIs.

I think that Enum{Set,Map} are chronically underused. They have a bit of a PR problem. Some developers don’t know these classes exist. Other developers know about these classes but don’t bother to reach for them when the time comes. It’s just not a priority for them. That’s totally understandable, but… There’s avoiding premature optimization and then there’s throwing away performance for no reason — performance nihilism? Maybe we can win their hearts with API-level changes.

No one should have to go out of their way to use Enum{Set,Map}. Ideally it should be easier than using Hash{Set,Map}. The EnumSet.allOf(elementType) method is a great example. If you want a Set containing all the enum constants of some type, then EnumSet.allOf(elementType) is the best solution and the easiest solution.

The high-level JDK-8145048 tracks a couple of ideas for improvements in this area. In the following sections, I expand on these ideas and discuss other API-level changes.

Add immutable Enum{Set,Map} (maybe?)

In a recent conversation on Twitter about JEP 301: Enhanced Enums, Joshua Bloch and Brian Goetz referred to theoretical immutable Enum{Set,Map} types in the JDK.

Joshua Bloch also discussed the possibility of an immutable EnumSet in Effective Java:

“The one real disadvantage of EnumSet is that it is not, as of release 1.6, possible to create an immutable EnumSet, but this will likely be remedied in an upcoming release. In the meantime, you can wrap an EnumSet with Collections.unmodifiableSet, but conciseness and performance will suffer.”

When he said “performance will suffer”, he was probably referring to the fact that certain bulk operations of EnumSet won’t execute as quickly when inside a wrapper collection (tracked as JDK-5039214). Consider RegularEnumSet.equals(object):

public boolean equals(Object o) {
    if (!(o instanceof RegularEnumSet))
        return super.equals(o);

    RegularEnumSet<?> es = (RegularEnumSet<?>)o;
    if (es.elementType != elementType)
        return elements == 0 && es.elements == 0;

    return es.elements == elements;
}

It’s optimized for the case that the argument is another instance of RegularEnumSet. In that case the equality check boils down to a comparison of two primitive long values. Now that’s fast!

If the argument to equals(object) was not a RegularEnumSet but instead a Collections.unmodifiableSet wrapper, that code would fall back to its slow path.

Guava’s approach is similar to the Collections.unmodifiableSet one, although Guava does a bit better in terms of unwrapping the underlying Enum{Set,Map} and delegating to the super-fast optimized paths.

If your application deals exclusively with Guava’s immutable Enum{Set,Map} wrappers, you should get the full benefit of those optimized paths from the JDK. If you mix and match Guava’s collections with the JDK’s though, the results won’t be quite as good. (RegularEnumSet doesn’t know how to unwrap Guava’s ImmutableEnumSet, so a comparison in that direction would invoke the slow path.)

If immutable Enum{Set,Map} had full support in the JDK, however, it would not have those same limitations. RegularEnumSet and friends can be changed.

What should be done in the JDK?

I spent a long time and tested a lot of code trying to come up with an answer to this. Sadly the end result is:

I don’t know.

Personally, I’m content to use Guava for this. I’ll share some observations I made along the way.

Immutable Enum{Set,Map} won’t be faster than mutable Enum{Set,Map}.

The current versions of Enum{Set,Map} are really, really good. They’ll be even better once they override the defaults from Java 8.

Sometimes, having to support mutability comes with a tax on efficiency. I don’t think this is the case with Enum{Set,Map}. At best, immutable versions of these classes will be exactly as efficient as the mutable ones.

The more likely outcome is that immutable versions will come with a small penalty to performance by expanding the Enum{Set,Map} ecosystem.

Take RegularEnumSet.equals(object) for example. Each time we create a new type of EnumSet, are we going to change that code to add a new instanceof check for our new type? If we add the check, we make that code worse at handling everything except our new type. If we don’t add the check, we… still make that code worse! It’s less effective than it used to be; more EnumSet instances trigger the slow path.

Classes like Enum{Set,Map} have a userbase that is more sensitive to changes in performance than average users. If adding a new type causes some call site to become megamorphic, we might have thrown their carefully-crafted assumptions regarding performance out the window.

If we decide to add immutable Enum{Set,Map}, we should do so for reasons unrelated to performance.

As an exception to the rule, an immutable EnumSet containing all constants of a single enum type would be really fast.

RegularEnumSet sets such a high bar for efficiency. There is almost no wiggle room in Set operations like contains(element) for anyone else to be faster. Here’s the source code for RegularEnumSet.contains(element):

public boolean contains(Object e) {
    if (e == null)
        return false;
    Class<?> eClass = e.getClass();
    if (eClass != elementType && eClass.getSuperclass() != elementType)
        return false;

    return (elements & (1L << ((Enum<?>)e).ordinal())) != 0;
}

If you can’t do contains(element) faster than that, you’ve already lost. Your EnumSet is probably worthless.

There is a worthy contender, which I’ll call FullEnumSet. It is an EnumSet that (always) contains every constant of a single enum type. Here is one way to write that class:

// NOTE: FullEnumSet extends EnumSet, whose constructor and its elementType
// and universe fields are package-private, so this sample only compiles
// inside the java.util package.
package java.util;

import java.util.function.Consumer;
import java.util.function.Predicate;

class FullEnumSet<E extends Enum<E>> extends EnumSet<E> {

  // TODO: Add a static factory method somewhere.
  FullEnumSet(Class<E> elementType, Enum<?>[] universe) {
    super(elementType, universe);
  }

  @Override
  @SuppressWarnings("unchecked")
  public Iterator<E> iterator() {
    // TODO: Avoid calling Arrays.asList.
    //       The iterator class can be shared and used directly.
    return Arrays.asList((E[]) universe).iterator();
  }

  @Override
  public Spliterator<E> spliterator() {
    return Spliterators.spliterator(
        universe,
        Spliterator.ORDERED |
            Spliterator.SORTED |
            Spliterator.IMMUTABLE |
            Spliterator.NONNULL |
            Spliterator.DISTINCT);
  }

  @Override
  public int size() {
    return universe.length;
  }

  @Override
  public boolean contains(Object e) {
    if (e == null)
      return false;

    Class<?> eClass = e.getClass();
    return eClass == elementType || eClass.getSuperclass() == elementType;
  }

  @Override
  public boolean containsAll(Collection<?> c) {
    if (!(c instanceof EnumSet))
      return super.containsAll(c);

    EnumSet<?> es = (EnumSet<?>) c;
    return es.elementType == elementType || es.isEmpty();
  }

  @Override
  @SuppressWarnings("unchecked")
  public void forEach(Consumer<? super E> action) {
    int i = 0, n = universe.length;
    if (i >= n) {
      Objects.requireNonNull(action);
      return;
    }
    do action.accept((E) universe[i]);
    while (++i < n);
  }

  @Override void addAll()               {throw uoe();}
  @Override void addRange(E from, E to) {throw uoe();}
  @Override void complement()           {throw uoe();}

  @Override public boolean add(E e)                          {throw uoe();}
  @Override public boolean addAll(Collection<? extends E> c) {throw uoe();}
  @Override public void    clear()                           {throw uoe();}
  @Override public boolean remove(Object e)                  {throw uoe();}
  @Override public boolean removeAll(Collection<?> c)        {throw uoe();}
  @Override public boolean removeIf(Predicate<? super E> f)  {throw uoe();}
  @Override public boolean retainAll(Collection<?> c)        {throw uoe();}

  private static UnsupportedOperationException uoe() {
    return new UnsupportedOperationException();
  }

  // TODO: Figure out serialization.
  //       Serialization should preserve these qualities:
  //         - Immutable
  //         - Full
  //         - Singleton?
  //       Maybe it's a bad idea to extend EnumSet?
  private static final long serialVersionUID = 0;
}

FullEnumSet has many desirable properties. Of note:

  • contains(element) only needs to check the type of the argument to know whether it’s a member of the set.
  • containsAll(collection) is extremely fast when the argument is an EnumSet (of any kind); it boils down to comparing the element types of the two sets. It follows that equals(object) is just as fast in that case, since equals delegates the hard work to containsAll.
  • Since all the elements are contained in one flat array with no empty spaces, conditions are ideal for iterating and for splitting (splitting efficiency is important in the context of parallel streams).
  • It beats RegularEnumSet in all important metrics:
    • Query speed (contains(element), etc.)
    • Iteration speed
    • Space consumed

Asking for the full set of enum constants of some type is a very common operation. See: every user of values(), elementType.getEnumConstants(), and EnumSet.allOf(elementType). I bet the vast majority of those users do not modify (their copy of) that set of constants. A class that is specifically tailored to that use case has a good chance of being worthwhile.

Since it’s immutable, the FullEnumSet of each enum type could be a lazy-initialized singleton.

Should immutable Enum{Set,Map} reuse existing code, or should they be rewritten from scratch?

As I said earlier, the immutable versions of these classes aren’t going to be any faster. If they’re built from scratch, that code is going to look near-identical to the existing code. There would be a painful amount of copy and pasting, and I would not envy the people responsible for maintaining that code in the future.

Suppose we want to reuse the existing code. I see two general approaches:

  1. Do what Guava did, basically. Create unmodifiable wrappers around modifiable Enum{Set,Map}. Both the wrappers and the modifiable collections should be able to unwrap intelligently to take advantage of the existing optimizations for particular Enum{Set,Map} types (as in RegularEnumSet.equals(object)).
  2. Extend the modifiable Enum{Set,Map} classes with new classes that override modifier methods to throw UnsupportedOperationException. Optimizations that sniff for particular Enum{Set,Map} types (as in RegularEnumSet.equals(object)) remain exactly as effective as before without changes.

Of those two, I prefer the Guava-like approach. Extending the existing classes raises some difficult questions about the public API, particularly with respect to serialization.

What’s the public API for immutable Enum{Set,Map}? What’s the immutable version of EnumSet.of(e1, e2, e3)?

Here’s where I gave up.

  • Should we add public java.util.ImmutableEnum{Set,Map} classes?
  • If not, where do we put the factory methods, and what do we name them? EnumSet.immutableOf(e1, e2, e3)? EnumSet.immutableAllOf(Month.class)? Yuck! (Clever synonyms like “having” and “universeOf” might be even worse.)
  • Are the new classes instances of Enum{Set,Map} or do they exist in an unrelated class hierarchy?
  • If the new classes do extend Enum{Set,Map}, how is serialization affected? Do we add an “isImmutable” bit to the current serialized forms? Can that be done without breaking backwards compatibility?

Good luck to whoever has to produce the final answers to those questions.

That’s enough about this topic. Let’s move on.

Add factory methods

JDK-8145048 mentions the possibility of adding factory methods in Enum{Set,Map} to align them with Java 9’s Set and Map factories. EnumSet already has a varargs EnumSet.of(...) factory method, but EnumMap has nothing like that.

It would be nice to be able to declare EnumMap instances like this, for some reasonable number of key-value pairs:

Map<DayOfWeek, String> dayNames =
    EnumMap.of(
        DayOfWeek.MONDAY,    "lunes",
        DayOfWeek.TUESDAY,   "martes",
        DayOfWeek.WEDNESDAY, "miércoles",
        DayOfWeek.THURSDAY,  "jueves",
        DayOfWeek.FRIDAY,    "viernes",
        DayOfWeek.SATURDAY,  "sábado",
        DayOfWeek.SUNDAY,    "domingo");

Users could use EnumMap‘s copy constructor in conjunction with Java 9’s Map factory methods to achieve the same result less efficiently…

Map<DayOfWeek, String> dayNames =
    new EnumMap<>(
        Map.of(
            DayOfWeek.MONDAY,    "lunes",
            DayOfWeek.TUESDAY,   "martes",
            DayOfWeek.WEDNESDAY, "miércoles",
            DayOfWeek.THURSDAY,  "jueves",
            DayOfWeek.FRIDAY,    "viernes",
            DayOfWeek.SATURDAY,  "sábado",
            DayOfWeek.SUNDAY,    "domingo"));

…but the more we give up efficiency like that, the less EnumMap makes sense in the first place. A reasonable person might start to question why they should bother with EnumMap at all — just get rid of the new EnumMap<>(...) wrapper and use Map.of(...) directly.

Speaking of that EnumMap(Map) copy constructor, the fact that it may throw IllegalArgumentException when provided an empty Map leads people to use this pattern instead:

Map<DayOfWeek, String> copy = new EnumMap<>(DayOfWeek.class);
copy.putAll(otherMap);

We could give them a shortcut:

Map<DayOfWeek, String> copy = new EnumMap<>(DayOfWeek.class, otherMap);

Similarly, to avoid an IllegalArgumentException from EnumSet.copyOf(collection), I see code like this:

Set<Month> copy = EnumSet.noneOf(Month.class);
copy.addAll(otherCollection);

We could give them a shortcut too:

Set<Month> copy = EnumSet.copyOf(Month.class, otherCollection);

Existing code may define mappings from enum constants to values as standalone functions. Maybe the users of that code would like to view those (function-based) mappings as Map objects.

To that end, we could give people the means to generate an EnumMap from a Function:

Locale locale = Locale.forLanguageTag("es-MX");

Map<DayOfWeek, String> dayNames =
    EnumMap.map(DayOfWeek.class,
                day -> day.getDisplayName(TextStyle.FULL, locale));

// We could interpret the function returning null to mean that the
// key is not present.  That would allow this method to support
// more than the "every constant is a key" use case while dropping
// support for the "there may be present null values" use case,
// which is probably a good trade.

We could provide a similar factory method for EnumSet, accepting a Predicate instead of a Function:

Set<Month> shortMonths =
    EnumSet.filter(Month.class,
                   month -> month.minLength() < 31);

This functionality could be achieved less efficiently and more verbosely with streams. Again, the more we give up efficiency like that, the less sense it makes to use Enum{Set,Map} in the first place. I acknowledge that there is a cost to making API-level changes like the ones I’m discussing, but I feel that we are solidly in the “too little API-level support for Enum{Set,Map}” part of the spectrum and not even close to approaching the opposite “API bloat” end.
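For illustration, here is the stream-based equivalent of the proposed EnumSet.filter using only existing JDK methods (a sketch):

Set<Month> shortMonths =
    Arrays.stream(Month.values())
        .filter(month -> month.minLength() < 31)
        .collect(Collectors.toCollection(
            () -> EnumSet.noneOf(Month.class)));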

I don’t mean to belittle streams. There should also be more support for Enum{Set,Map} in the stream API.

Add collectors

Code written for Java 8+ will often produce collections using streams and collectors rather than invoking collection constructors or factory methods directly. I don’t think it would be outlandish to estimate that one third of collections are produced by collectors. Some of these collections will be (or could be) Enum{Set,Map}, and more could be done to serve that use case.

Collectors with these signatures should exist somewhere in the JDK:

public static <T extends Enum<T>>
Collector<T, ?, EnumSet<T>> toEnumSet(
    Class<T> elementType)

public static <T, K extends Enum<K>, U>
Collector<T, ?, EnumMap<K, U>> toEnumMap(
    Class<K> keyType,
    Function<? super T, ? extends K> keyMapper,
    Function<? super T, ? extends U> valueMapper)

public static <T, K extends Enum<K>, U>
Collector<T, ?, EnumMap<K, U>> toEnumMap(
    Class<K> keyType,
    Function<? super T, ? extends K> keyMapper,
    Function<? super T, ? extends U> valueMapper,
    BinaryOperator<U> mergeFunction)

Similar collectors can be obtained from the existing collector factories in the Collectors class (specifically toCollection(collectionSupplier) and toMap(keyMapper, valueMapper, mergeFunction, mapSupplier)) or by using Collector.of(...), but that requires a little more effort on the users’ part, adding a little bit of extra friction to using Enum{Set,Map} that we don’t need.
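For example, the proposed toEnumMap can be approximated today like this (a sketch; the explicit merge function and map supplier are exactly the friction the dedicated collectors would remove):

Map<Month, Integer> maxLengths =
    Arrays.stream(Month.values())
        .collect(Collectors.toMap(
            month -> month,                      // keyMapper
            Month::maxLength,                    // valueMapper
            (a, b) -> b,                         // mergeFunction (unused here)
            () -> new EnumMap<>(Month.class)));  // mapSupplier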

I referenced these collectors from Guava earlier in this article: Sets.toImmutableEnumSet and Maps.toImmutableEnumMap.

They do not require the Class object argument, making them easier to use than the collectors that I proposed. The reason the Guava collectors can do this is that they produce ImmutableSet and ImmutableMap, not EnumSet and EnumMap. One cannot create an Enum{Set,Map} instance without having the Class object for that enum type. In order to have a collector that reliably produces Enum{Set,Map} (even when the stream contains zero input elements to grab the Class object from), the Class object must be provided up front.
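For comparison, here is a usage sketch of the Guava set collector (assuming Guava’s Sets.toImmutableEnumSet; no Class object is needed because an empty result is simply an empty ImmutableSet):

ImmutableSet<Month> shortMonths =
    Arrays.stream(Month.values())
        .filter(month -> month.minLength() < 31)
        .collect(Sets.toImmutableEnumSet());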

We could provide similar collectors in the JDK that would produce immutable Set and Map instances. For streams with no elements, the collectors would produce Collections.emptySet() or Collections.emptyMap(). For streams with at least one element, the collectors would produce an Enum{Set,Map} instance wrapped by Collections.unmodifiable{Set,Map}.

The signatures would look like this:

public static <T extends Enum<T>>
Collector<T, ?, Set<T>> toImmutableEnumSet()

public static <T, K extends Enum<K>, U>
Collector<T, ?, Map<K, U>> toImmutableEnumMap(
    Function<? super T, ? extends K> keyMapper,
    Function<? super T, ? extends U> valueMapper)

public static <T, K extends Enum<K>, U>
Collector<T, ?, Map<K, U>> toImmutableEnumMap(
    Function<? super T, ? extends K> keyMapper,
    Function<? super T, ? extends U> valueMapper,
    BinaryOperator<U> mergeFunction)
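A minimal sketch of how the set variant could satisfy that contract, built from existing JDK pieces (my illustration, not a proposed final implementation):

public static <T extends Enum<T>>
Collector<T, ?, Set<T>> toImmutableEnumSet() {
  return Collectors.collectingAndThen(
      Collectors.toCollection(ArrayList::new),
      list -> {
        if (list.isEmpty()) {
          return Collections.emptySet();  // zero elements: no Class needed
        }
        // EnumSet.copyOf requires a non-empty source to learn the type.
        return Collections.unmodifiableSet(EnumSet.copyOf(list));
      });
}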

I’m not sure that those collectors are worthwhile. I might never recommend them over their counterparts in Guava.

The StreamEx library also provides a couple of interesting enum-specialized collectors.

They’re interesting because they are potentially short-circuiting. With MoreCollectors.toEnumSet(elementType), when the collector can determine that it has encountered all of the elements of that enum type (which is easy — the set of already-collected elements can be compared to EnumSet.allOf(elementType)), it stops collecting. These collectors may be well-suited for streams having a huge number of elements (or having elements that are expensive to compute) mapping to a relatively small set of enum constants.
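A usage sketch, assuming StreamEx’s StreamEx.of factory and the MoreCollectors.toEnumSet collector described above (endlessDays is a hypothetical stand-in for any large Stream<DayOfWeek>, and the short-circuiting takes effect when collecting through a StreamEx stream):

Set<DayOfWeek> daysSeen =
    StreamEx.of(endlessDays)
        .collect(MoreCollectors.toEnumSet(DayOfWeek.class));
// Stops consuming input once all seven DayOfWeek constants
// have been collected.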

I don’t know how feasible it is to port these StreamEx collectors to the JDK. As I understand it, the concept of short-circuiting collectors is not supported by the JDK. Adding support may necessitate other changes to the stream and collector APIs.

Be navigable? (No)

Over the years, many people have suggested that Enum{Set,Map} should implement the NavigableSet and NavigableMap interfaces. Every enum type is Comparable, so it’s technically possible. Why not?

I think the Navigable{Set,Map} interfaces are a poor fit for Enum{Set,Map}.

Those interfaces are huge! Implementing Navigable{Set,Map} would bloat the size of Enum{Set,Map} by 2-4x (in terms of lines of code). It would distract them from their core focus and strengths. Supporting the navigable API would most likely come with a non-zero penalty to runtime performance.

Have you ever looked closely at the specified behavior of methods like subSet and subMap, specifically when they might throw IllegalArgumentException? Those contracts impose a great deal of complexity for what seems like undesirable behavior. Enum{Set,Map} could take a stance on those methods similar to Guava’s ImmutableSortedSet and ImmutableSortedMap: acknowledge the contract of the interface but do something else that is more reasonable instead…

I say forget about it. If you want navigable collections, use TreeSet and TreeMap (or their thread-safe cousins, ConcurrentSkipListSet and ConcurrentSkipListMap). The cross-section of people who need the navigable API and the efficiency of enum-specialized collections must be very small.

There are few cases where the Comparable nature of enum types comes into play at all. In practice, I expect that the ordering of most enum constants is arbitrary (with respect to intended behavior).

I’ll go further than that; I think that making all enum types Comparable in the first place was a mistake.

  • Which ordering of Collector.Characteristics is “natural”, [CONCURRENT,UNORDERED] or [UNORDERED,CONCURRENT]?
  • Which is the “greater” Thread.State, WAITING or TIMED_WAITING?
  • FileVisitOption.FOLLOW_LINKS is “comparable” — to what? (There is no other FileVisitOption.)
  • How many instances of RoundingMode are in the “range” from FLOOR to CEILING?

    import java.math.RoundingMode;
    import java.util.EnumSet;
    import java.util.Set;

    class RangeTest {
      public static void main(String[] args) {
        Set<RoundingMode> range =
            EnumSet.range(RoundingMode.FLOOR,
                          RoundingMode.CEILING);
        System.out.println(range.size());
      }
    }

    // java.lang.IllegalArgumentException: FLOOR > CEILING

There are other enum types where questions like that actually make sense, and those should be Comparable.

  • Is Month.JANUARY “before” Month.FEBRUARY? Yes.
  • Is TimeUnit.HOURS “larger” than TimeUnit.MINUTES? Yes.

Implementing Comparable or not should have been a choice for authors of individual enum types. To serve people who really did want to sort enum constants by declaration order for whatever reason, we could have automatically provided a static Comparator from each enum type:

Comparator<JDBCType> c = JDBCType.declarationOrder();
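For what it’s worth, an equivalent comparator can already be assembled by hand, since ordinal() exposes the declaration index. A sketch:

// Orders constants by declaration position; for enums this matches
// their natural order today.
Comparator<JDBCType> byDeclaration = Comparator.comparingInt(Enum::ordinal);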

It’s too late for that now. Let’s not double down on the original mistake by making Enum{Set,Map} navigable.

Conclusion

EnumSet and EnumMap are cool collections, and you should use them!

They’re already great, but they can become even better with changes to their private implementation details. I propose some ideas here. If you want to find out what happens in the JDK, the changes (if there are any) should be noted in JDK-8170826.

API-level changes are warranted as well. New factory methods and collectors would make it easier to obtain instances of Enum{Set,Map}, and immutable Enum{Set,Map} could be better-supported. I propose some ideas here, but if there are any actual changes made then they should be noted in JDK-8145048.

Efficient multiple-stream concatenation in Java

October 19, 2016

Michael Hixson

I want to combine the elements of multiple Stream instances into a single Stream. What’s the best way to do this?

This article compares a few different solutions.

Stream.concat(a, b)

The JDK provides Stream.concat(a, b) for concatenating two streams.

void exampleConcatTwo() {
  Stream<String> a = Stream.of("one", "two");
  Stream<String> b = Stream.of("three", "four");
  Stream<String> out = Stream.concat(a, b);
  out.forEach(System.out::println);
  // Output:
  // one
  // two
  // three
  // four
}

What if we have more than two streams?

We could use Stream.concat(a, b) multiple times. With three streams we could write Stream.concat(Stream.concat(a, b), c).

To me that approach is depressing at three streams, and it rapidly gets worse as we add more streams.

Reduce

Alternatively, we can use reduce to perform the multiple incantations of Stream.concat(a, b) for us. The code adapts elegantly to handle any number of input streams.

void exampleReduce() {
  Stream<String> a = Stream.of("one", "two");
  Stream<String> b = Stream.of("three", "four");
  Stream<String> c = Stream.of("five", "six");
  Stream<String> out = Stream.of(a, b, c)
      .reduce(Stream::concat)
      .orElseGet(Stream::empty);
  out.forEach(System.out::println);
  // Output:
  // one
  // two
  // three
  // four
  // five
  // six
}

Be careful using this pattern! Note the warning in the documentation of Stream.concat(a, b):

Use caution when constructing streams from repeated concatenation. Accessing an element of a deeply concatenated stream can result in deep call chains, or even StackOverflowError.

It takes quite a few input streams to trigger this problem, but it is trivial to demonstrate:

void exampleStackOverflow() {
  List<Stream<String>> inputs = new AbstractList<Stream<String>>() {
    @Override
    public Stream<String> get(int index) {
      return Stream.of("one", "two");
    }

    @Override
    public int size() {
      return 1_000_000; // try changing this number
    }
  };
  Stream<String> out = inputs.stream()
      .reduce(Stream::concat)
      .orElseGet(Stream::empty);
  long count = out.count(); // probably throws
  System.out.println("count: " + count); // probably never reached
}

On my workstation, this method throws StackOverflowError after several seconds of churning.

What’s going on here?

We can think of the calls to Stream.concat(a, b) as forming a binary tree. At the root is the concatenation of all the input streams. At the leaves are the individual input streams. Let’s look at the trees for up to five input streams as formed by our reduce operation.

Two streams: concat(a, b)
Three streams: concat(concat(a, b), c)
Four streams: concat(concat(concat(a, b), c), d)
Five streams: concat(concat(concat(concat(a, b), c), d), e)

The trees are perfectly unbalanced! Each additional input stream adds one layer of depth to the tree and one layer of indirection to reach all the other streams. This can have a noticeable negative impact on performance. With enough layers of indirection we’ll see a StackOverflowError.

Balance

If we’re worried that we’ll concatenate a large number of streams and run into the aforementioned problems, we can balance the tree. This is as if we’re optimizing an O(n) algorithm into an O(log n) one. We won’t totally eliminate the possibility of StackOverflowError, and there may be other approaches that perform even better, but this should be quite an improvement over the previous solution.

void exampleBalance() {
  Stream<String> a = Stream.of("one", "two");
  Stream<String> b = Stream.of("three", "four");
  Stream<String> c = Stream.of("five", "six");
  Stream<String> out = concat(a, b, c);
  out.forEach(System.out::println);
  // Output:
  // one
  // two
  // three
  // four
  // five
  // six
}

@SafeVarargs
static <T> Stream<T> concat(Stream<T>... in) {
  return concat(in, 0, in.length);
}

static <T> Stream<T> concat(Stream<T>[] in, int low, int high) {
  switch (high - low) {
    case 0: return Stream.empty();
    case 1: return in[low];
    default:
      int mid = (low + high) >>> 1;
      Stream<T> left = concat(in, low, mid);
      Stream<T> right = concat(in, mid, high);
      return Stream.concat(left, right);
  }
}

Flatmap

There is another way to concatenate streams that is built into the JDK, and it does not involve Stream.concat(a, b) at all. It is flatMap.

void exampleFlatMap() {
  Stream<String> a = Stream.of("one", "two");
  Stream<String> b = Stream.of("three", "four");
  Stream<String> c = Stream.of("five", "six");
  Stream<String> out = Stream.of(a, b, c).flatMap(s -> s);
  out.forEach(System.out::println);
  // Output:
  // one
  // two
  // three
  // four
  // five
  // six
}

This generally outperforms the solutions based on Stream.concat(a, b) when each input stream contains fewer than 32 elements. Beyond 32 elements per stream, flatMap performs comparatively worse and worse as the element count rises.

flatMap avoids the StackOverflowError issue but it comes with its own set of quirks. For example, it interacts poorly with infinite streams. Calling findAny on the concatenated stream may cause the program to enter an infinite loop, whereas the other solutions would terminate almost immediately.

void exampleInfiniteLoop() {
  Stream<String> a = Stream.generate(() -> "one");
  Stream<String> b = Stream.generate(() -> "two");
  Stream<String> c = Stream.generate(() -> "three");
  Stream<String> out = Stream.of(a, b, c).flatMap(s -> s);
  Optional<String> any = out.findAny(); // infinite loop
  System.out.println(any); // never reached
}

(The infinite loop is an implementation detail. This could be fixed in the JDK without changing the contract of flatMap.)

Also, flatMap forces its input streams into sequential mode even if they were originally parallel. The outermost concatenated stream can still be made parallel, and we will be able to process elements from distinct input streams in parallel, but the elements of each individual input stream must all be processed sequentially.

Analysis

Let me share a few trends that I’ve noticed when dealing with streams and stream concatenation in general, having written a fair amount of code in Java 8 by now.

  • There have been maybe one dozen cases where I’ve needed to concatenate streams. That’s not all that many, so no matter how good the solution is, it’s not going to have much of an impact for me.
  • In all but one of those cases, I needed to concatenate exactly two streams, so Stream.concat(a, b) was sufficient.
  • In the remaining case, I needed to concatenate exactly three streams. I was not even close to the point where StackOverflowError would become an issue. Stream.concat(Stream.concat(a, b), c) would have worked just fine, although I went with flatMap because I felt that it was easier to read.
  • I have never needed to concatenate streams in performance-critical sections of code.
  • I use infinite streams very rarely. When I do use them, it is obvious in context that they are infinite. And so concatenating infinite streams together and then asking a question like findAny on the result is just not something that I would be tempted to do. That particular issue with flatMap seems like one that I’ll never come across.
  • I use parallel streams very rarely. I think I’ve only used them twice in production code. It is almost never the case that going parallel improves performance, and even when it might improve performance, it is unlikely that processing them in the singleton ForkJoinPool.commonPool() is how I will want to manage that work. The issue with flatMap forcing the input streams to be sequential seems very unlikely to be a real problem for me.
  • Let’s suppose that I do want to concatenate parallel streams and have them processed in parallel. If I have eight input streams on an eight core machine, and each stream has roughly the same number of elements, the fact that flatMap forces the individual streams to be sequential will not degrade performance for me at all. All eight cores will be fully utilized, each core processing one of the eight input streams. If I have seven input streams on that same machine, I will see only slightly degraded performance. With six, slightly more degraded, and so on.

What’s the takeaway from all this? Here is my advice:

For two input streams, use:
Stream.concat(a, b)

For more than two input streams, use:
Stream.of(a, b, c, ...).flatMap(s -> s)

That solution is good enough…

Overboard

…but what if we’re not satisfied with “good enough”? What if we want a solution that’s really fast no matter the size and shape of the input and doesn’t have any of the quirks of the other solutions?

The full implementation is a bit much to inline in a blog article, so take a look at StreamConcatenation.java for the source code.

This implementation is similar to Stream.concat(a, b) in that it uses a custom Spliterator, except this implementation handles any number of input streams.
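To give a flavor of the approach, here is a rough sketch of such a spliterator (my illustration, not the actual StreamConcatenation.java; it walks an array of input spliterators in order, and trySplit hands off whole inputs one at a time):

import java.util.Spliterator;
import java.util.function.Consumer;

final class ConcatSpliterator<T> implements Spliterator<T> {
  private final Spliterator<T>[] inputs;
  private int index;

  ConcatSpliterator(Spliterator<T>[] inputs) {
    this.inputs = inputs;
  }

  @Override
  public boolean tryAdvance(Consumer<? super T> action) {
    while (index < inputs.length) {
      if (inputs[index].tryAdvance(action)) {
        return true;
      }
      index++; // current input exhausted; move on
    }
    return false;
  }

  @Override
  public Spliterator<T> trySplit() {
    // Hand off one whole input; null when only one remains.
    return index < inputs.length - 1 ? inputs[index++] : null;
  }

  @Override
  public long estimateSize() {
    long size = 0;
    for (int i = index; i < inputs.length; i++) {
      size += inputs[i].estimateSize();
      if (size < 0) {
        return Long.MAX_VALUE; // saturate on overflow
      }
    }
    return size;
  }

  @Override
  public int characteristics() {
    return 0; // conservative: report no characteristics in this sketch
  }
}

A production version would also need to build the stream via StreamSupport.stream(...), report accurate characteristics, and close every input when the concatenated stream is closed.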

It performs quite well. It does not outperform every other solution in every scenario (flatMap is generally better for very small input streams), but it never performs much worse and it scales nicely with the number and size of the input streams.

Benchmark

I wrote a JMH benchmark to compare the four solutions discussed in this article. The benchmark uses each solution to concatenate a variable number of input streams with a variable number of elements per stream, then iterates over the elements of the concatenated stream. Here is the raw JMH output from my workstation and a prettier visualization of the benchmark results.