Vector Database
2023-08-31
What is Approximate Nearest Neighbors (ANN)
This article explains the method known as Approximate Nearest Neighbors (ANN). ANN is a technique used to approximately locate the nearest neighbors of high-dimensional vector data without performing exact calculations. By reducing the computational cost of finding neighboring points within high-dimensional data, this method finds applications in various fields such as similar image retrieval and recommendation systems.
Vector Database
2023-08-30
Chunking in LLM Applications
Effective processing of text is essential for the development of applications utilizing LLM (Large Language Model). This article focuses on "chunking," which is particularly important in this context. Chunking is the process of dividing large text into smaller segments to optimize the relevance of content retrieval from a vector database. The article introduces various chunking techniques and explains factors related to selecting the optimal approach.
Machine Learning
NLP
LLM
Vector Database
2023-08-28
Context Size of LLM and Vector Database
This article introduces the challenges of expanding the context size of LLM and discusses how the accuracy enhancement and cost reduction of LLM are achieved through a vector database.
Vector Database
LLM
2023-08-05
Building a Slack Bot Using ChatGPT Retrieval Plugin
This article introduces the process of building a Slack Bot that responds in the style of ChatGPT, based on custom information using the ChatGPT Retrieval Plugin. The system will be built on Google Cloud.
Machine Learning
NLP
LLM
Vector Database
Pinecone
OpenAI
ChatGPT
FastAPI
Google Cloud
Cloud Run
Cloud Functions
Slack
2023-06-18
Selective Metadata Indexing in Pinecone
This article explains how to configure Pinecone to index specific metadata fields, optimizing memory usage.
Vector Database
Pinecone
2023-06-14
Differentiating Between Upsert and Update in Pinecone
This article explains the distinction between the upsert and update APIs for vector data updates in Pinecone.
Vector Database
Pinecone
2023-06-11
Pinecone Sparse-Dense Vectors
This article introduces the concept of Pinecone Sparse-Dense Vectors, a unique feature of the Pinecone vector database that combines the strengths of semantic and keyword search for enhanced search results.
Vector Database
Pinecone
2023-06-11
ChatGPT Retrieval Plugin
This article introduces the ChatGPT Retrieval Plugin that enables semantic search and retrieval of documents.
Machine Learning
NLP
LLM
Vector Database
OpenAI
ChatGPT
2023-05-15
Metadata in Pinecone
This article discusses the topic of filtering vectors with metadata in Pinecone, allowing you to attach key-value pairs to vectors and perform queries based on filter expressions, resulting in improved search latency and precise retrieval of nearest-neighbor results that match the filters.
Vector Database
Pinecone
2023-05-05
How to Choose Right Pod Type and Size in pinecone
This article provides an introductory understanding of pod selection in Pinecone, explaining the concept of pods, their types, sizes, and how they impact the performance of your service.
Vector Database
Pinecone
2023-05-03
Setting up Pinecone Indexes from Console
This article provides a step-by-step guide on how to set up Pinecone indexes.
Vector Database
Pinecone
2023-05-02
Pinecone
This article provides an overview of Pinecone, a cloud-native vector database tailored for high-performance machine learning applications.
Vector Database
Pinecone
2023-03-30
LLM System Using Vector DB and Proprietary Data
This article explains how to construct a Large Language Model (LLM) system that contains own information.
Machine Learning
NLP
LLM
Vector Database
2023-02-04
Vector Similarity
This article introduces vector similarity measures, including Euclidean distance, cosine similarity, dot product similarity, and Jaccard similarity.
Vector Database
2023-02-03
Vector DB
This article provides an overview of vector databases, their role in handling high-dimensional data, and their applications in machine learning, recommendation systems, and geospatial analysis.
Vector Database
AlloyDB
Amazon Cognito
Amazon EC2
Amazon ECS
Amazon QuickSight
Amazon RDS
Amazon Redshift
Amazon S3
API
Autonomous Vehicle
AWS
AWS API Gateway
AWS Chalice
AWS Control Tower
AWS IAM
AWS Lambda
AWS VPC
BERT
BigQuery
Causal Inference
ChatGPT
Chrome Extension
CircleCI
Classification
Cloud Functions
Cloud IAM
Cloud Run
Cloud Storage
Clustering
CSS
Data Engineering
Data Modeling
Database
dbt
Decision Tree
Deep Learning
Descriptive Statistics
Differential Equation
Dimensionality Reduction
Discrete Choice Model
Docker
Economics
FastAPI
Firebase
GIS
git
GitHub
GitHub Actions
Google
Google Cloud
Google Search Console
Hugging Face
Hypothesis Testing
Inferential Statistics
Interval Estimation
JavaScript
Jinja
Kedro
Kubernetes
LightGBM
Linux
LLM
Mac
Machine Learning
Macroeconomics
Marketing
Mathematical Model
Meltano
MLflow
MLOps
MySQL
NextJS
NLP
Nodejs
NoSQL
ONNX
OpenAI
Optimization Problem
Optuna
Pandas
Pinecone
PostGIS
PostgreSQL
Probability Distribution
Product
Project
Psychology
Python
PyTorch
QGIS
R
ReactJS
Regression
Rideshare
SEO
Singer
sklearn
Slack
Snowflake
Software Development
SQL
Statistical Model
Statistics
Streamlit
Tabular
Tailwind CSS
TensorFlow
Terraform
Transportation
TypeScript
Urban Planning
Vector Database
Vertex AI
VSCode
XGBoost