- Recipes
- Amazon S3 CSV file to Pinecone
Connect Amazon S3 CSV file and Pinecone in our serverless environment
Use this template to Read CSV file entries from Amazon S3 bucket using them to upserts vectors into a Pinecone index.
Share
Read CSV file entries from Amazon S3 bucket
Used integrations:
JavaScript
Python
class AwsS3SourceReadRemoteCsv {
async init() {
// TODO: Create your aws-s3 credential
// More info at https://yepcode.io/docs/integrations/aws-s3/#credential-configuration
this.awsS3 = yepcode.integration.awsS3("your-aws-s3-credential-name");
}
async fetch(publish, done) {
// TODO: Customize the command to be executed
// More info at: https://docs.aws.amazon.com/AWSJavaScriptSDK/v3/latest/clients/client-s3/interfaces/getobjectcommandinput.html
const getObjectCommand = new GetObjectCommand({
Bucket: "your-bucket-name",
Key: "your-file-name",
});
const getObjectResult = await this.awsS3.send(getObjectCommand);
// Result.Body is a stream of object content
getObjectResult.Body.pipe(
csv.parse({
delimiter: ",",
columns: true,
})
)
.on("data", publish)
.on("end", done);
}
async close() {}
}
import csv
import io
class AwsS3SourceReadRemoteCsv:
def setup(self):
# TODO: Create your S3 credential:
# More info at https://yepcode.io/docs/integrations/aws-s3/#credential-configuration
self.aws_s3_client = yepcode.integration.awsS3("your-s3-credential-name")
# TODO: If your csv does no have headers, you can define them here as a list:
# self.fieldnames = ["column1", "column2", "column3"]
self.fieldnames = None
def generator(self):
# TODO: Customize your bucket name and object key
# See all supported params in: https://boto3.amazonaws.com/v1/documentation/api/latest/reference/services/s3/client/get_object.html
response = self.aws_s3_client.get_object(
Bucket="your-bucket-name",
Key="your-file-name",
)
bytes_stream = response["Body"]
csv_file_stream = io.TextIOWrapper(bytes_stream, encoding="utf-8")
reader = csv.DictReader(
csv_file_stream, delimiter=",", fieldnames=self.fieldnames
)
for row in reader:
yield row
def close(self):
pass
Do you need help solving this integration with YepCode?
Let's talkUpserts vectors into a Pinecone index
Used integrations:
JavaScript
Python
class PineconeTargetUpsertVectors {
async init() {
// TODO: Create your pinecone credential
// More info at https://yepcode.io/docs/integrations/pinecone/#credential-configuration
this.pineconeClient = yepcode.integration.pinecone(
"your-pinecone-credential-name"
);
// TODO: Select the index name you want to recover.
// More info at https://docs.pinecone.io/docs/node-client#index
this.index = await pineconeClient.Index("your-index-name");
this.vectors = [];
}
async consume(item) {
// TODO: Save vectors you want to upsert into a namespace.
this.vectors.push({
id: item.id,
values: item.values,
metadata: item.metadata,
});
}
async close() {
// TODO: Write vectors into a namespace.
// More info at https://docs.pinecone.io/docs/node-client#indexupsert
await this.index.upsert({
upsertRequest: {
vectors: this.vectors,
namespace: "your-namespace",
},
});
}
}
class PineconeTargetUpsertVectors:
def setup(self):
# TODO: Create your Pinecone credential:
# More info at https://yepcode.io/docs/integrations/pinecone/#credential-configuration
self.pinecone_client = yepcode.integration.pinecone(
"your-pinecone-credential-name"
)
# TODO: Select the index name you want to recover
# More info at: https://docs.pinecone.io/docs/python-client#index
self.index = self.pinecone_client.Index("your-index-name")
self.vectors = []
def consume(self, generator):
# TODO: Save vectors you want to upsert into a namespace.
for item in generator:
self.vectors.append(
{
"id": item.id,
"values": item.values,
"metadata": item.metadata,
}
)
def close(self):
# TODO: Write vectors into a namespace.
# More info at: https://docs.pinecone.io/docs/python-client#indexupsert
self.index.upsert(self.vectors, namespace="your-namespace")
pass
FAQs
YepCode is a SaaS platform that allows to create, execute and monitor integrations and automations using source code in a serverless environment.
We like to call it the Zapier for developers, since we bring all the agility and benefits of NoCode tools (avoid server provisioning, environment configuration, deployments,...), but with all the power of being able to use a programming language like JavaScript or Python.
These recipes are a good starting point for you to build your own YepCode processes and solve your integration and automation problems.
You only have to fill the sign up form and your account will be created with our FREE plan (no credit card required).
YepCode has been created with a clear enterprise approach (multi-tenant environment, team management, high security and auditing standards, IdP integrations, on-premise options,...) so we can be the Swiss army knife of any team of engineering, especially those that need to extract or send information to external systems, and where a certain dynamism or adaptation to change is necessary in that process.
Sure! You just need to do some configuration to allow YepCode servers to connect to that service. Check our docs page to get more information.