Connect Google BigTable and Google Storage NDJSON file in our serverless environment

Use this template to Read rows from Google BigTable using them to create NDJSON file entries in Google Storage Bucket.

Read rows from Google BigTable

Used integrations:

google-bigtable

JavaScript
Python

class GoogleBigtableSourceGetRows {
    async init() {
        // More info at https://www.npmjs.com/package/@google-cloud/bigtable
        // TODO: Create team variables. More info at https://yepcode.io/docs/processes/team-variables
        this.googleBigTable = new Bigtable({
            projectId: yepcode.env.GOOGLE_BIGTABLE_PROJECT_ID,
            credentials: JSON.parse(yepcode.env.GOOGLE_BIGTABLE_SERVICE_ACCOUNT_JSON),
        });
        // TODO: Customize your instance and table names
        this.table = await this.googleBigTable
            .instance("your-instance-name")
            .table("your-table-name");
    }

    async fetch(publish, done) {
        // You can customize get options. eg: get matching keys
        // Look at: https://cloud.google.com/nodejs/docs/reference/bigtable/latest/bigtable/getrowsoptions
        const getRowsOptions = {};
        const [rows] = await this.table.getRows(getRowsOptions);

        for (const rowResponse of rows) {
            const [row] = await rowResponse.get();
            const item = this._mapRowToItem(row);
            await publish(item);
        }
        done();
    }

    _mapRowToItem(row) {
        const columnsAndLastValue = Object.entries(row.data).map(
            ([columnName, columnContent]) => [
                columnName,
                this._getLastColumnValue(columnContent),
            ]
        );
        return Object.fromEntries(columnsAndLastValue);
    }

    _getLastColumnValue(columnContent) {
        // The second [0] is to access the last value
        // You can access previous values by accessing the next
        // elements in the list
        return columnContent[0][0].value;
    }

    async close() {}
}

import json
from google.cloud.bigtable.client import Client
from google.oauth2.service_account import Credentials
from google.cloud.bigtable import row_filters

class GoogleBigtableSourceGetRows:
    def setup(self):
        # More info at https://pypi.org/project/google-cloud-bigtable
        # TODO: Create team variables. More info at https://yepcode.io/docs/processes/team-variables
        credentials_dict = json.loads(yepcode.env.GOOGLE_APPLICATION_CREDENTIALS)
        self.big_table_client = Client(
            project=yepcode.env.GOOGLE_PROJECT_ID,
            credentials=Credentials.from_service_account_info(credentials_dict)
        )

    def generator(self):
        # TODO: Customize instance id, table id and filters if needed
        instance = self.big_table_client.instance("instance-id")
        table = instance.table("table_id")
        filter = row_filters.RowFilterChain(
            filters=[
                row_filters.FamilyNameRegexFilter("column-family-id"),
                row_filters.ColumnQualifierRegexFilter("column-qualifier"),
                row_filters.ValueRegexFilter("value"),
            ]
        )
        rows = table.read_rows(filter_=filter)

        # TODO: Customize the item to yield
        for row in rows:
            item = {"row_key": row.row_key.decode('utf-8')}
            for cf, cols in row.cells.items():
                for col, cells in cols.items():
                    for cell in cells:
                        # This will only get the latest version of the cell value
                        item[f"{cf}:{col}"] = cell.value.decode('utf-8')
            yield item

    def close(self):
        pass

Do you need help solving this integration with YepCode?

Let's talk

Create NDJSON file entries in Google Storage Bucket

Used integrations:

google-storage

JavaScript
Python

class GoogleStorageTargetUploadNdjson {
    async init() {
        // More info at https://www.npmjs.com/package/@google-cloud/storage
        // TODO: Create team variables. More info at https://yepcode.io/docs/processes/team-variables
        const googleStorage = new Storage({
            projectId: yepcode.env.GOOGLE_STORAGE_PROJECT_ID,
            credentials: JSON.parse(yepcode.env.GOOGLE_STORAGE_SERVICE_ACCOUNT_JSON),
        });
        // TODO: Customize your bucket name
        const bucket = googleStorage.bucket(yepcode.env.GOOGLE_STORAGE_BUCKET_NAME);
        // TODO: choose a destination file
        const blob = bucket.file(`one-folder/my-filename-${Date.now()}.ndjson`);
        const ws = blob.createWriteStream({
            metadata: {
                contentType: "text/ndjson",
            },
        });

        // Transforms the items into a ndjson format
        this.stringifier = ndjson.stringify();
        this.stringifier.pipe(ws);
    }

    async consume(item) {
        this.stringifier.write(item);
    }

    async close() {
        this.stringifier.end();
    }
}

import ndjson
import io
import json
import google.cloud.storage
from google.oauth2.service_account import Credentials

class AccumulatingStream:
    def __init__(self):
        self.data = io.BytesIO()

    def write(self, item):
        self.data.write(item.encode("utf-8"))

    def get_stream(self):
        self.data.seek(0)
        return self.data

class GoogleStorageTargetUploadNdjson:
    def setup(self):
        # More info at https://pypi.org/project/google-cloud-storage
        # TODO: Create team variables. More info at https://yepcode.io/docs/processes/team-variables
        credentials_dict = json.loads(yepcode.env.GOOGLE_APPLICATION_CREDENTIALS)
        self.storage_client = google.cloud.storage.Client(
            project=yepcode.env.GOOGLE_PROJECT_ID,
            credentials=Credentials.from_service_account_info(credentials_dict)
        )

        self.acc_stream = AccumulatingStream()
        self.stringifier = ndjson.writer(self.acc_stream, ensure_ascii=False)

    def consume(self, generator, done):
        for item in generator:
            self.stringifier.writerow(item)
        done()

    def close(self):
        # TODO: customize the bucket name and object key
        try:
            bucket = self.storage_client.get_bucket("bucket_name")
            blob = bucket.blob("object_key")

            # Upload from a stream
            blob.upload_from_file(self.acc_stream.get_stream())
        except Exception as error:
            print(f"Error uploading object: {error}")

Other combinations

View recipes

FAQs

What is YepCode?

YepCode is a SaaS platform that enables the creation, execution and monitoring of integrations and automations using source code in a serverless environment.

We like to call it the Zapier for developers, since we bring all the agility and benefits of NoCode tools (avoid server provisioning, environment configuration, deployments,...), but with all the power of being able to use a programming language like JavaScript or Python.

How these recipes may help me?

These recipes are an excellent starting point for creating your own YepCode processes and solving complex integration and automation problems.

How can I use YepCode?

You only have to complete the sign up form and your account will be created with our FREE plan (no credit card required).

Is YepCode just for single developers or may be used by enterprises?

YepCode has been created with a clear enterprise focus, offering a multi-tenant environment, team management capabilities, high security and auditing standards, Identity Provider (IdP) integrations, and on-premise options. It serves as the Swiss army knife for engineering teams, especially those requiring the extraction or transmission of information to external systems. It excels in scenarios demanding flexibility and adaptability to change within the process.

Can YepCode work with my on-premise services?

Sure! You only need to configure YepCode servers to establish a connection with that service. Check our docs page to get more information.

Connect Google BigTable and Google Storage NDJSON file in our serverless environment

Share

Read rows from Google BigTable

Used integrations:

Do you need help solving this integration with YepCode?

Create NDJSON file entries in Google Storage Bucket

Used integrations:

Other combinations

Related recipes

FAQs

What is YepCode?

How these recipes may help me?

How can I use YepCode?

Is YepCode just for single developers or may be used by enterprises?

Can YepCode work with my on-premise services?