Skip to content

Accelerating queries

For enhancing the query performance, a user can cache the data using Flash Stack.

For example:

A user wants to reduce the time it takes to get query results, so they cache the dataset in the lakehouse Depot using Flash Service. This ensures faster query results whenever the dataset is queried. The below manifest file defines a Service named flashtest. The Service caches a dataset named city, stored in dataos://lakehouse:retail/city, and initializes a table mycity by selecting data from retail.city.

name: flashtest
version: v1
type: service
tags:
    - service
description: Flash service
workspace: public
service:
  servicePort: 8080
  servicePorts:
    - name: backup
      servicePort: 5433  
  ingress:
    enabled: true
    stripPath: false
    path: /flash/public:flashtest
    noAuthentication: true
  replicas: 1
  logLevel: info
  compute: runnable-default
  envs:
    APP_BASE_PATH: 'dataos-basepath'
    FLASH_BASE_PATH: /flash/public:flashtest
  resources:
    requests:
        cpu: 1000m
        memory: 1024Mi
  stack: flash+python:2.0
  stackSpec:
# Datasets
    datasets:
      - name: city
        address: dataos://lakehouse:retail/city

    init:
      - create table mycity as (select * from retail.city)

    # schedule:
    #   - expression: "*/2 * * * *"
    #     sql: INSERT INTO mycustomer BY NAME (select * from customer);