Changelog

3.2.1 - 2020-10-14

Python API

  • [Performance] 2X to 5X faster multi-threaded hashing of S3 objects (#1816, #1788)

  • [Fixed] Bump minimum required version of tqdm. Fixes a crash (UnseekableStreamError) during upload retry. (#1853)

CLI

  • [Added] Add --meta argument to push (#1793)

  • [Fixed] Fix crash in list-packages (#1852)

3.2.0 - 2020-09-08 - Package Registry Refactor

Python:

  • Refactors local and s3 storage-layer code around a new PackageRegistry base class (to support improved file layouts in future releases)

  • Multithreaded download for large files, large performance gains when installing packages with large files, especially on large instances

  • Package name added to Package.resolve_hash

  • Bugfix: remove package revision by shorthash

  • Performance improvements for build and push

Catalog & Lambdas:

  • PDF previews

  • Browse full package contents (no longer limited to 1000 files)

  • Indexing and search package-level metadata

  • Fixed issue with download button for certain text files

  • FCS files: content indexing and preview

  • Catalog sign-in with email (or username)

  • Catalog support for sign-in with Okta

3.1.14 - 2020-06-13 - Python API features, fixes, catalog capabilities, backend optimizations

Catalog

  • .cef preview

  • allow hiding download button

  • only show stats for 2-level extensions for .gz files

Python

  • quilt3.logged_in()

  • fix retries during hashing

  • improve progress bars

  • fix quilt3 catalog

  • expanded documentation

  • reduce pyyaml requirements to prevent version conflicts

Backend

  • improve unit test coverage for indexing lambdas

  • fix real-time delete handling (incl. for unversioned objects)

  • handle all s3:ObjectCreated: and ObjectRemoved: events (fixes ES search state and bucket Overview)

3.1.13 - 2020-04-15 - Windows Support

Python API

  • Official support for Windows

  • Add support for Python 3.7, 3.8

  • Fix Package import in Python

  • Updated libraries for stability and security

  • Quiet TQDM for log files ($ export QUILT_MINIMIZE_STDOUT=true )

  • CLI setting of config parameters

Catalog

  • new feature to filter large S3 directories with regex

  • more reliable bucket region inference

  • Support preview of larger Jupyter notebooks in S3 (via transparent GZIP)

  • JS (catalog) dependencies for stability and security

  • extended Parquet file support (for files without a .parquet extension)

  • Improvements to catalog signing logic for external and in-stack buckets

Special thanks to @NathanDeMaria (CLI and Windows support) and @JacksonMaxfield for contributing code to this release.

3.1.12 - 2020-03-11 - Command line push

Python

  • Add push to CLI

3.1.11 - 2020-03-10 - New command line features, bug fixes

Catalog

  • Updated JS dependencies

  • Display package truncation warning in Packages

Python

  • quilt3 install foo/bar/subdirectory

  • Bug fixes for CopyObject and other exceptions

3.1.10 - 2020-01-29 - 3.1.10

Python Client

  • Fix bug introduced in 3.1.9 where uploads fail due to incorrect error checking after a HEAD request to see if an object already exists (#1512)

3.1.9 - 2020-01-29 - Release 3.1.9

Python Client

  • quilt3 install now displays the tophash of the installed package (#1461)

  • Added quilt3 --version (#1495)

  • Added quilt3 disable-telemetry CLI command (#1496)

  • CLI command to launch catalog directly to file viewer - quilt3 catalog $S3_URL (#1470, #1487)

  • No longer run local container for quilt3 catalog (#1504). See (#1468, #1483, #1482) for various bugs leading to this decision.

  • Add PhysicalKey class to abstract away local files vs unversioned s3 object vs versioned s3 object (#1456, #1473, #1478)

  • Changed cache directory location (#1466)

  • More informative progress bars (#1506)

  • Improve support for downloading from public buckets (#1503)

  • Always disable telemetry during tests (#1494)

  • Bug fix: prevent misleading CLI argument abbreviations (#1481) such as --to referring to --tophash

  • Bug fix: background upload/download threads are now killed if the main thread is interrupted (#1486)

  • Performance improvements: load JSONL manifest faster (#1480)

  • Performance improvement: If there is an error when copying files, fail quickly (#1488)

Catalog

  • Better package listing UX (#1462)

  • Improve bucket stats visualization when there are many categories (#1469)

3.1.8 - 2019-12-20 - Catalog Command Fixes and Performance Improvements

Python API

  • Bug-fixes for quilt3.config and quilt3.catalog

  • Performance improvements for Packages

Catalog

  • Updated landing page

3.1.7 - 2019-12-13 - Package Cache

Catalog

  • New LOCAL mode for running the catalog on localhost

Python API

  • quilt3 catalog command to run the Quilt catalog on your local machine

  • quilt3 verify compares the state of a directory to the contents of a package version

  • Added a local file cache for installed packages

  • Performance improvements for upload and download

  • Support for short hashes to identify package versions

  • Adding telemetry for API calls

3.1.6 - 2019-12-03 - Package rollback

API Improvements

  • Implement Package.rollback

  • Drop support for object metadata (outside of packages)

  • Change the number of threads used when installing and pushing from 4 to 10 (S3 default)

  • Misc bug fixes

3.1.5 - 2019-11-20 - Catalog and API improvements

Catalog

  • Fix package listing for packages with more 100 revisions

  • Add stacked area charts for downloads

  • 2-level file-extensions for bucket summary

Python

  • Fix uploads of very large files

  • Remove unnecessary copying during push

3.1.4 - 2019-10-17 - Release v3.1.4

3.1.3 - 2019-10-11 - Release v3.1.3

  • Bug fix: when adding python objects to a package a temporary file would be created and then deleted when the object was pushed, leading to a crash if you tried to push that package again (PR #1264)

3.1.2 - 2019-10-11

  • Added support for adding an in-memory object (such as a pandas.DataFrame) to a package via package.set()

  • Fix to work with pyarrow 0.15.0

  • Performance improvements for list_packages and delete_package

  • Added list_package_versions function

3.0.0 - 2019-05-24 - Quilt 3 (formerly Quilt T4) Initial Release

This is the initial release of the new and improved Quilt 3 (formerly Quilt T4). For more information refer to the documentation.

2.9.15 - 2019-01-09 - Teams Config

Compiler

Adds a feature to allow quilt config to set a registry URL for a private Teams registry.

2.9.14 - 2018-12-20 - Push Package by Hash

Compiler

  • Adding a hash argument to quilt.push to allow pushing any package version to a registry.

Registry

  • Make object sizes required.

  • Update urllib3 version for security patch

Docs

  • Improved instructions for running registries.

2.9.13 - 2018-11-12 - Fix ascii decoding bug

  • Fix an ascii decoding issue related to ellipses …

2.9.12 - 2018-10-11 - Pyarrow 0.11 compatibility

Make Quilt work with pyarrow 0.11

  • Update Parquet reading code to match the API change in pyarrow 0.11.

  • Fix downloading of zero-byte files

2.9.11 - 2018-09-11 - Save Objects to Existing Packages

Compiler

  • New helper function quilt.save adds an object (e.g., a Pandas DataFrame) to an existing package by performing a sub-package build and push in a single step

  • BugFix: quilt.load now correctly returns sub-packages (fixes issue #741)

Registry

  • Send a welcome email to new users after activation

2.9.10 - 2018-08-08 - Minor updates and improved documentation

Compiler

  • fixes an issue with packages created on older versions of pyarrow

  • improves readability for quilt inspect

  • allow adding a node with metadata using sub-package build/push

Registry

  • adds documentation for running a private registry in AWS

2.9.9 - 2018-07-31 - Bug fixes

  • Suppress numpy warnings under Python 2.7

  • Fix subpackage build and push

2.9.8 - 2018-07-30 - Flask-internal Authentication

Compiler

  • Added support for sub-package build and push to allow updates to allow adding nodes to large packages without materializing the whole package

  • First-class support for ndarray

Registry

  • Replaced dependence on external OAuth2 provider with a built-in authentication and session management

  • Registry support for sub-package push

Catalog

  • Updated to support new registry authentication

2.9.7 - 2018-07-11 - Asa extensions

Compiler

  • added Bracket accessor for GroupNodes

  • asa.plot to show images in packages

  • asa.torch to convert packages to PyTorch Datasets

  • Enforce fragment store as read-only

Catalog

  • Added source maps and CI for catalog testing

2.9.6 - 2018-06-13 - Documentation and Bugfixes

Documentation

Expands and improves documentation for working with Quilt packages.

Bug fixes and small improvements

  • Load packages by hash

  • Choose a custom loader for DataNodes with asa=

Registry

  • Specify Ubuntu version in Dockerfiles

2.9.5 - 2018-05-23 - Package Filtering

Catalog

  • display package traffic stats in catalog

Compiler

  • filter packages based on per-node metadata

  • get/set metadata for package nodes

  • support custom loaders in the _data method

Registry

  • package commenting

2.9.4 - 2018-04-20 - Metadata only package install

Compiler

  • Metadata-only package install

  • Build DataFrames from existing Parquet files

  • Remove HDF5 dependencies

  • Code cleanup and refactoring

Registry

  • Option for metadata-only package installs

  • New endpoint for fetching missing fragments (e.g., from partially installed packages)

  • Improved full-text search

2.9.3 - 2018-03-20 - Package Composition

Compiler:

  • Allow building packages out of other packages and elements from other packages. A new build-file keyword, package inserts a package (or sub-package) as an element in the package being built.

Catalog:

  • Upgrade router and other dependencies

  • Display packages by author

2.9.2 - 2018-03-01 - Quilt Teams

Catalog Changes to support private registries

  • Amin UI for controlling users and access

  • Auditing views

Globbing for package builds

  • Allow specifying sets of input files in build.yml

Command-line support for private registries

  • Specify teams packages

  • Admin commands to create and activate/deactivate users

2.9.1 - 2018-02-06 - Better Progress Bar

Version 2.9.1 introduces a better progress bar for installing (downloading) Quilt packages. Quilt push now sends objects' uncompressed size to the registry. The progress bar is now based on the total bytes downloaded instead of the number of files.

2.9.0 - 2018-02-02 - Shared Local Package Storage

Shared Local Package Storage

Import packages from shared local directories to save storage overhead and network traffic when sharing packages on the same local network.

Registry Install Stats

Log package installs in the registry to display stats on package use.

New Python API commands

  • generate

  • rm

  • search

Drop support for Python 3.4

(BETA) Team Registries

Updates to commands and local storage to allow users to connect to different registries to support teams running private registries for internal sharing.

2.8.4 - 2018-01-24 - Fix download retry

Fixes a bug in download that prevented retrying failed downloads.

2.8.3 - 2018-01-19 - Remove Unneeded Pandas dependency

186 introduced an undeclared dependency on Pandas >= 0.21.0 (by catching ParserError during CSV parsing). This release removes that dependency and resolves #291.

2.8.2 - 2018-01-17 - Hotfix for several quilt commands

PR https://github.com/quiltdata/quilt/pull/290

2.8.1 - 2018-01-10 - Add Quilt Catalog

Quilt Catalog

Source for the Quilt data catalog is now included in this repository.

MySQL->Postgres

Ported the Quilt registry from MySQL to Postgres

Docker Compose

Improvements to the docker configuration that allows running the registry, catalog, database and authentication service from Docker compose.

Parallel Download

Data fragments can now be downloaded in parallel leading to much faster package installs for large packages.

2.8.0 - 2017-12-07 - Centralized Local Package Store

Release Highlights

Quilt packages live in a centralized location on your machine

Quilt data packages are now available wherever you run Python. We recommend that users quilt push all local packages to the registry before upgrading. Further details on migration are here.

Faster builds with build cache

Quilt now caches build intermediates. So if you wish to update the README of a multi-gigabyte package, you can rebuild the entire package in one second.

Group-level build parameters

You can now specify build parameters (like transform) for all children of a group in one shot. The updated syntax and docs are here.

quilt.yml is like requirments.txt but for data

You can now express dependencies on multiple packages in a single file. Docs here.

Experimental: build a package from a GitHub repo

Quilt build now accepts GitHub URLs. If you use data stored on GitHub you can turn it into a Quilt package with quilt build.

2.7.1 - 2017-11-09 - Checks: unit tests for data packages

Version 2.7.1 includes several minor bug fixes and one new feature, checks. Checks allow a user to specify data integrity checks that are enforced during quilt build.

2.7.0 - 2017-08-18 - Subpackages and more efficient uploads/downloads

  • Support installing subpackages as quilt install usr/pkg/path

  • Upload fragments in parallel

  • Use http sessions when accessing S3

2.6.3 - 2017-07-22 - Clear session to prevent quilt.login() bugs in Jupyter

2.6.1 - 2017-07-20 - Package Delete

This release adds a new command to delete a package including all versions and history from the registry.

2.6.0 - 2017-07-14 - Fast Builds for Large Packages

Building a package from a directory of input files now skips generating a build file. That speeds up the build process and makes it easier to change the package contents and rebuild.

2.5.1 - 2017-07-06 - Push Public Packages

This release includes support for paid plans on quiltdata.com and is recommended for all individual and business-plan users. It adds a shortcut to push packages and make them public in a single command and improves documentation.

Contents
3.2.1 - 2020-10-14
Python API
CLI
3.2.0 - 2020-09-08 - Package Registry Refactor
Python:
Catalog & Lambdas:
3.1.14 - 2020-06-13 - Python API features, fixes, catalog capabilities, backend optimizations
Catalog
Python
Backend
3.1.13 - 2020-04-15 - Windows Support
Python API
Catalog
3.1.12 - 2020-03-11 - Command line push
3.1.11 - 2020-03-10 - New command line features, bug fixes
3.1.10 - 2020-01-29 - 3.1.10
Python Client
3.1.9 - 2020-01-29 - Release 3.1.9
Python Client
Catalog
3.1.8 - 2019-12-20 - Catalog Command Fixes and Performance Improvements
Python API
Catalog
3.1.7 - 2019-12-13 - Package Cache
Catalog
Python API
3.1.6 - 2019-12-03 - Package rollback
API Improvements
3.1.5 - 2019-11-20 - Catalog and API improvements
Catalog
Python
3.1.4 - 2019-10-17 - Release v3.1.4
3.1.3 - 2019-10-11 - Release v3.1.3
3.1.2 - 2019-10-11
3.0.0 - 2019-05-24 - Quilt 3 (formerly Quilt T4) Initial Release
2.9.15 - 2019-01-09 - Teams Config
Compiler
2.9.14 - 2018-12-20 - Push Package by Hash
Compiler
Registry
Docs
2.9.13 - 2018-11-12 - Fix ascii decoding bug
2.9.12 - 2018-10-11 - Pyarrow 0.11 compatibility
Make Quilt work with pyarrow 0.11
2.9.11 - 2018-09-11 - Save Objects to Existing Packages
Compiler
Registry
2.9.10 - 2018-08-08 - Minor updates and improved documentation
Compiler
Registry
2.9.9 - 2018-07-31 - Bug fixes
2.9.8 - 2018-07-30 - Flask-internal Authentication
Compiler
Registry
Catalog
2.9.7 - 2018-07-11 - Asa extensions
Compiler
Catalog
2.9.6 - 2018-06-13 - Documentation and Bugfixes
Documentation
Bug fixes and small improvements
Registry
2.9.5 - 2018-05-23 - Package Filtering
Catalog
Compiler
Registry
2.9.4 - 2018-04-20 - Metadata only package install
Compiler
Registry
2.9.3 - 2018-03-20 - Package Composition
Compiler:
Catalog:
2.9.2 - 2018-03-01 - Quilt Teams
2.9.1 - 2018-02-06 - Better Progress Bar
2.9.0 - 2018-02-02 - Shared Local Package Storage
Shared Local Package Storage
Registry Install Stats
New Python API commands
Drop support for Python 3.4
(BETA) Team Registries
2.8.4 - 2018-01-24 - Fix download retry
2.8.3 - 2018-01-19 - Remove Unneeded Pandas dependency
186 introduced an undeclared dependency on Pandas >= 0.21.0 (by catching ParserError during CSV parsing). This release removes that dependency and resolves #291.
2.8.2 - 2018-01-17 - Hotfix for several quilt commands
2.8.1 - 2018-01-10 - Add Quilt Catalog
Quilt Catalog
MySQL->Postgres
Docker Compose
Parallel Download
2.8.0 - 2017-12-07 - Centralized Local Package Store
Release Highlights
Quilt packages live in a centralized location on your machine
Faster builds with build cache
Group-level build parameters
quilt.yml is like requirments.txt but for data
Experimental: build a package from a GitHub repo
2.7.1 - 2017-11-09 - Checks: unit tests for data packages
2.7.0 - 2017-08-18 - Subpackages and more efficient uploads/downloads
2.6.3 - 2017-07-22 - Clear session to prevent quilt.login() bugs in Jupyter
2.6.1 - 2017-07-20 - Package Delete
2.6.0 - 2017-07-14 - Fast Builds for Large Packages
2.5.1 - 2017-07-06 - Push Public Packages