Riak Test v1.1.0 #786

jburwell · 2015-04-16T14:50:18Z

This PR is not intended to be merged at this time. It is being used to review the work done thus far, and begin the process of cleanup for merge into master.

Riak Test v1.1.0 ... more writeup to come.

/cc @javajolt

jburwell · 2015-04-16T15:09:01Z

regression_test_wrapper.sh

@@ -0,0 +1,38 @@
+#!/bin/bash


Should we remove this script prior to merge? We had always intended to remove it, but it may prove useful until for verifying changes until the v1 wrapper is dropped in v2.0.0.

I think it's still useful for now. Easy to remove later.

jburwell · 2015-04-16T16:20:35Z

Notes on the planner design

rt_planner and rt_reporter are good starts towards the pipeline design I have in mind for r_t. After reviewing the initial implementation, I have the following observations:

The GiddyUp dependency can be further isolated in the codebase
The riak_test_escript module contains more test planning work than necessary
Isolate the use of rt_config to the riak_test_escript module as much as possible to simplify component unit testing

Ideally, we have a pipeline constructed with the following stages:

planner->scheduler->runner

Each of these stages can emit a test_result event that is broadcast to a list of reporters (e.g. console, giddyup).

A planner stage accepts a list of tests, backends, and groups and creates a set of test plans to be scheduled for execution. The following is (roughly) the test planner API I have in mind:

-spec start_link(pid(), [pid()], proplists:proplist() -> rt_util:result().
start_link(SchedulerPid, ReporterPids, Options) -> %% the options are implementation dependent

-spec plan([atom()], [backend()], [atom()]) -> {ok, non_neg_integer()} | rt_util:error().
plan(Tests, Backends, Groups) ->  %% When the test list is empty, run all available tests

A scheduler stage accepts a test plan, reserves the quantity of nodes requested by the plan, and submits the schedule to an executor stage for execution. The following is (roughly) the test scheduler API I have in mind:

-spec start_link(pid(), [pid()], proplists:proplist()).
start_link(ExecutorPid, ReporterPids, Options) -> %% Options are implementation dependent

-spec schedule(#rt_planner:test_plan()) -> rt_util:result().
schedule(TestPlan) ->

Finally, a Runner stage provisions the nodes allocated in the test schedule, executes the test specified in the test schedule, and reports the results of the test run the to reporters. The following is (roughly) the API I have in mind:

-spec start_link([pids()], proplists:proplist()) -> rt_util:result().
start_link(ReporterPids, Options) ->

-spec run(rt_scheduler:test_schedule) -> rt_util:result().
run(TestSchedule) ->

Based on this (high-level) design, I propose the following refactorings:

Convert the giddyup module to a gen_server that has the giddyup configuration information passed into its start_link function. In addition to encapsulating the GiddyUp configuration, it will also allow asynchronous communication with GiddyUp to prevent blocking of the pipeline if/when we have a slow connection.
Refine the rt_planner module to define the planner behavior and any common functions used by all planners
Move all GiddyUp related planning functions into a new rt_giddyup_planner module -- removing the notion of wrapping test plans in the escript module. If a list of tests, backends, and/or groups is provided to this planner, they will be used to filter the list of test plans retrieved from GiddyUp.
Create an rt_simple_planner module which is used when a list of tests is specified on the command line.
Add a build_test_pipeline function to the riak_test_escript that selects and starts the planner, scheduler, and reporter(s) and ties them together for execution.

/cc @javajolt

* Change riak_test API to add a properties and setup function in addition to confirm. The goal is to remove test environment specification from actual test case logic. * Change confirm/0 to confirm/2 to accept data about the test environment as input. * Add capability for any test to be run as a rolling upgrade test. * Add rt_cluster module with some supporting functions. * Change verify_build_cluster and secondary_index_tests modules to conform to the new API in order to demonstrate the changes.

- functions include set_conf/2, set_advanced_conf/2, and update_app_config/2.

- Move rt module functions to new rt_http module. - Convert http_bucket_types to new test convention.

- Have lager fire up an extra handler for each test run - Place lager output in riak_test.log and upload to GiddyUp - Remove rt.hrl and move the rt_webhook record into giddyup.erl - Remove unused GiddyUp code from escript - Let the test_runner know the name of the log directory so it knows where to put the riak_test.log - Add separate flag to rt_reporter to indicate the need to upload files to GiddyUp - Always copy log files to local directory before uploading to GiddyUp

- Always copy log files to local directory - Removed unused command-line options in escript - Get rid of extraneous commented-out code - Move as much GiddyUp-specific logic into `giddyup' - Correctly upload config files - Upload files asynchronously to scheduler can reclaim used nodes

Add a few notes about debugging and version info

…ddyup_host is not set in config file

Add comment and example of `continue_on_fail'

can be run

rt_harness_util:deploy_nodes/5 to handle cuttlefish

- Default backend in rt_planner is now undefined - All lists of backends are now atoms - Command-line limits backends when using GiddyUp - Command-line now properly generates multiple tests when a list of backends is specified - "multi" backend added for verify_2i_aae-multi for GiddyUp

tuples when building a cluster

- rt_host defines a protocol for performing operations on a host (e.g. exec, mkdir, etc). Adds rt_local_host for working on a local host - Moves various general purpose functions to rt_util - Introduces rt_driver to define extension points for product specific test planning and scheduling activities

- Required to get legacy upgrade tests to pass (e.g. BTA-231, BTA-232) - Upgrade path name is in rt_test_plan currently for execution - Fix get_node_logs/0 for verify_handoff_mixed (BTA-75), yz_rs_migration_test and yz_solr_start_timeout - Fix rtdev:node_id/0 for kv679_dataloss (BTA-218) and verify_riak_stats (BTA-174)

- Changed hard-coded devrel path in test - Update relpath/1 to resolve new version paths

cluster construction from rt_riak_cluster.

jburwell reviewed Apr 16, 2015
View reviewed changes

kellymclaughlin and others added 20 commits April 22, 2015 13:40

convert bucket_types test to decoupled r_t framework

8348835

convert basic_command_line test to decoupled r_t framework

e69f962

convert bucket_types test to decoupled r_t framework

86b8c6f

convert bucket_props_roundtrip test to decoupled r_t framework

5fe4c03

Move replication tests to separate subdirectory

381ff9f

Move rebar plugin to separate directory

d15efa0

Add missing node function to rt.erl

333e67d

Remove direct calls to rtdev harness from replication tests

c7c8dfc

Move configuration functions to rt_config.

263cd68

- functions include set_conf/2, set_advanced_conf/2, and update_app_config/2.

Continue to move config functions from rt to rt_config.

ba4ee80

Migrate several more functions into rt_cluster from rt.

ff24f91

Move backend related functions from rt module to new rt_backend module.

1267503

Move protobuf-related functions from rt module to rt_pb.

6267aa2

Fix rt_pb function exports.

8c9f6f1

Refactor http-relate rt functions

defdf55

- Move rt module functions to new rt_http module. - Convert http_bucket_types to new test convention.

Move node-related functions from rt module to rt_node.

56b10d7

Move rt:brutal_kill to rt_node.

e1f0d45

Move ring-related rt functions to rt_ring; some cleanup.

891f366

Move command-line oriented rt functions to rt_cmd_line.

80005cd

Brett Hazen added 5 commits April 22, 2015 14:37

Re-sync all tests from master

62f03fd

Fix creation of test plans from GiddyUp

a529f17

Report full test names when showing which tests to run or not run

d50506c

hazen force-pushed the feature/mixed-mode-riak-test branch from 15a7493 to d50506c Compare April 22, 2015 20:45

Brett Hazen added 15 commits April 22, 2015 14:49

Sync up botched rebase of rt.erl

c55fd46

Use the full name of a test case (including backend) when reporting

4d70aaa

Update riak_test.config

cc9fd32

Add a few notes about debugging and version info

Add better debugging messages when no tests are specified and when gi…

6b6afe2

…ddyup_host is not set in config file

Update riak_test.config

c59d537

Add comment and example of `continue_on_fail'

Change how results table is written to lager file

8704610

Fix unit tests for rt_config

a3671c9

Register Erlang setup before determining which tests to run so YZ tests

d6dca14

can be run

Addresses BTA-207 to fix cuttlefish_configuration test and

92fd1e5

rt_harness_util:deploy_nodes/5 to handle cuttlefish

Close files after making local copies for GiddyUp

1428fca

Fix clique table for atoms as failure reasons

fe374ec

Add unit tests for results table

2019b2e

Resync tests from master

a212c40

Put lib directory back in git because intercepts are stored there

d3975f6

hazen force-pushed the feature/mixed-mode-riak-test branch from 7edbe91 to deebded Compare May 13, 2015 21:22

Brett Hazen and others added 8 commits May 14, 2015 14:51

Handle the case where versions are single atoms, not version/config

f25fbac

tuples when building a cluster

Update README.md

25c80f9

Update README.md

e76c192

Fix kv679_dataloss for new mixed framework

524833b

- Changed hard-coded devrel path in test - Update relpath/1 to resolve new version paths

Fix deploy_nodes/1 versions for riak667_safe

f53e630

- WIP: Breaking change that is untested for node defintions to support

b446979

cluster construction from rt_riak_cluster.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Riak Test v1.1.0 #786

Riak Test v1.1.0 #786

jburwell commented Apr 16, 2015

jburwell Apr 16, 2015

hazen Apr 16, 2015

jburwell commented Apr 16, 2015

Riak Test v1.1.0 #786

Are you sure you want to change the base?

Riak Test v1.1.0 #786

Conversation

jburwell commented Apr 16, 2015

jburwell Apr 16, 2015

Choose a reason for hiding this comment

hazen Apr 16, 2015

Choose a reason for hiding this comment

jburwell commented Apr 16, 2015

Notes on the planner design