The first level of the t3 process aim to standardize logbook data. These standardization was divided into 8 steps:
Process | Code/method associated | Class(es) associated | Observations |
---|---|---|---|
Process 1.1: Raising Factor level 1 | rf1() |
full_trips | |
Process 1.2: Raising Factor level 2 | rf2() |
full_trips | |
Process 1.3: logbook weight categories conversion | conversion_weigth_category() |
full_trips | |
Process 1.4: positive sets count | set_count() |
full_trips | |
Process 1.5: set duration calculation | set_duration() |
full_trips | In hours |
Process 1.6: time at sea calculation | time_at_sea() |
full_trips | In hours |
Process 1.7: fishing time calculation | fishing_time() |
full_trips | In hours |
Process 1.8: searching time calculation | searching_time() |
full_trips | In hours |
Catches filled in logbooks are based on visual estimations when they rise from the purse senner salabarde. This is the only technical way available at this moment to have an estimation by species. Thereafter, catches will be stored in one or several wells. During the landing process, well catches will be weighed, generally through a peson located under the unloading crane. By move from set to well(s) we lost the exact information about catch location.
Regarding this process, for each full trip we will calculate a
Raising Factor (RF), ratio of landing weight on catch weight. To recall,
one full trip should be composed of one or several partial trip, for
example if a vessel unloads a part of these wells and go back to the sea
with not empty well(s). Furthermore, a RF level 1 value is normally
between 0.8 and 1.2. If any values were out of range, an investigation
if necessary for data quality control. This range could be modified
through function parameters rf1_lowest_limit
(by default at
0.8) and rf1_highest_limit
(by default at 1.2). In
addition, you will find in the function log a warning for each RF level
1 value out of range. This ratio will be applied to logbook catches to
adjust declared values (by rising or lower them).
Be careful, so far in the process all RF level 1 values will be applied to the logbook catches associated, even if values are out of range. It’s up to the user to check the log file and correct data, if necessary, before moving to the next process.
Furthermore, not all the well species are weighted during the landing
process. Only several tunas species are purchased (and weighted by
association). In addition, these species should be different for each
fleet, especially because canneries don’t purchased through the same
rules according to the fleet. This raising factor will be estimated only
on theses species. Species list could be modified through the function
parameter species_rf1
. For now theses following selections
were validated since several years:
for French (French and Mayotte fleets), use codes 1 (Thunnus albacares), 2 (Katsuwonus pelamis), 3 (Thunnus obesus), 4 (Thunnus alalunga), 9 (mix of tunas species) and 11 (Thunnus tonggol).
for Spain (Spanish and Seychelles fleets), use codes 1 (Thunnus albacares), 2 (Katsuwonus pelamis), 3 (Thunnus obesus), 4 (Thunnus alalunga), 5 (Euthynnus alletteratus), 6 (Auxis thazard), 9 (mix of tunas species) and 11 (Thunnus tonggol).
In addition, if logbook(s) are missing we can’t apply the process. In
that case, we will tried to apply the process of Raising Factor level 2
(see section below regarding the process rf2()
).
By the way, there are several cases when the RF level 1 can’t be calculated. Take a look to the diagram below to have a brief overview of the process. You could notify that the structure of the process should be strange in terms of informatics efficiency (for example with repetitions of some steps, not necessarily on the first look). This is perfectly normal because we think about the potential improvement of the process in the future and we let the field open for an easier implementation.
In some cases, especially for historial data, several logbooks trips were not collected or available. Therefore it’s impossible to apply the process 1.1. Don’t do anything for this trips would result in an under-estimation of catches among the global process. Of course and like the process 1.1, we made the hypostesis that all the landings were know and available.
So far, this process is partially implemented in the source code and in our data simulation since 2000 we haven’t been faced with trips with missing logbooks. Due to time constraints, the development of this process has been postponed, but it will be essential to finalise it when we run our process on the whole time serie (and when we manipulated historical data). The figure below show the actual development of the process.
The core of the t3 process is to consider that the species composition of the logbooks was biased. Furthermore, logbook information related to global weight catches and weight categories are correct. Indeed, the weight categories of individuals as a strong influence on their commercial value. Furthermore, these weight categories change from one tuna fishing company to another, involves overlaps and are hardly usable directly from the logbook.
This process aims to homogenize these weight categories and create simplify categories divided in function of the fishing school and the ocean:
For each layer ocean/fishing school/specie/logbook weight category, a distribution key is applied for conversion to standardized weight categories. Details of the distribution key is available in tables below.
Specie | Logbook weight category | Weight ratio | Standarised logbook weigth category |
---|---|---|---|
YFT, BET & ALB | 1 | 1 | < 10kg |
YFT, BET & ALB | 2 | 1 | < 10kg |
YFT, BET & ALB | 3 | 1 | 10-30kg |
YFT, BET & ALB | 4 | 0,2 | < 10kg |
YFT, BET & ALB | 4 | 0,8 | 10-30kg |
YFT, BET & ALB | 5 | 1 | > 30kg |
YFT, BET & ALB | 6 | 0,5 | 10-30kg |
YFT, BET & ALB | 6 | 0,5 | > 30kg |
YFT, BET & ALB | 7 | 1 | > 30kg |
YFT, BET & ALB | 8 | 1 | > 30kg |
YFT, BET & ALB | 10 | 1 | < 10kg |
YFT, BET & ALB | 11 | 0,1 | 10-30kg |
YFT, BET & ALB | 11 | 0,9 | > 30kg |
YFT, BET & ALB | 12 | 1 | 10-30kg |
YFT, BET & ALB | 13 | 1 | > 30kg |
YFT, BET & ALB | 14 | 1 | > 30kg |
SKJ | All | 1 | < 10kg |
Others | All | 1 | Unknown |
Specie | Logbook weight category | Weight ratio | Standarised logbook weigth category |
---|---|---|---|
YFT, BET & ALB | 1 | 1 | < 10kg |
YFT, BET & ALB | 2 | 1 | < 10kg |
YFT, BET & ALB | 3 | 1 | > 10kg |
YFT, BET & ALB | 4 | 0,2 | < 10kg |
YFT, BET & ALB | 4 | 0,8 | > 10kg |
YFT, BET & ALB | 5 | 1 | > 10kg |
YFT, BET & ALB | 6 | 1 | > 10kg |
YFT, BET & ALB | 7 | 1 | > 10kg |
YFT, BET & ALB | 8 | 1 | > 10kg |
YFT, BET & ALB | 10 | 1 | < 10kg |
YFT, BET & ALB | 11 | 1 | > 10kg |
YFT, BET & ALB | 12 | 1 | > 10kg |
YFT, BET & ALB | 13 | 1 | > 10kg |
YFT, BET & ALB | 14 | 1 | > 10kg |
SKJ | All | 1 | < 10kg |
Others | All | 1 | Unknown |
Among the process, there are two major step:
During the first part, all logbook weight categories except the category unknow are standardised following the distribution conversion key display above. Each catch weight associated is multiply by the weight ratio associated. For several logbook weight category, like the categories 4 or 11 according to strata, this standardisation create new data because the initial logbook weight category is divied into several standardised logbook weight categories. It’s normal that at the end of the process you could have more catch items than before.
For the second part, the process try to make the conversion of catches filled in the unknown category. For each trip and at the scale of an ocean/fishing school/specie, proportion of standardised logbook weight categories will be applied on items filled as unknown logbook weight categories. For example, in a common stratum, if proportion weight of standardised logbook weight category are divided by 60% for the category < 10kg and 40% for the category > 10kg, catches weight of unknown logbook weight category will display in standardised logbook weight category with 60% of < 10kg category and 40% of > 10kg category.
The global process flowchart is available in the figure 3 below.
In this process, for each activity filled as null set (0), positive set (1), unknown set (2) or pocket capsizing (14) in the data, a count of set is done according to the catches associated. Like before, the global process flowchart is available below in figure 4.
The aim of this process is to calculate the duration of each set in hours. Like the process before, for each activity filled as null set (0), positive set (1), unknown set (2) or pocket capsizing (14), the set duration is calculated according to a linear function with two parameters a and b. These are found through a reference table, for each year, ocean, fishing school and country. Furthermore, a specific value provided for a null set (a fishing set without any catches associated). An example of this reference table is available below in the table 3.
Year | Country | Ocean | Fishing school | a | b | Null set value |
---|---|---|---|---|---|---|
2014 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2014 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2014 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2015 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2015 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2015 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2016 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2016 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2016 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2017 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2017 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2017 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2018 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2018 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2018 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2019 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2019 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2019 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2020 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2020 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2020 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
2021 | Seychelles | 2 | 1 | 0,33235 | 127,1 | 120 |
2021 | Seychelles | 2 | 2 | 0,33235 | 127,1 | 120 |
2021 | Seychelles | 2 | 3 | 0,33235 | 127,1 | 120 |
In addition the global process flowchart is available in the figure 5 below.
This step aim to make an estimation, at the trip scale, of the time at sea in hours. For each trip we have several possibilities:
The global process flowchart is available in the figure 6 below.
This process calculated the fishing time value in hours. Historialcaly, this process use a constant parameter according to the theorical day duration working time (12 or 13 hours according to the ocean and related to the global sunrise and sunset). Now, the process was modified to bring more accuracy in the calculation and module the duration of a working day according to the real sunrise and sunset of each day.
These parameters could be modify in the parameters
sunrise_schema
and sunset_schema
and use in
background the R package suncalc. The
function associated use latitude and longitude data to provide sunrise
and sunset times of the selected day. Furthermore, you can switch
between several values for these two parameters to adjust the
process:
At the trip scale and related to all the activities related, the fishing time value is the sum of each fishing time day if in the day activites associated at least one is different of no-wake road (4), damage (7), on hold (10), at the harbour (15) or fictive activity for trip without logbook (100).
The global process flowchart is available in the figure 7 below.
The aim of this last step is to calculate the searching time. Like the previous variables, it is expressed in hours and equal to the fishing time value minus the sum of the sets duration values.
The global process flowchart is available in the figure 8 below.