nonlinear fit of experimental data

Question

Maura E. Monville on 19 Aug 2019


Direct link to this question

https://es.mathworks.com/matlabcentral/answers/476749-nonlinear-fit-of-experimental-data

Commented: Maura E. Monville on 9 Sep 2019

Dear MATLAB experts,

I would like to build a nonlinear regression model that fits my experimental data 'Mk_Superf_FSF' as a function of the independent variables 'MaxFDiam' and 'MinFDiam', which are respectively the maximum and minimum diameters of an arbitrarily shaped, closed and connected 2D surface. I also have the variable 'Area', but it is obviously correlated with the max and min diameters, so I think it is unwise to use it as well.

A linear fit was suggested to me for the experimental data (see attached picture). The 7th-order polynomial p(x) fits the data very well, but the suggested formula is non-physical: the variable x is a sum of quantities with different units:

x = MaxFDiam * MinFDiam + Area / MaxFDiam + Area / MinFDiam + MaxFDiam / MinFDiam

I cannot assign units to the resulting sum, because the product of the two diameters has units [mm^2], Area/MaxFDiam and Area/MinFDiam have units [mm], and the ratio of the two diameters is unitless.

I tried to fit a sum of two negative exponentials, with the Area and the product of the diameters as the respective exponents. MATLAB complained that the Jacobian has a column of all zeros. I tried some other combinations of exponential functions; again MATLAB complained, stating that the model returns "NaN" or "Infinity".

At other times MATLAB reported that the maximum number of iterations had been exceeded.
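One possible cause of the all-zero Jacobian column, offered as an assumption since the exact exponential model was not posted: with areas of tens to hundreds of mm^2 in the exponent, a term like a*exp(-b*Area) underflows toward zero for a starting value such as b = 1, so the derivative with respect to a is effectively zero at every observation. The sketch below uses the Area values posted later in this thread; the starting value b0 and the rescaling are illustrative choices, not part of the original attempt.

```matlab
% Area values [mm^2] as posted later in this thread
Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 ...
        159.29 201.80 220.96 271.45 282.36 327.04 277.54]';

b0 = 1;                              % illustrative starting value for the decay rate

% For a term a*exp(-b*Area), the Jacobian column d/da is exp(-b*Area)
colRaw = exp(-b0 * Area);

% Rescale the predictor to (0, 1] so the exponent stays of order one
AreaScaled = Area / max(Area);
colScaled  = exp(-b0 * AreaScaled);

fprintf('largest raw entry:     %g\n', max(colRaw));
fprintf('smallest scaled entry: %g\n', min(colScaled));
```

If the raw column is tiny everywhere, rescaling the predictors (or choosing b0 on the order of 1/max(Area)) is a cheap thing to try before changing the model.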

I tried a power-law fit as follows:

coeffs0 = [0.8672 1 1]

opts = statset('fitnlm');

opts.RobustWgtFcn = 'bisquare';

X = [MaxFDiam' MinFDiam'];

mdlfun = @(coeff, X) coeff(1)* X(:,1).*X(:,2).^coeff(2) + coeff(3);

mdl = fitnlm(X,Mk_Superf_FSF',mdlfun, coeffs0,'Options', opts, 'CoefficientNames', {'a' , 'b', 'c'});

This time MATLAB did not complain, but the resulting model is anything but good. The R^2 value is awful, and the p-values are very high except for one.

mdl = 
Nonlinear regression model:
    y ~ a*x1*x2^b + c
Estimated Coefficients:
         Estimate       SE         tStat       pValue  
         ________    ________    ________    __________
    a    0.007407     0.02121     0.34922       0.73352
    b    -0.26657     0.87603    -0.30429       0.76658
    c     0.93603    0.048648      19.241    8.0919e-10
Number of observations: 14, Error degrees of freedom: 11
Root Mean Squared Error: 0.0363
R-Squared: 0.215,  Adjusted R-Squared 0.0721
F-statistic vs. constant model: 1.51, p-value = 0.264

Maybe the model is not right. Maybe the initial parameter values are not good.....

I would greatly appreciate some help in getting a decent fit. Above all, I would like to learn techniques to:

(1) devise the model formula

(2) choose the initial parameter values

Thank you so much for any suggestion and help.

Best regards,

Maura E. M.

14 comments

dpb on 19 Aug 2019

Whassup w/the first observation? Looks like complete outlier.

The response versus min diameter also looks peculiar w/ a couple of points in the middle that are very far out of line with adjacent before/after.

The response versus the area variable is far more well behaved than either of the diameter variables with again, the exception of the first point just isn't even close to the rest.

I'm not at all surprised the model doesn't fit well...you talk of the linear combination factors model not being physical; what does the system represent and is there any physical correlation that it should follow to guide the model? That's always the best thing to have if there is anything one can use.

As far as units on the other expression, while it's not the question you asked nor necessarily the best way to fit, one simply assigns units to the coefficients as needed to match the independent variables such that the fitted response does have the right units. Granted, the resulting coefficients may have no real physical interpretation, but you can make the units work arbitrarily as well as the choice of terms in the fit.
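To make the unit bookkeeping concrete, here is the argument for a generic two-term linear model. The symbols $\beta_i$ and $x_i$ are illustrative and are not the fitted model from the picture:

```latex
y = \beta_0 + \beta_1 x_1 + \beta_2 x_2,
\qquad
[\beta_0] = [y], \quad
[\beta_1] = \frac{[y]}{[x_1]}, \quad
[\beta_2] = \frac{[y]}{[x_2]}
```

For example, if $y$ is unitless and $x_1$ is in mm$^2$, then $\beta_1$ carries units of mm$^{-2}$, so every term in the sum has the units of the response.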

Maura E. Monville on 20 Aug 2019

Custom-Made-Collimator-On-Applicator.jpg

The data I posted are dose measurements of the Field Size Factor (FSF) carried out by a Markus ion chamber.

It is stated in the IAEA TRS-398 Code Of Practice that the field size must be at least twice the transversal size of the sensitive volume of the detector to get a meaningful reading. That's why the 1st observation is indeed an outlier. In fact the field diameter is 6[mm]. Therefore it violates the TRS-398 recommendations. That is, fields whose size is less than 1[cm] require a different detector. The Markus is not the right detector in this case.

What defines the field size is the aperture of a patient-specific collimator (see attached example of a collimator mounted on the applicator). Such collimators are custom-made to conform the radiation beam to the shape of the target (that is the ocular tumour). They are used to treat eye cancers with proton therapy. The length scale of these devices is the [mm].

The Area is obviously calculated from the polygonal that defines the aperture rim.

I extracted some other features, such as the max and min Feret diameters, eccentricity, circularity, bounding box, and so on. The response shows no dependence on any of them except the diameters.

The goal is to find out whether the measured FSF depends on the size and shape of the specific collimator. Looking at the measurements, there seems to be no such dependence if we accept an error of about 0.2%.

The data I provided represent the FSF for 14 different patient-specific collimators.

The difference among the FSF is within 1% with the exception of the outlier.

We are still looking for an explanation of the differences we measured.

You said the Area fits better than the product of the diameters. Did you use a power-law model?

Is there a better model you would suggest?

Thank you.

Regards

dpb on 20 Aug 2019

Edited: dpb on 21 Aug 2019

I hadn't yet fitted anything; I was just exploring the dataset by plotting various ways...fitting blindly w/o visualizing first is fools' errand. I plotted against each of the independent variables (after sorting by the variable) and those were enough to make me ask some questions before trying to go further...here are a couple of the plots--

As noted, these are plotted by sorting on the independent variable and using the sorted index for the response variable. What seems peculiar with the min diameter is the more jagged and the drop in the response for the two cases around 11-12. That is a very difficult detail to fit with precision and just raises questions as to whether is or is not an artifact of the measurement or real.
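The sorted one-at-a-time view described above can be sketched as follows. The Area and Mk_Superf_FSF values are the ones posted elsewhere in this thread; plotting against the sorted values (rather than the sorted index) is an assumption about how the figures were made, and the diameter vectors would be handled the same way.

```matlab
% Data as posted elsewhere in this thread
Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 ...
        159.29 201.80 220.96 271.45 282.36 327.04 277.54];
Mk_Superf_FSF = [0.8672 1.0053 1.008 1.0142 1.0128 1.0142 1.0142 ...
                 1.004 1.0062 1.0031 1.0018 1.0022 1.0022 1.0018];

% Sort by the independent variable and reorder the response with the same index
[AreaSrt, idx] = sort(Area);
FSFSrt = Mk_Superf_FSF(idx);

% One-at-a-time view of the response against the sorted predictor
figure
plot(AreaSrt, FSFSrt, 'o-')
xlabel('Area [mm^2]')
ylabel('Mk\_Superf\_FSF')
title('Response vs. Area (sorted by Area)')
```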

This is an even more detailed look than I had done last night--what this shows is that the max diam and the area are nearly surrogates for each other altho one must remember in these "one at a time" plots the order isn't quite identical; this was done simply to visualize whether there was an apparent correlation with the independent variable of the desired predicted response.

I don't know the definition of the FSF nor how much precision can be presumed to be associated with the observation but with area there would appear to be a peak then a somewhat exponential decrease as area increases. That is pretty-much the gross shape with the two "diameters" and, of course, the area is going to be a function of those albeit given the arbitrary shape there's no direct simple relation there.

I suspect the problem here is that arbitrariness in the shape -- and even if you developed an almost perfect correlation from these data there would be no reason to believe it would hold for another set of observations for which the shapes weren't the same or very similar. Possibly it is that there is a unique feature there that distinguishes the two "funny" cases with the minimum diameters that isn't present in the rest of the samples.

I would wonder if other measures of geometry that try to represent the shapes in more categorical terms might produce better predictors -- like measures of curvature or lobes or such--maybe measures of perimeter might be an indicator of that difference from being just a circular opening, who knows. I'd probably study the outlines of those shapes against the response and see if I could pick out any pattern that seemed to correlate...

Maura E. Monville on 21 Aug 2019

Thank you so much for your deeply enlightening remarks.

I tried to fit a linear regression model after removing the 1st observation which is a physical outlier. I did not come up with any physically meaningful models.

I forgot to point out that the non-physical linear model cannot be made physically meaningful by assigning proper units to the model coefficients. In fact, the model finds coefficients for powers of the variable "x", which itself cannot be assigned any units because it is a sum of terms whose units are different. Just have a look at the picture I sent previously.

I have not tried yet a nonlinear model after removing the outlier.

I can try. Do you think it will make a big difference?

By the way, I have plotted together all the measurements carried out with the Markus ion chamber, regardless of the SOBP used (attached plot). My conclusion, staring at the composite plot, is that the measurements for the Intermediate and Superficial SOBP are very similar. They almost coincide. This may be due to a bias in our choice of representative SOBPs (see picture with the 3 SOBPs).

SOBP = Spread Out Bragg Peak.

It represents the treated tumour depth in the direction of the proton beam.

FSF is a ratio of doses.

FSF = (Dose delivered at the SOBP center by a collimator) / (Dose delivered by a reference collimator)

The Reference collimator is a perfectly circular collimator whose diameter is 25 [mm].

Radiation dose = energy / (unit mass); the SI unit is [J/kg], also called the gray [Gy].

I agree the independent variables are correlated. For sure there is some correlation between the Area and the Max and Min Diameters.

The goal of this project was to find out whether there is a dependence of the Field Size Factor (FSF) on the size and shape of the field and on the SOBP.

The size of the field is determined by the collimator area as the collimator does conform the radiation field. In short, only the protons that pass through the collimator aperture reach the patient. The other ones are stopped in the collimator brass thickness.

I have attached three other collimator shapes that we selected. Actually, I attached the approximated collimator aperture generated by my MatLab code which was necessary to incorporate these custom-made components in the Monte Carlo model of the synchrotron (machine that produces the proton beams). I cannot attach the true collimator pictures as they carry the patient names. It would be a privacy violation.

I used the MATLAB function 'regionprops' to extract the characteristic features of the collimator aperture. I wrote a script to convert the returned area and Feret diameters from pixels into [mm].
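A minimal sketch of that extraction and conversion, assuming an Image Processing Toolbox release whose regionprops supports the Feret properties. The pixel size mmPerPx and the synthetic circular mask are hypothetical stand-ins for the real calibration and aperture image; note that areas scale with the square of the pixel size while diameters scale linearly.

```matlab
mmPerPx = 0.1;                          % hypothetical pixel size [mm/pixel]

% Synthetic binary mask: a disk of radius 75 px, i.e. a 15 mm circular aperture
[xx, yy] = meshgrid(1:400, 1:400);
mask = (xx - 200).^2 + (yy - 200).^2 <= 75^2;

% Area and Feret diameters in pixel units
s = regionprops(mask, 'Area', 'MaxFeretProperties', 'MinFeretProperties');

% Convert to physical units
areaMm2    = s.Area * mmPerPx^2;              % [mm^2]
maxFDiamMm = s.MaxFeretDiameter * mmPerPx;    % [mm]
minFDiamMm = s.MinFeretDiameter * mmPerPx;    % [mm]

fprintf('Area = %.2f mm^2, MaxF = %.2f mm, MinF = %.2f mm\n', ...
        areaMm2, maxFDiamMm, minFDiamMm);
```

For a circle the two Feret diameters should agree closely, which makes the disk a convenient check of the unit conversion before applying it to the patient-specific apertures.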

I do not know any other feature that characterizes the area defined by a polygonal.

Any suggestion is very welcome.

Thank you.

Best regards

dpb on 21 Aug 2019

I didn't say you could make the coefficients physically meaningful, only that you can assign arbitrary units to them such that the coefficient times the independent variable(s) ends up with units of the response. That's almost self evident as the response is in a given set of units so the prediction is reproducing those whatever the terms in the correlation are.

I also didn't say it was going to be easy (or even necessarily, possible) to develop a correlation given the data you have.

I would wonder which of the observations goes with which of the representative collimator shapes? I'd like to be able to compare those to which observation they generated.

Of those, there are two basic shapes, one basically an ellipsoid while the others are what I'd call a kidney-like shape as was the picture you sent earlier. A collection of those images with their associated response would be interesting.

I don't yet know whether can find a model that would predict these results or not -- probably could if made it specific enough with respect to each individual case but I still wonder if such would ever be of any value as far as drawing conclusions from regarding basic relationships.

Is it possible to take measurements with theoretical shapes without actually using patients so one could start with defined geometric shapes and then bring in the eccentricity factors? If so, I think I'd try to start with such a designed experiment where made very defined changes in shapes that are computable and classifiable and see if making changes there would produce predictable results. Then one could perturb those idealized shapes into approximations or the real ones and see how the results were affected. Just a thought--"you can't control what isn't controlled" and happenstance variables are the bane of statistics and modelling.

Maura E. Monville on 21 Aug 2019

I think I did not explain myself about the lack of physical meaning of the polynomial fit.

The polynomial is:

p1*x^7 + p2*x^6 + p3*x^5 + p4*x^4 + p5*x^3 + p6*x^2 + p7*x + p8

where

x = MaxFDiam * MinFDiam + Area / MaxFDiam + Area / MinFDiam + MaxFDiam / MinFDiam

x cannot be assigned units because

MaxFDiam * MinFDiam % has units [mm^2]

Area / MaxFDiam % has units [mm]

Area / MinFDiam % has units [mm]

MaxFDiam / MinFDiam % this term is unitless

What are the units of a sum of terms whose units are:

[mm^2] + [mm] + [mm] + [] = ???

The polynomial is a sum of powers of x ....

FSF (response variable) is unitless because it is the ratio of doses [Gy /Gy] = []

I agree on the need to relate the FSF to the proper observation.

I kept the order we followed when we measured the FSF. I agree that, to look for a relationship between the FSF and the collimator Area, it would be better to order the data by increasing Area. The same applies to each diameter.

The attached table2 shows all the FSF measurements with respect to each SOBP type and the relevant features of the collimator aperture.

Please, keep in mind that FSF is the radiation dose measured using the single collimator divided by the radiation dose measured using the Reference collimator.

As you can see, there are two standard, perfectly circular collimators, namely:

15mm (whose diameter is 15 [mm])

6mm (whose diameter is 6 [mm])

All the other collimators are patient-specific so their shape is forged to reproduce the contour of the tumour.

The attached table1 shows the measurements of the SOBP

Range = Start + Depth

So a SOBP can also be characterized by its Range.

Thank you for all your insight.

Kind regards

dpb on 21 Aug 2019

1) The 6 mm is the outlier so it doesn't help. The 15 mm is right in the middle of the responses outside those that are the five or so that are the "peak" values. The curious thing would be to try to isolate why those are outstanding.

2) So the 11,12,13 do correlate with the same sequence in the original dataset? I'll have to study that some. Still think it would be worthwhile to line up the images with the response to observe side by side what shape yields what response that don't have enough data to do yet here.

3) A seventh-order poly with 14 (and really should just be 13) points is well over-fitted--the goodness of the fit will be coming as much from the fact the solution is constrained so much as from the chosen model actually representing the functional form. As noted, it's still possible the other shapes I haven't seen are different enough that there could be a categorical variable to incorporate as a grouping variable rather than a quantitative one. May not be, too, but I'd want to investigate further in that direction.
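One way to check the over-fitting concern is leave-one-out cross-validation. The sketch below is only an illustration: it uses Area as a single predictor (the composite variable x needs the diameter vectors, which are not listed in this thread), drops the 6 mm outlier, and compares degrees 2 and 7, which are arbitrary choices.

```matlab
% Data as posted elsewhere in this thread
Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 ...
        159.29 201.80 220.96 271.45 282.36 327.04 277.54];
FSF  = [0.8672 1.0053 1.008 1.0142 1.0128 1.0142 1.0142 ...
        1.004 1.0062 1.0031 1.0018 1.0022 1.0022 1.0018];

keep = Area > 30;            % drop the 6 mm collimator (Area = 28.274 mm^2)
x = Area(keep);
y = FSF(keep);

degs = [2 7];                % illustrative polynomial degrees to compare
looRmse = zeros(size(degs));
for k = 1:numel(degs)
    res = zeros(size(x));
    for i = 1:numel(x)
        train = true(size(x));
        train(i) = false;    % hold out observation i
        % Centered and scaled fit (third output mu) to limit ill-conditioning
        [p, ~, mu] = polyfit(x(train), y(train), degs(k));
        res(i) = y(i) - polyval(p, x(i), [], mu);
    end
    looRmse(k) = sqrt(mean(res.^2));   % out-of-sample RMSE for this degree
end
disp(table(degs', looRmse', 'VariableNames', {'Degree', 'LOO_RMSE'}))
```

A model whose in-sample fit is excellent but whose held-out error is much larger than that of a simpler model is fitting the particular points rather than a functional form.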

I've not had the time to dig into the additional info in the pdf files as yet...have to go do some personal errands at the moment; maybe tonight could get back and look some more. It is an interesting problem and am just beginning to get enough to have a clue about it....

Maura E. Monville on 22 Aug 2019

Table2.jpeg

Please, find attached

the table with the collimator identifiers (AXXXXXX)
the zipped archive containing the collimator apertures in the form of text files

There are 3 standard collimators:

25mm is the REFERENCE used in the denominator of the formula to calculate the FSF; its data are shown in red in the table
15mm is a regular cylindrical collimator whose aperture is a perfect circle with 15[mm] diameter
6mm is a regular cylindrical collimator whose aperture is a perfect circle with 6[mm] diameter

There are 12 patient-specific collimators whose aperture shape is defined by a polygonal given through its (x,y) coordinates in a text file contained in the zipped archive. The link between the table and the collimator aperture (closed curve) is the collimator identifier "AXXXXXX", which is also the name of the text file containing the coordinates of the polygonal defining the collimator aperture.

This way the collimator aperture shape can be easily seen by plotting its (x,y) coordinates.
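A minimal sketch of reading and plotting one outline, and of computing a perimeter-based descriptor from it. The real files are not reproduced here, so the code first writes a hypothetical stand-in (an ellipse-like polygon) to a temporary text file; the file layout, the absence of header lines, and the circularity measure 4*pi*A/P^2 are all assumptions for illustration.

```matlab
% Hypothetical stand-in for one aperture file: an ellipse-like polygon [mm]
t  = linspace(0, 2*pi, 60)';
xy = [8*cos(t), 5*sin(t)];
fname = [tempname '.txt'];
writematrix(xy, fname);

% Read the (x,y) vertices; for files with a header, add 'NumHeaderLines', n
P = readmatrix(fname);
x = P(:,1);
y = P(:,2);

% Polygon area, perimeter of the closed outline, and circularity (1 for a circle)
A     = polyarea(x, y);
perim = sum(hypot(diff([x; x(1)]), diff([y; y(1)])));
circ  = 4*pi*A / perim^2;

% Plot the closed outline with equal axis scaling
figure
plot([x; x(1)], [y; y(1)], '-o')
axis equal
xlabel('x [mm]'), ylabel('y [mm]')
title(sprintf('Area = %.1f mm^2, circularity = %.2f', A, circ))

delete(fname);               % clean up the temporary file
```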

Thank you very much

Maura E. Monville on 23 Aug 2019

Archive.zip

Done.

I have cleaned the only two files, out of 12, containing the patient's name.

Now you can easily load the txt files into MATLAB and plot each of them.

Thank you

dpb on 23 Aug 2019

Ah! OK...I only opened and looked at the first one and presumed all the rest were the same...if they weren't the same number of header lines then I guess I will need to do it again. Mayhaps that's why some seemed to have gaps in the perimeters...


Answer 1

Jon on 21 Aug 2019


Direct link to this answer

https://es.mathworks.com/matlabcentral/answers/476749-nonlinear-fit-of-experimental-data#answer_388552

Edited: dpb on 23 Aug 2019

In your example you find a fit to a function of one variable, and are somehow looking for a combination of terms to form that one variable. Do you need to get it into this form or is it ok to have the predicted value, y be a function of two variables?

Assuming the latter, in case it is helpful I just tried a somewhat simplistic approach of considering the response to be a quadratic function of the two inputs MinFDiam and MaxFDiam.

Regarding motivation for choosing this form, I guess you could consider this to be a low-order Taylor series representation (one order up from linear, which I tried and which didn't fit very well). I'm not sure of the precise mathematical statement of this, but the general notion is that in a small enough region smooth functions are well approximated by just the low-order terms of the Taylor series, and in particular functions that show some curvature are well approximated by quadratics in a small enough region.
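Written out, the second-order Taylor expansion of a smooth response $f(x_1, x_2)$ about a point $a = (a_1, a_2)$ contains exactly a constant, the two linear terms, the cross term and the two squared terms. The expansion point is left generic here:

```latex
f(x_1, x_2) \approx f(a)
  + \sum_{i=1}^{2} \frac{\partial f}{\partial x_i}(a)\,(x_i - a_i)
  + \frac{1}{2} \sum_{i=1}^{2} \sum_{j=1}^{2}
      \frac{\partial^2 f}{\partial x_i \partial x_j}(a)\,(x_i - a_i)(x_j - a_j)
```

Expanding the products and collecting terms gives a model of the form $c_0 + c_1 x_1 + c_2 x_2 + c_3 x_1 x_2 + c_4 x_1^2 + c_5 x_2^2$, which is linear in the six unknown coefficients.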

I am not familiar with using fitnlm so I just used fitlm as follows

x1 = MinFDiam(:);
x2 = MaxFDiam(:);
y = Mk_Superf_FSF(:);
% Five regressors: x1, x2, x1*x2, x1^2, x2^2 (commas make the columns explicit)
mdl = fitlm([x1, x2, x1.*x2, x1.^2, x2.^2], y)

This gave the following statistics

mdl = 
Linear regression model:
    y ~ 1 + x1 + x2 + x3 + x4 + x5
Estimated Coefficients:
                    Estimate          SE         tStat        pValue  
                   ___________    __________    ________    __________
    (Intercept)        0.60854      0.075803       8.028    4.2585e-05
    x1                0.057919      0.025395      2.2808      0.052008
    x2               0.0015331      0.014779     0.10374       0.91993
    x3              0.00067369     0.0014413     0.46741       0.65268
    x4              -0.0024835     0.0012742     -1.9491      0.087118
    x5             -0.00031494    0.00071727    -0.43908       0.67222
Number of observations: 14, Error degrees of freedom: 8
Root Mean Squared Error: 0.0204
R-squared: 0.82,  Adjusted R-Squared: 0.708
F-statistic vs. constant model: 7.3, p-value = 0.00746

Which does not seem too bad.

16 comments

Jon on 22 Aug 2019

Edited: Jon on 22 Aug 2019

Looking at these p values we can see that the terms that include MaxFDiam have high p values which suggests that changes in MaxFDiam do not produce a large response in Mk_Superf_FSF. In other words it appears that your output is primarily driven by just the one variable MinFDiam.

Following up on that I tried fitting Mk_Superf_FSF = c2*MinFDiam^2 + c1*MinFDiam + c0

This had a similar R-squared as the previous fit above. Which is not surprising as the terms that involved MaxFDiam had not contributed much to the fit.

Continuing then to think of this as just fitting to the one variable MinFDiam, I looked at adding a cubic term

Mk_Superf_FSF = c3*MinFDiam^3 + c2*MinFDiam^2 + c1*MinFDiam + c0

as follows, which gave quite a high R-squared and low p values. So perhaps this is a useful model. Sorry when I copy and paste the MATLAB output it seems to wrap, but please run it yourself to see it better.

x1 = MinFDiam(:) 
x2 = MaxFDiam(:)
y = Mk_Superf_FSF(:)
mdl = fitlm([x1,x1.^2,x1.^3],y)

mdl = 
Linear regression model:
    y ~ 1 + x1 + x2 + x3
Estimated Coefficients:
                    Estimate          SE         tStat       pValue  
                   __________    __________    _______    __________
    (Intercept)       0.11948       0.06442     1.8547       0.09333
    x1                 0.2005      0.017762     11.288    5.1814e-07
    x2              -0.014639     0.0015451    -9.4745    2.6012e-06
    x3             0.00034586    4.2364e-05     8.1642    9.8515e-06
Number of observations: 14, Error degrees of freedom: 10
Root Mean Squared Error: 0.00677
R-squared: 0.975,  Adjusted R-Squared: 0.968
F-statistic vs. constant model: 131, p-value = 2.53e-08

Looking deeper into this, I plotted the resulting fit and original data to obtain.

Subjectively, this looks "overfit" to me.

I would suggest (as I think did @dpb) that you need to check if the response for MinFDiam=6 is an outlier. It clearly drives the fit below. If we eliminated that one point, it looks like the output doesn't even depend on MinFDiam.

In general, if possible, it is best to have some form of theoretical model that gives you an equation with some unknown coefficients. Then just use the regression to fit the unknown coefficients.
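As an illustration of that workflow, and of one common way to pick starting values, here is a sketch on synthetic data only (not the measurements from this thread). The saturating-exponential model y = c + a*exp(-b*x), the true values 1, 0.5 and 0.3, and the small deterministic wiggle are all assumptions made for the example; the starting values come from linearizing the model with a guessed asymptote.

```matlab
% Synthetic, illustrative data only
x = linspace(1, 10, 20)';
y = 1 + 0.5*exp(-0.3*x) + 1e-3*sin(7*x);   % small deterministic wiggle instead of noise

% Step 1: guess the asymptote c slightly below the smallest observation
c0 = min(y) - 0.05*(max(y) - min(y));

% Step 2: log(y - c0) is roughly linear in x, with slope -b and intercept log(a)
p  = polyfit(x, log(y - c0), 1);
b0 = -p(1);
a0 = exp(p(2));

% Step 3: refine all three parameters with a nonlinear fit
modelfun = @(beta, x) beta(1) + beta(2)*exp(-beta(3)*x);
mdl = fitnlm(x, y, modelfun, [c0 a0 b0], 'CoefficientNames', {'c', 'a', 'b'});
disp(mdl.Coefficients)
```

The same idea carries over to other model forms: fix the parameter that makes the model nonlinear at a rough guess, solve the remaining linear problem for the others, and hand the result to the nonlinear solver.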

Maura E. Monville on 23 Aug 2019

Edited: dpb on 23 Aug 2019

I confirm the 6mm is physically an outlier. It should not have been measured with the Markus ion chamber. That is stated in the report IAEA TRS-398 Code Of Practice.

Yesterday I uploaded the Table reporting all the measurements for all SOBPs. I also uploaded a zipped archive with the text files containing all the collimator aperture polygonal coordinates (x,y) for whoever wishes to see (plot) the collimator shapes.

The link between the Table and the polygonals is through the collimator identifier of form Axxxxxx (capital letter "A" followed by six integer positive digits).

I placed the Area followed by the three FSF vectors (one per SOBP) in a matrix and calculated the correlation coefficients. Surprisingly, the FSF from the Deep_SOBP has the highest correlation with the Area. However, the plot of the three FSF versus the Area shows that the FSF for the Superficial_SOBP has the best agreement.

Here is my code:

Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 159.29 201.80 220.96 271.45 282.36 327.04 277.54]; 
[AreaSrt, AreaInd] = sort(Area);
% SUPERFICIAL SOBP
Mk_Superf_FSF = [0.8672 1.0053 1.008 1.0142 1.0128 1.0142 1.0142 1.004 1.0062 1.0031 1.0018 1.0022 1.0022 1.0018];
Mk_Superf_FSFSrt = Mk_Superf_FSF(AreaInd);
R = corrcoef(AreaSrt', Mk_Superf_FSFSrt')
% INTERMEDIATE SOBP
Mk_Inter_FSF = [0.86664 1.0028 1.0098 1.0098 1.0123 1.0109 1.0102 1.0049 1.0042 1.0018 1.0007 1.0007 1 1.0014];                
Mk_Inter_FSFSrt = Mk_Inter_FSF(AreaInd);
R = corrcoef(AreaSrt',Mk_Inter_FSFSrt')
% DEEP SOBP
Mk_Deep_FSF = [0.84279 0.99719 0.99264 1.0003 1.0008 1.0001 0.99946  0.99913 1.0003 0.9993 0.99962 1.0001 0.99995  1.0004];                              
Mk_Deep_FSFSrt = Mk_Deep_FSF(AreaInd);
R = corrcoef(AreaSrt',Mk_Deep_FSFSrt')
% Place all vectors in matrix
M = [AreaSrt' Mk_Superf_FSFSrt' Mk_Inter_FSFSrt' Mk_Deep_FSFSrt'];
% COMPUTE MATRIX CORRELATION COEFFICIENTS
Rnew = corrcoef(M)

I get the following matrix of correlation coefficients showing that the highest correlation is between Area and the FSF for Deep_SOBP:

>> Rnew = corrcoef(M)
Rnew =
            1      0.36357      0.36585      0.48136
      0.36357            1      0.99896      0.99037
      0.36585      0.99896            1      0.99095
      0.48136      0.99037      0.99095            1
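Because the 6 mm collimator is a known outlier, it may dominate these coefficients. The sketch below recomputes the same matrix with and without that observation so the two can be compared; identifying the outlier as the smallest-area entry is an assumption based on the data above.

```matlab
% Data as posted above
Area = [28.274 176.71 71.32 97.45 116.21 103.22 119.63 ...
        159.29 201.80 220.96 271.45 282.36 327.04 277.54];
Mk_Superf_FSF = [0.8672 1.0053 1.008 1.0142 1.0128 1.0142 1.0142 ...
                 1.004 1.0062 1.0031 1.0018 1.0022 1.0022 1.0018];
Mk_Inter_FSF  = [0.86664 1.0028 1.0098 1.0098 1.0123 1.0109 1.0102 ...
                 1.0049 1.0042 1.0018 1.0007 1.0007 1 1.0014];
Mk_Deep_FSF   = [0.84279 0.99719 0.99264 1.0003 1.0008 1.0001 0.99946 ...
                 0.99913 1.0003 0.9993 0.99962 1.0001 0.99995 1.0004];

M = [Area' Mk_Superf_FSF' Mk_Inter_FSF' Mk_Deep_FSF'];

% Correlations using all 14 observations
Rall = corrcoef(M);

% Correlations without the 6 mm collimator (the smallest area)
keep  = Area ~= min(Area);
Rkeep = corrcoef(M(keep, :));

disp('Area vs. FSF (Superf, Inter, Deep), all points:')
disp(Rall(1, 2:4))
disp('Area vs. FSF (Superf, Inter, Deep), outlier removed:')
disp(Rkeep(1, 2:4))
```

Sorting is not needed here, since the correlation coefficient does not change when all columns are reordered together.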

dpb on 24 Aug 2019

Edited: dpb on 25 Aug 2019

"I think the[r]e is a physics explanation for the higher measurements."

And well may be but I think it highly unlikely that explanation is in the variables controlled/measured here.(*)

The MC simulation misses those specific points by far more than the others in a consistent direction so whatever it is isn't included in that model, either.

(*) And note that even if you were successful at building a model, by some magic transformation of variables or nonlinear curve-fitting stratagem, that did manage to fit the observations from these measurements, to infer that this would be the physical reason behind the values would be a gross misrepresentation of such a fit, even if you could make it happen with a set of coefficients with consistent units.

Maura E. Monville on 9 Sep 2019

We have more or less come up with a physical explanation.

The reason the dose measured at the center of the SOBP is highest for the Superficial SOBP, lower for the Intermediate SOBP, and lowest for the Deep SOBP is clearly beam attenuation, since the center of the SOBP is located deeper and deeper in the eye.

The asymptotic trend of the measurements towards 1 as the collimator area grows is possibly due to scattered radiation not reaching the SOBP center, and so not contributing to the measured dose, since it spreads out more laterally as the collimator aperture grows.

It is noteworthy that the energies used for ocular treatment are very low. Scattered radiation produced by the collimator has, of course, even lower energy.

I would like to thank everyone who has taken the time to look into my problem.

Thank you.

Sincerely,

Maura E. M.
