Welcome to my world

Here is my domain for splurging my ruminations on the STEM fields. Most of the stuff I discuss and research on this site is way beyond what we learn at school and what I am conventionally taught, so there may well be errors in my information or maths - please do not viciously troll the page with corrections, although constructive and useful criticism is of course welcome :)

Wednesday 31 August 2016

Does the harmonic series converge or diverge?

I have been doing some work (well a LOT of work) on brushing up a variety of precalculus and calculus topics (i.e. every one I can find on the website) on Khan Academy recently. This involved a huge amount of stuff I have done before, but usually with additional insights (such as logarithmic implicit differentiation) and new techniques (e.g. L'Hôpital's Rule for the two-sided limit of a function, and using the natural logarithm to turn exponential problems into rational functions for said rule to be applicable). But in my fascinatingly different learning on divergence/convergence tests for infinite series, something prickly came up.

As Sal Khan informs the viewer, quite correctly of course, there is a famous proof from French Medieval mathematician Nicole Oresme showing that the infinite harmonic series is divergent. It does this by way of the comparison divergence/convergence test. So first off, what is this test?

The direct comparison test involves the infinite series under test, SA, and a second infinite series SB. Every term of the series SA is smaller than the corresponding term in  SB, and all terms are non-negative. The statements are thus: if SB converges, then SA must also converge; if SA diverges, then SB must also diverge.

Both of these statements are intuitively easy to grasp. Upon reading this, I immediately questioned the use of 'must converge' versus 'is bounded', simply because the counterexample that sprang to mind was a sequence containing the sine or cosine functions, or any other type of oscillatory function such as one including the term (-1)^n - here it would be ok to say it is bounded if the first condition was satisfied, yet the very nature of the function would potentially stop a true limit ever being reached. However, although this is true, it would be a trivial case to use, since it defeats the premise of actually using the direct comparison test if you already know the function is divergent. And this is forgetting the possible exception that the oscillator is appended to a decaying function, in which case the problem would sort itself out by zeroing out as n approaches infinity. Nevertheless the test would still be useful for determining whether said function is bounded or not, in fairness.

So, moving onto the proof. Oresme took the series SA = 1 + 1/2 + 1/3 + 1/4 + ... (infinite harmonic series) and for each term found the largest power of 1/2 that was smaller than or equal to it. This led to this comparison:

SA = 1 + 1/2 + 1/3 + 1/4 + 1/5 + 1/6 + 1/7 + 1/8 + 1/9 + ...
SB = 1 + 1/2 + 1/4 + 1/4 + 1/8 + 1/8 + 1/8 + 1/8 + 1/16 + ...

Since each term in SB is smaller than or equal to the corresponding term in SA, it must be that the value of SA up to the nth term is at least as large as the value of SB at the same point (so here the harmonic series plays the role of the larger series in the test). So we need to determine the behaviour of SB in the limit as n stretches off to infinity, in order to perform the direct comparison test.

It is easy to notice that the terms in SB can be grouped in a very special way - there are two quarters, four eighths, eight sixteenths etc. This leads to the clearly divergent series:

SB = 1 + 1/2 + 1/2 + 1/2 + ...

Since this is divergent, it must also be that the harmonic series is divergent too. This is nicely illustrated on Desmos, with the caveat that functions based on discrete summations of integral-domained terms plot discrete line segments centered about each integer. So it looks like a bunch of lines when in reality it is a set of discrete points. Anyway the visual representation is sound:


It's funny that out of all of that, the thing I'm most proud of is that I have finally found a vaguely practical use for the floor function! What it is doing in the red line (representing SB in the diagram) is finding the exact value of the power 0.5 needs to be raised to to equal the corresponding term in the blue line, then rounding it up to the next integer value.
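The comparison terms can also be generated programmatically. The following is a minimal Python sketch, not the Desmos expression used in the plot (which is not reproduced here). The function name oresme_term is made up for illustration, and it swaps the logarithm-and-rounding step for Python's integer bit_length so that no floating-point error can creep in.

```python
from fractions import Fraction


def oresme_term(n: int) -> Fraction:
    """Largest power of 1/2 that is less than or equal to 1/n (n >= 1)."""
    # (n - 1).bit_length() equals ceil(log2(n)) for every integer n >= 1.
    exponent = (n - 1).bit_length()
    return Fraction(1, 2 ** exponent)


# First nine comparison terms, matching the SB line written out above.
terms = [oresme_term(n) for n in range(1, 10)]
print(", ".join(str(t) for t in terms))
# prints: 1, 1/2, 1/4, 1/4, 1/8, 1/8, 1/8, 1/8, 1/16
```

Using exact fractions rather than floats keeps every term exactly equal to a power of 1/2, which makes the comparison with 1/n easy to check.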

Ok so that looks pretty convincing - the red line diverges so it stops the blue line converging too. It just seems very counterintuitive though. Although failing the nth term divergence test doesn't tell you for sure that a series converges, it would seem highly logical that the harmonic series should converge since its terms will get infinitely small in the limit! So for me, I want even more argument to convince myself that this is true.

The next thing we can do is to analyse how quickly the comparison function, SB, is growing. Obviously the rate of change decreases as n increases, because the additional increments drop by a factor of 2 at a time. But the number of terms between each drop also increases by a factor of 2 each time (consider there are 2 terms of 1/4, 4 terms of 1/8, 8 terms of 1/16...). So, for example, when the gradient of the line is 1/128 (where 128 = 2^7), 2^6 < n ≤ 2^7, or more clearly 64 < n ≤ 128. More generally, we find that for some natural number k, the region in which the gradient is 1/2^k is 2^(k-1) < n ≤ 2^k. All this still doesn't help me reason this out though, because it would still suggest that as n → ∞, the gradient of the line is 1/∞ = 0.
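A quick numerical check makes the growth rate concrete. This sketch (the function name partial_sums is illustrative only) adds up both series exactly with fractions and looks at the partial sums after n = 2^k terms. Each run of equal terms in SB contributes exactly 1/2, so SB reaches 1 + k/2 after 2^k terms. The gradient does shrink towards zero, but the total still grows without bound, roughly like log2(n), and SA always stays at or above it.

```python
from fractions import Fraction


def partial_sums(n_terms: int):
    """Return (harmonic partial sum SA, comparison partial sum SB) after n_terms terms."""
    s_a = Fraction(0)
    s_b = Fraction(0)
    for n in range(1, n_terms + 1):
        s_a += Fraction(1, n)
        # Largest power of 1/2 not exceeding 1/n, as in Oresme's comparison.
        s_b += Fraction(1, 2 ** (n - 1).bit_length())
    return s_a, s_b


for k in range(0, 11):
    s_a, s_b = partial_sums(2 ** k)
    # Columns: k, number of terms, SA, SB, and the predicted value 1 + k/2.
    print(k, 2 ** k, round(float(s_a), 4), float(s_b), 1 + k / 2)
```

Doubling the number of terms only ever adds another 1/2 to SB, which is why the curve looks flat on a plot and yet never levels off.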

In conclusion, it is very difficult to condense the reasoning to anything more satisfying than a series comparison at my current level of knowledge. Keep studying then...

Some thoughts on the Laplacian method for solving differential equations

In the summer months, since the conclusion of my exams and work experience, I have been at a loss for normal summery things to do. Instead, I have taken this opportunity to work on some extra-curricular techniques and subject areas within maths and physics, just for a bit of fun. Clearly in general, since transforms, matrices and quantum probability are very much new topics for me, it has taken a large amount of read-learn-practice before I can come up with some useful projects to demonstrate my new skills. In the meantime, I will have a short discussion about something I have recently learned, called the Laplace Transform.

It is a name that I have heard every now and again from my maths teacher, and across my wider reading on the internet, yet only with nothing else to study for have I been brave enough to delve into the actual processes involved. In essence, the transform is often described in the physical world as taking a problem in the "time domain" and translating it into the "frequency domain"; conversely on a more abstract mathematical level it is merely a way to simplify differential problems into contrived algebraic ones, which may suit the number-cruncher better as it is a generally more consistent method for solving a range of differential equations, as opposed to the more guesswork-orientated method of characteristic-equation-then-particular-integral.

So how does it work? The Laplace transform of a function f(t) is found by multiplying it by the function e^(-st), then finding the definite integral of this product with respect to t, from zero to positive infinity. It seems strange when put as bluntly as that but, as soon as some examples are performed, it becomes clear how convenient multiplication by the seemingly arbitrary exponential term is. This is for two reasons: integration by parts is straightforward where dv/dt is substituted as e^(-st); and the exponential of a negative independent variable reduces to 0 or 1 at the limits. Using these key understandings, and a few examples from the internet to guide the method, I set out to prove a table of such transforms - with the exception of the convolution integrals at the very bottom I was able to do so, and took great pleasure and satisfaction in doing so! Here are a couple of good examples, which also introduced new functions to me which will come in useful in the future:





The second proof I did was perhaps blasé to choose as an example - in my black book where I have recorded all the written work from my work experience and personal study, I sequentially worked through simpler proofs, such as sin(at) and cos(at), before the moderately-challenging ones such as tcos(at). Yet I felt it would be unsatisfying to reference results such as "given that the Laplace transform of sin(at) is ...", so I have effectively completed three proofs in one during the second example (it is not difficult to infer from the working that L{sin(at)} = a/(s^2+a^2) and that, with a fulfilling sense of symmetry, L{cos(at)} = s/(s^2+a^2)). This symmetry does indeed somewhat extend into the mathematics of the order-one-polynomial-trig-function product, where L{tsin(at)} = 2as/(s^2+a^2)^2.
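These table entries can be sanity-checked numerically without redoing the integration by parts. The sketch below is only an illustration: the helper names simpson and laplace_numeric, the cut-off t_max and the values s = 2, a = 3 are all arbitrary choices, and the standard result L{tcos(at)} = (s^2-a^2)/(s^2+a^2)^2 is included for comparison. It approximates the defining integral directly and prints it beside each closed form.

```python
import math


def simpson(f, lo, hi, n=20000):
    """Composite Simpson's rule on [lo, hi] with n (even) sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


def laplace_numeric(f, s, t_max=60.0):
    """Approximate the integral of e^(-st) f(t) from 0 to infinity.

    The upper limit is truncated at t_max, which is harmless here because
    e^(-s * t_max) is vanishingly small for the s used below.
    """
    return simpson(lambda t: math.exp(-s * t) * f(t), 0.0, t_max)


s, a = 2.0, 3.0  # arbitrary test values with s > 0
table = {
    "sin(at)": (lambda t: math.sin(a * t), a / (s**2 + a**2)),
    "cos(at)": (lambda t: math.cos(a * t), s / (s**2 + a**2)),
    "t sin(at)": (lambda t: t * math.sin(a * t), 2 * a * s / (s**2 + a**2) ** 2),
    "t cos(at)": (lambda t: t * math.cos(a * t), (s**2 - a**2) / (s**2 + a**2) ** 2),
}

for name, (f, exact) in table.items():
    print(f"{name:10s} numeric = {laplace_numeric(f, s):.8f}  closed form = {exact:.8f}")
```

Agreement at a single pair (s, a) proves nothing, of course, but it is a cheap way to catch a sign slip or a missing factor in a hand derivation.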

Moving swiftly on... One of the more important conceptual implications of the Laplace Transform, which helps to link mathematical abstraction with the physical world, is to see what happens when the operator is applied to the simplest functions. For f(t) = 1 one obtains 1/s, and for f(t) = t one obtains 1/s^2, so each power of t in the time domain corresponds to an inverse power of s in the transformed domain, nicely in keeping with the relation frequency ∝ time^-1. Hence it is clear to see how useful such a relation might be, for example, in the field of electrical engineering as one tracks electromagnetic oscillations over time in a differential perspective.

Okay so now I know how Laplace Transforms work, a little bit of context on the kind of subject areas they may be useful in, and furthermore how to compute them. Now it's time to actually put them to their main use, which is for solving differential equations. Here is an example of such a differential equation - notice how I first solve it using the method more familiar to me, that of the characteristic equation and particular integral amalgamation, then contrast it with the Laplacian approach. The new method certainly seems more contrived in this simplistic case:


In this case the two solutions are very similar in length, but the algebra is undoubtedly more involved in the Laplacian version - the partial fraction decomposition in particular adds unnecessary complexity to the working. But observe how the initial conditions are worked nicely into the solution, making it more coherent, whereas in the familiar method they are used to solve for unknown constants which travel through the solution unresolved until the very end. So overall there is no real improvement using the transforms. But let's attempt a more complicated example. First, the auxiliary method...

(note that I say 'displaced from equilibrium' but I mean 'displaced from natural length of the spring' - this also applies to the first example I did)


Then, the Laplacian method:






It is very difficult to see if these two solutions are equivalent, since they use different combinations of overarching constants. However I plotted them on Desmos, as can be found here. From there it is clear that they do not agree, yet one is considerably more likely to be correct - that is the blue line, the one that passes through the point (0,x0) whereas the other line does not; interestingly this is the solution from the Laplace transforms! I could go back through the first method and find the mistake but I am satisfied that the relative ease of the second method has been demonstrated, so it would be pretty unnecessary.

The point of that was to show that as problems get more complicated, the Laplace Transform comes into its own. But complicated in a very specific way - the auxiliary method does not become much more difficult as the order of the differential equation increases, as long as the characteristic equation can be factorised without too much trouble, yet the Laplace method will since each expansion of a transformed derivative increases the number of terms in the algebraic result (L{x''} produces 3 terms, L{x'''} produces 4 terms etc.). No, what matters is that the non-homogeneous f(t) element of the equation becomes more exotic - this is what causes so much trouble in finding the particular integral, but it hardly makes the Laplace method more difficult because only some partial fraction manipulation is required to separate it off into various sines, cosines, exponentials and combinations of those. Therefore Laplace Transforms are most effective, at the level I understand them, for solving low-order non-homogeneous ODEs with polyfunctional f(t).

But there is more to the Laplace Transform than this, which is but an entry level understanding of how to apply them in linear situations. In the first example there was no non-conservative air resistance to waste mechanical energy, so oscillation extended indefinitely towards t = +∞, yet in the second there was an exponential decay in the amplitude of the oscillation. This appeared in the s-domain as the given trigonometric function transform, but s-shifted by a constant - and hence was a use of the s-shift Laplace Transform, one of the most useful ones in the table. There is a very similar t-shift, where the exponential function appears as a multiplier in the s-domain - in the t-domain this takes the form of u_c(t)·f(t-c), where u_c(t) is the unit step function at c.
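Both shift rules can be checked numerically in the same spirit. The sketch below assumes the standard statements of the two rules, L{e^(-ct) f(t)} = F(s+c) and L{u_c(t) f(t-c)} = e^(-cs) F(s), and tests them on f(t) = sin(at). The helper names and the values of s, a and c are arbitrary choices for illustration, not taken from the worked examples.

```python
import math


def simpson(f, lo, hi, n=20000):
    """Composite Simpson's rule on [lo, hi] with n (even) sub-intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3


def transform(f, s, start=0.0, t_max=60.0):
    """Integral of e^(-st) f(t) from start to t_max (a stand-in for infinity)."""
    return simpson(lambda t: math.exp(-s * t) * f(t), start, t_max)


s, a, c = 2.0, 3.0, 1.5  # arbitrary positive test values


def F(s_value):
    """Closed-form transform of sin(at): a / (s^2 + a^2)."""
    return a / (s_value**2 + a**2)


# s-shift: a decaying envelope e^(-ct) in the t-domain shifts s to s + c.
s_shift_numeric = transform(lambda t: math.exp(-c * t) * math.sin(a * t), s)
s_shift_exact = F(s + c)

# t-shift: u_c(t) f(t - c) is zero before t = c, so the integral starts at c.
t_shift_numeric = transform(lambda t: math.sin(a * (t - c)), s, start=c)
t_shift_exact = math.exp(-c * s) * F(s)

print("s-shift:", s_shift_numeric, s_shift_exact)
print("t-shift:", t_shift_numeric, t_shift_exact)
```

Starting the second integral at t = c is just a way of applying the unit step without asking the integrator to cope with a kink in the middle of the range.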

But the versatility doesn't stop there. The issue I have at the moment is that I cannot progress beyond dealing with linear ODEs, since I have not yet learned how to perform convolution integrals (these produce a product of functions in the s-domain) or transforms of a product of two functions. All things to look forward to learning in the future I suppose!

Monday 13 June 2016

Analysing water in a wine glass

The realm of 3-D graphing is fairly new to me, and I have only really learnt what I know through extended research on the web - it is hardly touched upon in conventional A-level maths at all. One evening I decided to test exactly what I know, and what I could possibly learn, to model a seemingly very simple physical situation: liquid in a wine glass under gravity.

First, I assumed that the wine glass is a paraboloid in shape - this is a parabola of the form y = ax^2 that has been rotated around the y-axis (I don't really see the point in the more general format of rotating y^2 = 2ax about the z-axis myself, and the horizontal orientation would make the situation far less convincing in any case) to form a, well, 'wine glass shape'! Then, I took a 3-D plane of the form y = xtan(θ) and analysed the intersections it makes with the paraboloid - this plane effectively models the surface of the liquid.

The development of this short project is seen below. The part I am most proud of is that I have learned how to integrate under a surface within a circular region using polar coordinates, which is much simpler than the traditional rectangular coordinate method, requiring multiple trigonometric substitutions to achieve the same result. Anyway, enjoy reading. For the first time I know someone is reading this, so I hope it is all correct and logical to follow...
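The polar-coordinate idea can be sketched numerically under some stated assumptions, since the handwritten working is not reproduced here. Take the glass as y = a(x^2 + z^2) and the liquid surface as the plane y = mx + c with m = tan(θ); the intercept c and all function names are assumptions added for this sketch. Setting the two equal and completing the square shows that the intersection projects onto the xz-plane as a circle centred at x = m/(2a) with R^2 = c/a + m^2/(4a^2). In polar coordinates about that centre the depth of liquid is simply a(R^2 - ρ^2), which integrates to a volume of πaR^4/2.

```python
import math


def liquid_volume(a, m, c):
    """Closed form: volume between the plane y = m*x + c and y = a*(x^2 + z^2)."""
    r_sq = c / a + (m / (2 * a)) ** 2
    if r_sq <= 0:
        return 0.0  # the plane misses the glass entirely
    return math.pi * a * r_sq**2 / 2


def liquid_volume_numeric(a, m, c, steps=400):
    """Midpoint-rule double integral in polar coordinates about the circle's centre."""
    r_sq = c / a + (m / (2 * a)) ** 2
    if r_sq <= 0:
        return 0.0
    radius = math.sqrt(r_sq)
    x_centre = m / (2 * a)
    d_rho = radius / steps
    d_phi = 2 * math.pi / steps
    total = 0.0
    for i in range(steps):
        rho = (i + 0.5) * d_rho
        for j in range(steps):
            phi = (j + 0.5) * d_phi
            x = x_centre + rho * math.cos(phi)
            z = rho * math.sin(phi)
            depth = (m * x + c) - a * (x * x + z * z)  # plane minus glass
            total += depth * rho * d_rho * d_phi       # area element is rho d_rho d_phi
    return total


a, theta, c = 1.0, math.radians(30), 0.5  # illustrative values only
m = math.tan(theta)
print(liquid_volume(a, m, c), liquid_volume_numeric(a, m, c))
```

The model glass here has no rim, so the sketch says nothing about when liquid actually spills; it only compares the closed-form volume with a brute-force sum.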

The gameplan for my method might be useful:

 Now, the bulk of the maths:


The implications of these formulae are quite powerful really, but it is quite difficult to draw using Desmos - I have not yet worked out how to parametrically add the z-axis into the 2-D plotter. However, I have done a couple of graphs on the online graphing calculator: http://tinyurl.com/zgsvzc8. The angle is changed with the slider 'T' between -π/2 and π/2, and the volume of liquid and shape of the glass are also altered with their respective sliders.

Seeing the changing shape of the surface of the liquid (red) and the cross-sectional profile of the glass at the same time as the angle of tilt changes is very revealing - the red ellipse disappears at the point where the surface no longer straddles the minimum point of the glass (the bottom, at the origin): although real wine glasses actually have a slight curve back in towards the top, a quadratic curve has no such 'lip'. Therefore as soon as the liquid is all on one wall of the glass, it is poured out!

Please bear in mind that I have caused the surface of the liquid to rotate instead of the glass itself because it is much simpler to do. Therefore the graph is from the perspective of the glass instead of the Earth's gravitational field!

Wednesday 11 May 2016

Finding the length of a section of a curve

This morning I was set a new problem, which I feel I will be working on for a fair while - what is the volume of a gas-filled pillowcase shape (two rectangles stuck together around the edges, and inflated to the maximum possible volume)? I already have my suspicions as to how the 3-D surface of a half-pillowcase shape could be modelled as a function - a 2-D function needs to be found which, for a given length of the curve (the length of one of the rectangles making up the pillowcase), maximises the area under it. Using a bit of intuition, this function should rise and fall very quickly at its ends, for example y = |√(x+5)|, and have a relatively high height inbetweentimes. This leads me to common naturally-occurring functions such as hyperbolae, catenaries or even quadratics! 

The first key bit of information needed is a method for calculating the length of a section of a curve. I have done this today, generalising the method up to an integral about halfway down; from there I took the quadratic example forward, since it turns out pretty neatly:
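The general arc-length integral is the integral of sqrt(1 + (dy/dx)^2) with respect to x, and for a quadratic it has a closed form. The sketch below assumes the curve y = ax^2 measured from x = 0 to x = b, which may not be the exact quadratic used in the written working, and compares the closed form with a direct numerical integration. The function names are illustrative.

```python
import math


def arc_length_numeric(dy_dx, x_start, x_end, n=10000):
    """Simpson's rule applied to sqrt(1 + (dy/dx)^2) over [x_start, x_end]."""
    def integrand(x):
        return math.sqrt(1 + dy_dx(x) ** 2)

    h = (x_end - x_start) / n
    total = integrand(x_start) + integrand(x_end)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(x_start + i * h)
    return total * h / 3


def arc_length_parabola(a, b):
    """Closed-form length of y = a*x^2 from x = 0 to x = b (a > 0)."""
    return (b / 2) * math.sqrt(1 + 4 * a**2 * b**2) + math.asinh(2 * a * b) / (4 * a)


a, b = 1.0, 2.0  # illustrative values
print(arc_length_numeric(lambda x: 2 * a * x, 0.0, b), arc_length_parabola(a, b))
```

The numerical routine takes any derivative function, so the same check extends to catenaries or other candidate curves where no tidy closed form is to hand.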

Tuesday 10 May 2016

Calculus in a simple resistance problem

The problem goes as follows: a resistor, resistance R, is connected in series with a battery, emf ε and internal resistance r; find the maximum power dissipation in the resistor R, and the resistance that results in this output.

Although it is quite clear how this can be solved with algebra, some thought about the theory behind Ohm's Law and Kirchhoff's Second Law is required to set the answer in some context: since the power dissipation in the resistor is equal to I^2·R, one would think that raising the current is more influential on the output than raising the resistance. However, since the current drawn from the battery is intrinsically linked to the total resistance of the circuit, there is a delicate balance to be struck.
The nature of this balance is difficult to ascertain without calculation though, so without further ado:


The result is that the power dissipated across the resistor is a maximum (assumed from the context of the function whose stationary point is found, since only one positive-R stationary point exists) when R = r. This nicely helps to explain why a short-circuited battery is likely to heat up, catch fire or explode - the short-circuit has a very low resistance, so overall a massive current is drawn which causes the wire to heat up. The wire will heat up the most when the resistance of the wire matches the internal resistance of the battery. The overall implication is that, if one was designing an electric resistance heater to emit the maximum heat output for any given emf, the length of wire coiled up should, via the resistivity equation, reflect the internal resistance of the power supply.
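The result is easy to confirm by brute force. With current I = ε/(R + r), the power in the resistor is P(R) = ε^2·R/(R + r)^2, whose derivative ε^2·(r - R)/(R + r)^3 vanishes only at R = r, giving a maximum of ε^2/(4r). The sketch below scans a range of load resistances; the emf and internal resistance are made-up illustrative values and the function name is hypothetical.

```python
def load_power(emf, r_internal, r_load):
    """Power dissipated in the external resistor: I^2 * R with I = emf / (R + r)."""
    current = emf / (r_internal + r_load)
    return current**2 * r_load


emf, r_internal = 12.0, 0.5  # volts and ohms, illustrative values only

# Scan load resistances from 0.001 to 5.000 ohms in 0.001 ohm steps.
candidates = [i / 1000 for i in range(1, 5001)]
best = max(candidates, key=lambda r_load: load_power(emf, r_internal, r_load))

print("best R      =", best)
print("P at best R =", load_power(emf, r_internal, best))
print("emf^2/(4r)  =", emf**2 / (4 * r_internal))
```

The scan is crude but makes the shape of the trade-off visible: power climbs steeply for R below r and falls away slowly above it.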

However, a separate issue is maximising the efficiency of the circuit in terms of energy - for environmental and financial reasons it is always of high priority to do this in power-hungry domestic appliances, and the heater is no exception. Since the efficiency of the circuit is the ratio of power dissipation in the resistor R to the total power dissipation of the circuit, maximum efficiency would be achieved when the resistance of R is infinitely high. The issue with this is, just like trying to connect a 12V power pack across a piece of plastic, you cannot expect any power to be generated at all - this is because the high resistance results in a tiny current being drawn from the battery and in any case, the power output would be highly suboptimal according to the result I derived above. The moral of the story for the heater is that the most important factor here is to cut the internal resistance of the power supply as much as possible.
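The tension between efficiency and output can be put into numbers. Taking efficiency to mean the fraction of the total power that ends up in the external resistor R, the same current flows through both resistances, so the ratio is simply R/(R + r). The sketch below uses made-up values for the emf and internal resistance, and hypothetical function names, to tabulate efficiency against useful power.

```python
def efficiency(r_internal, r_load):
    """Fraction of total dissipated power that appears in the external resistor."""
    return r_load / (r_internal + r_load)


def load_power(emf, r_internal, r_load):
    """Power dissipated in the external resistor."""
    return (emf / (r_internal + r_load)) ** 2 * r_load


emf, r_internal = 12.0, 0.5  # volts and ohms, illustrative values only

for r_load in (0.5, 5.0, 50.0, 500.0, 5000.0):
    print(
        f"R = {r_load:7.1f} ohm  "
        f"efficiency = {efficiency(r_internal, r_load):.4f}  "
        f"power = {load_power(emf, r_internal, r_load):.4f} W"
    )
```

At the maximum-power point R = r the efficiency is only one half, and pushing efficiency towards 1 by raising R drives the useful power towards zero, which is why reducing r is the more productive lever.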

If the heater is being connected to the mains power, this opens an entirely new can of worms, namely the definition of the National Grid's internal resistance. This is a combination of the cumulative resistance of electrical cabling across the country, inefficiencies in combustion of fuels in power stations, friction in turbine halls, and delayed responses in adjusting power distribution across the country following surges and drops in energy demand, amongst other more subtle factors. Nevertheless more simple domestic factors to consider are reducing usage of extension leads where possible, and keeping the heater cable well insulated so that its resistance doesn't itself rise with time!

Monday 9 May 2016

A curiosity - the reciprocal triangle

I was absent-mindedly solving simple calculus problems on Brilliant.org this afternoon when I stumbled across an interesting one about the reciprocal function. The question asked the user to prove that the area enclosed by the coordinate axes and the tangent to the curve at any point on the curve is constant, and to find this area. This is a basic proof of this unusual property:


The logical extension of this observation is to see what happens when the function is manipulated. First, imagine a reciprocal function of order n:


This sets nicely into context how unique the basic reciprocal function is - the k in the numerator will only be cancelled when n = 1.

Next, consider a function with a linear stretch factor a in the y-direction (which, due to the symmetry of the function, is the same as a stretch factor 1/a in the x-direction):


This is of mild interest too - the original problem took a = 1, such that the area of the triangle was 2.
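The constant-area property and its two variations so far can be checked with a few lines of code. This is a sketch under assumed notation, which may differ from the letters used in the written working: the curve is y = a/x^n, and p is the x-coordinate of the point of tangency. The tangent at p has slope -na/p^(n+1), and the triangle is formed by its two axis intercepts.

```python
def tangent_triangle_area(p, n=1, a=1.0):
    """Area enclosed by the axes and the tangent to y = a / x**n at x = p > 0."""
    y = a / p**n
    slope = -n * a / p ** (n + 1)
    x_intercept = p - y / slope  # where the tangent meets y = 0
    y_intercept = y - slope * p  # where the tangent meets x = 0
    return 0.5 * x_intercept * y_intercept


for p in (0.5, 1.0, 3.0):
    print(
        p,
        tangent_triangle_area(p),         # y = 1/x
        tangent_triangle_area(p, a=3.0),  # y = 3/x
        tangent_triangle_area(p, n=2),    # y = 1/x^2
    )
```

For n = 1 the area comes out as 2a whatever the value of p, whereas for n = 2 it changes with p, which is the sense in which the basic reciprocal is special.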

Finally, consider a function translated by a units in the positive x-direction, and b units in the positive y-direction:


Clearly, as the sketch demonstrates, the translation of the curve disrupts the symmetry of the curve about the x and y axes, such that there is no longer a triangle of constant area. Trivially, it could easily be proven that there is a region of constant area, with the area under the tangent within the limits x > a and y > b, since these asymptotes effectively become the new x and y axes within the translated reference frame of the new curve. Furthermore it can be seen that the constant-area is reinstated when a = b = 0, such that k cancels out in the numerator and denominator to leave the previous area of 2.

Sunday 8 May 2016

The hanging cable - the intrepid part 2

I mentioned at the end of my post yesterday (see "The Hanging Cable") that I might be able to optimise the maximum height of the cable by adding a taper. The aim of today, aside from the fairly mundane past papers I had scheduled, was to model this as best I could. The result of the monster integral, which I have verified using Wolfram Alpha, is a complicated function in Hmax which I don't believe can be solved by conventional algebraic means. Nevertheless, the working is fairly satisfying, if I have made it clear at all, and there are several interesting qualitative conclusions that can be drawn from the model.

It is important to note that I have not repeated the first-principles derivation of this method below. For this, see here.



The function at the bottom is equated to zero to indicate how the required root is to be found. I have entered it into Desmos graphing calculator here, where it is easy to read off the x-intercept as the value of Hmax for the material. The sample data on it is for 2800 maraging steel; when a taper angle of 1° (≈ 0.0175 rad) is selected, a clearly too large but a good arbitrary starting figure, the graph of y = f(Hmax) looks like this:


This shows that the theoretical maximum length of cable, fitting this specification, is 7288km, a pretty awesome distance. However, for a little perspective, let's consider how fat the cable will be at the very top, where it is hanging from its superlatively steadfast loop:

0.5d = Htanθ = 7.288·10^6 · tan(0.0175 rad) = 127553.0213 m ≈ 128 km (3sf)

If this doesn't seem unwieldy and inappropriate enough, next consider the volume and mass of this so-called 'cable':

V = (1/3)π·Hmax^3·tan^2(θ) = (1/3)π · (7.288·10^6)^3 · tan^2(0.0175 rad) = 1.241705149·10^17 ≈ 1.24·10^17 m^3 (3sf)
m = ρV = 1.24·10^17 · 8100 = 1.005781171·10^21 ≈ 1.01·10^21 kg (3sf)

This cable therefore makes up 0.0114% of the Earth's volume, enough to excavate the Grand Canyon nearly 30000 times (using the common estimate of 5.45 trillion cubic yards, where 1 yard = 0.9144 metres), and 0.0169% of the Earth's mass! Now, for the worst part, consider the estimated cost of this object, with the assumption that all the forges in the world could produce enough maraging steel between them. It is difficult to find a precise figure for the price of any particular variety, but in general the world steel price is about $60 per tonne, or 6 cents per kilogram:

Cost = 0.06·m ≈ $(6.06·10^19)

This cost, just over 60 quintillion US dollars or 42.1 quintillion GBP, would be enough to pay off the £1.56 trillion UK national debt nearly 27 million times!
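The chain of arithmetic above is easy to reproduce. The sketch below uses only the figures quoted in the post (Hmax = 7288 km, a taper angle of 0.0175 rad, a density of 8100 kg/m^3 and a price of 6 cents per kilogram); the variable names are illustrative. One small difference to expect: the post rounds the mass to 1.01·10^21 kg before pricing it, so its cost of 6.06·10^19 sits slightly above the unrounded figure of roughly 6.03·10^19 computed here.

```python
import math

H_MAX = 7.288e6      # maximum cable length in metres, read off the graph
THETA = 0.0175       # taper half-angle in radians (about 1 degree)
DENSITY = 8100       # kg per cubic metre, figure used in the post
PRICE_PER_KG = 0.06  # US dollars per kilogram, figure used in the post

top_radius = H_MAX * math.tan(THETA)                    # metres
volume = math.pi * H_MAX**3 * math.tan(THETA) ** 2 / 3  # cone volume, cubic metres
mass = DENSITY * volume                                 # kilograms
cost = PRICE_PER_KG * mass                              # US dollars

print(f"radius at top = {top_radius:.4f} m")
print(f"volume        = {volume:.6e} m^3")
print(f"mass          = {mass:.6e} kg")
print(f"cost          = {cost:.6e} USD")
```

Keeping full precision until the final step, and rounding only for display, avoids the small drift that comes from rounding intermediate values.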

In essence what I am trying to illustrate is that a 1 degree taper is a truly ridiculous idea for extending the snapping length of the cable, even though it does a very good job of doing so. Let's see what happens as the angle is changed...

Since I am using graphical means to solve f(Hmax) = 0, I can see no way to easily find the equation for a graph of Hmax against taper angle. However some empirical experimentation with Desmos shows that the smaller the taper angle, the greater the value of Hmax. This is rather counterintuitive - the smaller the taper angle, the closer this model should come to the model established in the previous article.

Nevertheless I reckon I have found a possible flaw: since the cable starts with a radius of zero, having such a small taper means it is pretty much non-existent for the first few kilometres. I see this as analogous to the critical assembly of a system - just like ants can support weights disproportionate to their own mass due to their tiny size, having such a microscopic cable allows very disproportionate behaviours to occur in comparison with the macroscopic world. This idea doesn't entirely explain away the fact that the previous model completely decoupled the value of Hmax from the diameter of the cable, but I'm working on that bit! Perhaps the cable could instead be modelled as a frustum, with a significant radius at the bottom?